Yongheng Zhang

Tencent

World Models

Large Language Models

Reasoning & Chain-of-Thought

About

I am Yongheng Zhang, a Vibe Researcher. Currently, my research focuses on World Models and Large Language Models, with specific interests as follows:

🌏World Models: The goal of World Models for Video Understanding and Generation is to build agents that perceive, reason, and predict through dynamic visual experiences. By learning structured video representations, they enable accurate understanding, long-term prediction, and creative generation, paving the way for intelligent interaction and real-world simulation.
✈️Large Language Models: Large Language Model Reasoning aims to empower language models with structured and logical thinking beyond pattern matching. By enabling multi-step inference and problem decomposition, it enhances reliability and interpretability, supporting complex decision-making, scientific discovery, and human-level cognition.

If you are interested in my research, feel free to contact me:

Email: zyhbrz Wechat: NHistory

Youtu Lab, Tencent, Shanghai, China

Qingyun Plan, Research Intern

2026.03 - Present

Tencent, Beijing, China

Research Intern

2025.09 - 2026.01

2026-03

🎉🎉 I have joined the Youtu Lab @ Tencent.

2025-11

🔥🔥 Our DaP-ICoT is accepted by AAAI 2026.

2025-10

🎉🎉 I have been awarded the National Scholarship, China, TOP 0.2%.

2025-09

🎉🎉 I have been awarded the First-Class Academic Scholarship, Ranking TOP 1!

2025-09

🎉🎉 I have joined the Basic Algorithm Center, PCG @ Tencent.

2025-07

🔥🔥 Our ViTCoT is accepted by ACM MM 2025 (Oral).

2025-05

🔥🔥 Our CCHall is accepted by ACL 2025 Main.

2025-03

🎉🎉 Our MDCoT is accepted by ICME 2025 (Oral).

2024-10

🎉🎉 Our WoT is accepted by EMNLP 2024 Findings.

2024-08

🎉🎉 Our S3Agent is accepted by ToMM.

2024-05

🎉🎉 Our Auto-CAP is accepted by ACL 2024 Findings.

2023-12

🎉🎉 Our LabCLIP is accepted by ICASSP 2024.