About
I am Yongheng Zhang, a Vibe Researcher. Currently, my research focuses on World Models and Large Language Models, with specific interests as follows:
- ๐World Models: The goal of World Models for Video Understanding and Generation is to build agents that perceive, reason, and predict through dynamic visual experiences. By learning structured video representations, they enable accurate understanding, long-term prediction, and creative generation, paving the way for intelligent interaction and real-world simulation.
- โ๏ธLarge Language Models: Large Language Model Reasoning aims to empower language models with structured and logical thinking beyond pattern matching. By enabling multi-step inference and problem decomposition, it enhances reliability and interpretability, supporting complex decision-making, scientific discovery, and human-level cognition.
If you are interested in my research, feel free to contact me:
Email: zyhbrz ย ย Wechat: NHistory
Research Experience
News
๐๐ I have joined the Youtu Lab @ Tencent.
๐ฅ๐ฅ Our DaP-ICoT is accepted by AAAI 2026.
๐๐ I have been awarded the National Scholarship, China, TOP 0.2%.
๐๐ I have been awarded the First-Class Academic Scholarship, Ranking TOP 1!
๐๐ I have joined the Basic Algorithm Center, PCG @ Tencent.
๐ฅ๐ฅ Our ViTCoT is accepted by ACM MM 2025 (Oral).
๐ฅ๐ฅ Our CCHall is accepted by ACL 2025 Main.
๐๐ Our MDCoT is accepted by ICME 2025 (Oral).
๐๐ Our WoT is accepted by EMNLP 2024 Findings.
๐๐ Our S3Agent is accepted by ToMM.
๐๐ Our Auto-CAP is accepted by ACL 2024 Findings.
๐๐ Our LabCLIP is accepted by ICASSP 2024.
