Publications
A collection of my research work.
ViTCoT: Video-Text Interleaved Chain-of-Thought for Boosting Video Understanding in Large Language Models
Yongheng Zhang*, Xu Liu*, Ruihan Tao, Qiguang Chen, Hao Fei, Wanxiang Che, Libo Qin
ACM MM 2025 (Oral)
A novel video reasoning paradigm that enables intuitive aligned reasoning through video-text interleaved CoT.
CCHall: A Novel Benchmark for Joint Cross-Lingual and Cross-Modal Hallucinations Detection in Large Language Models
Yongheng Zhang*, Xu Liu*, Ruoxi Zhou, Qiguang Chen, Hao Fei, Wenpeng Lu, Libo Qin
ACL 2025 (Main)
A benchmark for detecting cross-lingual and cross-modal hallucinations in LLMs.
Let's Think with Images Efficiently! An Interleaved-Modal Chain-of-Thought Reasoning Framework
Yongheng Zhang*, Xu Liu*, Qiguang Chen, Yao Li, Sheng Wang, Libo Qin
AAAI 2026
An efficient interleaved-modal chain-of-thought reasoning framework with dynamic and precise visual thoughts.
AutoCAP: Towards Automatic Cross-lingual Alignment Planning for Zero-shot Chain-of-Thought
Yongheng Zhang*, Qiguang Chen*, Min Li, Wanxiang Che, Libo Qin
ACL 2024 (Findings)
A method for automatic cross-lingual alignment planning enabling zero-shot chain-of-thought across languages.
Wrong-of-Thought: An Integrated Reasoning Framework with Multi-Perspective Verification and Wrong Information
Yongheng Zhang, Qiguang Chen, Jingxuan Zhou, Peng Wang, Jiasheng Si, Jin Wang, Wenpeng Lu, Libo Qin
EMNLP 2024 (Findings)
A reasoning framework that integrates multi-perspective verification and leverages wrong information to improve reasoning.
LabCLIP: Label-Enhanced Clip for Improving Zero-Shot Text Classification
Yongheng Zhang, Peng Wang, Qiguang Chen, Jingxuan Zhou, Yongmei Wang, Min Li, Libo Qin
ICASSP 2024
A label-enhanced CLIP approach for zero-shot text classification that considers both visual image and text label semantic information.
MDCoT: Medical Diagnosis Chain-of-Thought with Self-Diagnostic Refinement for Alzheimer's Disease
Chunlin Lu*, Yongheng Zhang*, Peng Wang*, Wenpeng Lu, Libo Qin
ICME 2025 (Oral)
A chain-of-thought medical diagnosis framework with self-diagnostic refinement for Alzheimer's disease.
LAMA-AD: Label-Aware Multi-Agent Alzheimer's Disease Diagnosis with Counterfactual Reasoning
Chunlin Lu*, Yongheng Zhang*, Yao Li, Sheng Wang, Libo Qin
BIBM 2025 (Oral)
A multi-agent system with counterfactual reasoning for Alzheimer's disease diagnosis.
S3 Agent: Unlocking the Power of VLLM for Zero-Shot Multi-modal Sarcasm Detection
Peng Wang, Yongheng Zhang, Hao Fei, Qiguang Chen, Yukai Wang, Jiasheng Si, Wenpeng Lu, Min Li, Libo Qin
ToMM 2024
A multi-view agent framework leveraging vision-language models for zero-shot multi-modal sarcasm detection.
MvDDI: A Multi-view Interaction Framework for Few-shot Drug-Drug Interaction
Zihao Mao, Qiguang Chen, Yongheng Zhang, Ruoxi Zhou, Peng Wang, Yao Li, Sheng Wang, Libo Qin
NLPCC 2025 (Best Poster Award)
Multi-view framework for few-shot drug-drug interaction prediction. Won Best Poster Award at NLPCC 2025.
Large Language Models Meet NLP: A Survey
Libo Qin, Qiguang Chen, Xiachong Feng, Yang Wu, Yongheng Zhang, Yinghui Li, Min Li, Wanxiang Che, Philip S. Yu
FCS 2025
A comprehensive survey on the intersection of large language models and NLP.
Single-to-Multiple: Learning Multiple Intent Detection with Only Single Intent Data
Jingxuan Zhou, Shijue Huang, Weiyun Wang, Qiguang Chen, Yongheng Zhang, Tianbao Xie, Libo Qin
Data Intelligence 2026
A method for learning multiple intent detection using only single intent training data.