Master StudentI am Jian Lang, currently a master candidate in Software Engineering at the University of Electronic Science and Technology of China (UESTC), under the supervision of Prof Fan Zhou. Before that, I received my Bachelor of Engineering degree from Fuzhou University.
My research mainly focuses on robust, reliable, and stable multimodal systems that can perform effectively under missing modalities, distribution (domain) shifts, weak supervision (label scarcity), and data scarcity. And I am also interested in video analysis, detection, and large multimodal models for some applications.
Feel free to contact me if you have any questions about my research or potential collaboration opportunities.
") does not match the recommended repository name for your site ("").
", so that your site can be accessed directly at "http://".
However, if the current repository name is intended, you can ignore this message by removing "{% include widgets/debug_repo_name.html %}" in index.html.
",
which does not match the baseurl ("") configured in _config.yml.
baseurl in _config.yml to "".

Jian Lang, Rongpei Hong, Ting Zhong, Yong Wang, Fan Zhou† († corresponding author)
ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD) 2026 CCF-A Robustness Multimodal Learning Video Analysis & Detection
We propose RADAR, a novel retrieval-augmented distribution alignment and target-aware self-training framework that, for the first time, enables robust test-time adaptation for fake news video detection under drastic topic-level distribution shifts.
Jian Lang, Rongpei Hong, Ting Zhong, Yong Wang, Fan Zhou† († corresponding author)
ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD) 2026 CCF-A Robustness Multimodal Learning Video Analysis & Detection
We propose RADAR, a novel retrieval-augmented distribution alignment and target-aware self-training framework that, for the first time, enables robust test-time adaptation for fake news video detection under drastic topic-level distribution shifts.

Jian Lang, Rongpei Hong, Ting Zhong, Leiting Chen, Qiang Gao, Fan Zhou† († corresponding author)
ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD) 2026 CCF-A Robustness Multimodal Learning Large Multimodal Model
We propose ALARM, a label-free harmful meme detection framework powered by Large Multimodal Model self-improvement, which mitigates label scarcity and enables prompt and robust adaptation to evolving harmful content in web memes.
Jian Lang, Rongpei Hong, Ting Zhong, Leiting Chen, Qiang Gao, Fan Zhou† († corresponding author)
ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD) 2026 CCF-A Robustness Multimodal Learning Large Multimodal Model
We propose ALARM, a label-free harmful meme detection framework powered by Large Multimodal Model self-improvement, which mitigates label scarcity and enables prompt and robust adaptation to evolving harmful content in web memes.

Rongpei Hong, Jian Lang, Ting Zhong†, Yong Wang, Fan Zhou († corresponding author)
ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD) 2026 CCF-A Large Multimodal Model
We propose TAME, a novel training-free and state-aware personalized Multimodal Large Multimodal Model (MLLM) assistant powered by double memories.
Rongpei Hong, Jian Lang, Ting Zhong†, Yong Wang, Fan Zhou († corresponding author)
ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD) 2026 CCF-A Large Multimodal Model
We propose TAME, a novel training-free and state-aware personalized Multimodal Large Multimodal Model (MLLM) assistant powered by double memories.

Jiao Li, Jian Lang, Xikai Tang†, Ting Zhong, Fan Zhou († corresponding author)
The Association for the Advancement of Artificial Intelligence (AAAI) 2026 CCF-A Robustness Multimodal Learning Video Analysis & Detection
We propose SCANNER, the first test-time adaptation framework tailored for distribution shifting hate video detection.
Jiao Li, Jian Lang, Xikai Tang†, Ting Zhong, Fan Zhou († corresponding author)
The Association for the Advancement of Artificial Intelligence (AAAI) 2026 CCF-A Robustness Multimodal Learning Video Analysis & Detection
We propose SCANNER, the first test-time adaptation framework tailored for distribution shifting hate video detection.

Jian Lang, Rongpei Hong, Zhangtao Cheng, Yong Wang, Ting Zhong, Fan Zhou† († corresponding author)
ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD) 2025 CCF-A Robustness Multimodal Learning
We propose REDEEM, the extension work of our RAGPT accetped to AAAI 2025, a novel framework that pioneers a retrieval-guided conditional generation paradigm for enhancing the robustness of pre-trained Multimodal Transformer.
Jian Lang, Rongpei Hong, Zhangtao Cheng, Yong Wang, Ting Zhong, Fan Zhou† († corresponding author)
ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD) 2025 CCF-A Robustness Multimodal Learning
We propose REDEEM, the extension work of our RAGPT accetped to AAAI 2025, a novel framework that pioneers a retrieval-guided conditional generation paradigm for enhancing the robustness of pre-trained Multimodal Transformer.

Jian Lang*, Zhangtao Cheng*, Ting Zhong, Fan Zhou† (* equal contribution, † corresponding author)
The Association for the Advancement of Artificial Intelligence (AAAI) 2025 CCF-A Robustness Multimodal Learning
We propose RAGPT, a novel retrieval-augmented dynamic prompt-tuning framework for enhancing the robustness of pre-trained Multimodal Transformer under modality missing conditions.
Jian Lang*, Zhangtao Cheng*, Ting Zhong, Fan Zhou† (* equal contribution, † corresponding author)
The Association for the Advancement of Artificial Intelligence (AAAI) 2025 CCF-A Robustness Multimodal Learning
We propose RAGPT, a novel retrieval-augmented dynamic prompt-tuning framework for enhancing the robustness of pre-trained Multimodal Transformer under modality missing conditions.

Jian Lang, Rongpei Hong, Jin Xu, Xovee Xu, Yili Li, Fan Zhou† († corresponding author)
The Web Conference (WWW) 2025 CCF-A Video Analysis & Detection
We introduce MoRE (Mixture of Retrieval-augmented multimodal Experts), a novel framework designed to enhance short video hate detection.
Jian Lang, Rongpei Hong, Jin Xu, Xovee Xu, Yili Li, Fan Zhou† († corresponding author)
The Web Conference (WWW) 2025 CCF-A Video Analysis & Detection
We introduce MoRE (Mixture of Retrieval-augmented multimodal Experts), a novel framework designed to enhance short video hate detection.

Rongpei Hong*, Jian Lang*, Ting Zhong, Fan Zhou† (* equal contribution, † corresponding author)
In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV) 2025 CCF-A Video Analysis & Detection
We propose CRAVE, a novel CRoss-domAin retrieVal augmEntation framework that transfers knowledge from resource-rich image-text domain to enhance malicious video detection.
Rongpei Hong*, Jian Lang*, Ting Zhong, Fan Zhou† (* equal contribution, † corresponding author)
In Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV) 2025 CCF-A Video Analysis & Detection
We propose CRAVE, a novel CRoss-domAin retrieVal augmEntation framework that transfers knowledge from resource-rich image-text domain to enhance malicious video detection.

Zhangtao Cheng*, Jian Lang*, Ting Zhong, Fan Zhou† (* equal contribution, † corresponding author)
ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD) 2025 CCF-A Robustness Multimodal Learning Video Analysis & Detection
We propose SCRAG, a novel Self-Correlation Retrieval-Augmented Generative framework designed to enhance missing-modality robustness in micro-video popularity prediction.
Zhangtao Cheng*, Jian Lang*, Ting Zhong, Fan Zhou† (* equal contribution, † corresponding author)
ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD) 2025 CCF-A Robustness Multimodal Learning Video Analysis & Detection
We propose SCRAG, a novel Self-Correlation Retrieval-Augmented Generative framework designed to enhance missing-modality robustness in micro-video popularity prediction.

Rongpei Hong, Jian Lang, Jin Xu, Zhangtao Cheng, Ting Zhong†, Fan Zhou († corresponding author)
The Web Conference (WWW) 2025 CCF-A Video Analysis & Detection
In this work, we introduce ExMRD, a novel Explainable Micro-video Rumor Detection (MVRD) framework designed to generate detailed and coherent explanations for enhancing MVRD.
Rongpei Hong, Jian Lang, Jin Xu, Zhangtao Cheng, Ting Zhong†, Fan Zhou († corresponding author)
The Web Conference (WWW) 2025 CCF-A Video Analysis & Detection
In this work, we introduce ExMRD, a novel Explainable Micro-video Rumor Detection (MVRD) framework designed to generate detailed and coherent explanations for enhancing MVRD.

Yili Li, Jian Lang, Rongpei Hong, Qing Chen, Zhangtao Cheng, Jia Chen, Ting Zhong, Fan Zhou† († corresponding author)
IEEE International Conference on Multimedia & Expo (ICME) 2025 CCF-B Video Analysis & Detection
We propose a novel model-agnostic framework REAL that generates manipulation-aware representations to enhance existing methods in detecting fake news videos.
Yili Li, Jian Lang, Rongpei Hong, Qing Chen, Zhangtao Cheng, Jia Chen, Ting Zhong, Fan Zhou† († corresponding author)
IEEE International Conference on Multimedia & Expo (ICME) 2025 CCF-B Video Analysis & Detection
We propose a novel model-agnostic framework REAL that generates manipulation-aware representations to enhance existing methods in detecting fake news videos.

Ting Zhong, Jian Lang, Yifan Zhang, Zhangtao Cheng†, Kunpeng Zhang, Fan Zhou († corresponding author)
Special Interest Group on Information Retrieval (SIGIR) 2024 CCF-A Video Analysis & Detection
We present MMRA, a multi-modal retrieval-augmented popularity prediction model that enhances prediction accuracy using relevant retrieved information.
Ting Zhong, Jian Lang, Yifan Zhang, Zhangtao Cheng†, Kunpeng Zhang, Fan Zhou († corresponding author)
Special Interest Group on Information Retrieval (SIGIR) 2024 CCF-A Video Analysis & Detection
We present MMRA, a multi-modal retrieval-augmented popularity prediction model that enhances prediction accuracy using relevant retrieved information.