About Me

Hi, There๐Ÿ‘‹! I am an undergraduate student at Harbin Institute of Technology (Shenzhen), working on Trustworthy Multimodal AI and Adaptive, Data-Efficient Learning.

My previous research focuses on robust and reliable multimodal model adaptation under distribution shift, especially for test-time adaptation and hallucination mitigation in vision-language systems.

I am currently diving into world models and embodied AI, aiming to help build more intelligent and capable robotic systems.

News

  • 2026.05: ย ๐ŸŽ‰๐ŸŽ‰ Two paper were accepted by ICML 2026, Congrats !! ๐Ÿฅณ
  • 2026.02: ย ๐ŸŽ‰๐ŸŽ‰ One paper โ€œDo All Individual Layers Help?โ€, was accepted by CVPR 2026 Findings.
  • 2026.02: ย ๐ŸŽ‰๐ŸŽ‰ One paper โ€œTest-Time Distillation for Continual Model Adaptationโ€, was accepted by CVPR 2026 Findings.
  • 2025.11: ย ๐ŸŽ‰๐ŸŽ‰ Recognized as one of the โ€˜Top Ten Outstanding College Studentsโ€™ at Harbin Institute of Technology (Shenzhen)
  • 2024.10: ย ๐ŸŽ‰๐ŸŽ‰ Awarded the Chinese National Scholarship.

Publications

ICML 2026
AFIP overview

Correcting Visual Blur Induced by Attention Distraction to Reduce Hallucinations: Algorithm and Theory

Quanjiang Liโ€ , Zhiming Liuโ€ , Wei Luo, Tingjin Luo, Chenping Hou

  • We identify the link between human-like attention distraction and object hallucinations in multimodal models, and propose AFIP, a training-free method that corrects spatial and temporal attention dispersion to enhance visual grounding without additional training.
  • โ€  indicates equal contribution (co-first authors).
ICML 2026
MOON overview

Von Mises-Fisher Mixture Model with Dynamic Shrinkage for Realistic Test-Time Transduction

Jiazhen Huang, Zhiming Liu, Changhu Wang, Wei Ju, Ziyue Qiao, Xiao Luo

  • We identify the brittleness of transductive methods under imbalanced distributions and propose MOON, a training-free, model-agnostic framework that dynamically adjusts shrinkage strength to mitigate negative transfer and enhance VLM performance without retraining.
CVPR 2026
paper-1

Do All Individual Layers Help? An Empirical Study of Task-Interfering Layers in Vision-Language Models

Zhiming Liu, Yujie Wei, Lei Feng, Xiu Su, Xiaobo Xia, Weili Guan, Zeke Xie, Shuo Yang

  • We identify task-interfering layers in vision-language models and propose a lightweight test-time intervention strategy that improves downstream few-shot reasoning without retraining.
CVPR 2026
paper-2

Test-Time Distillation for Continual Model Adaptation

Xiao Chenโ€ , Jiazhen Huangโ€ , Zhiming Liu, Qinting Jiang, Fanding Huang, Jingyan Jiang, Zhi Wang

  • We propose a collaborative test-time distillation framework for continual model adaptation that improves robustness and generalization under realistic distribution shifts.
  • โ€  indicates equal contribution (co-first authors).
Under Review
paper-1

Adaptive Disentangled Representation Learning for Incomplete Multi-View Multi-Label Classification

Quanjiang Liโ€ , Zhiming Liuโ€ , TianxiangXuโ€ , Tingjin Luo, Chenping Hou

  • We proposed ADRL, a novel framework that jointly addresses structural distortion and semantic ambiguity in incomplete multi-view settings by integrating label-guided feature disentanglement and category-aware embedding interaction.
  • โ€  indicates equal contribution (co-first authors).

Honors and Awards

  • 2024: Finalist Award in the Mathematical Contest in Modeling (MCM)
  • 2024: Chinese National Scholarship
  • 2024: First Prize Scholarship at Harbin Institute of Technology (Shenzhen)
  • 2025: National Second Prize, Global Campus Artificial Intelligence Algorithm Elite Competition 2025
  • 2025: Top Ten Outstanding College Students of Harbin Institute of Technology (Shenzhen)
  • 2025: First Prize Scholarship at Harbin Institute of Technology (Shenzhen)

Research Project

May 2025 - Nov 2025
Task-Interfering Layer Optimization for Test-Time Adaptation of Multimodal Large Language Models
Leader
Nov 2025 - Jan 2026
Correcting Visual Blur Induced by Attention Distraction to Reduce Hallucinations: Algorithm and Theory
Leader
Jan 2025 - Jun 2025
Adaptive Disentangled Representation Learning for Incomplete Multi-View Multi-Label Classification
Leader
Aug 2025 - Nov 2025
CoDiRe: Collaborative Test-Time Distillation for Robust Domain Generalization
Core Member
Nov 2025 - Jan 2026
Von Mises-Fisher Mixture Model with Dynamic Shrinkage for Realistic Test-Time Transduction
Core Member
Feb 2026 - Apr 2026
TouchAnything: Dataset and Framework for Bimanual Tactile Estimation from Egocentric Video
Core Member
Aug 2025 - Jan 2026
Multimodal Large Language Models for Industrial Quality Inspection
Core Member
Dec 2024 - Nov 2025
AI-Powered Microscopic Parasite Recognition and Extraction System
Core Member
Sep 2025 - Dec 2025
Deterministic Transition State Prediction via Flow Matching and Equivariant Geometric Learning
Core Developer

Educations

Harbin Institute of Technology (Shenzhen)
Bachelor of Engineering in Automation
2023 - 2027

Experience

Hong Kong University of Science and Technology
Visiting Student
Advisor: Prof. Song Guo
April 2026 - Present
Tsinghua University
Research Intern
Advisor: Prof. Zhi Wang
November 2025 - April 2026
Harbin Institute of Technology (Shenzhen)
Research Intern
Advisor: Prof. Shuo Yang
March 2025 - April 2026