Skip to content

AMAP-ML/Taming-Hallucinations

Folders and files

NameName
Last commit message
Last commit date

Latest commit

ย 

History

10 Commits
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 

Repository files navigation

Taming Hallucinations

Taming Hallucinations: Boosting MLLMsโ€™ Video Understanding via Counterfactual Video Generation

๐Ÿ  Project Page | Paper

TL;DR: Taming Hallucinations introduces DualityForge, a controllable diffusion-based framework that turns real videos into counterfactual ones, automatically generating paired videos and QA data for contrastive training. Based on the large-scale DualityVidQA dataset and the proposed DNA-Train SFTโ€“RL regime with โ„“1-normalized advantages, our approach reduces hallucinations in multimodal LLMs by 24% and shows strong generalization across benchmarks. Dataset and code will be released.

Method Overview

image

Update

๐Ÿ“Ž Citation

If you find this repository useful, please consider citing:

@article{huang2025taming,
  title={Taming Hallucinations: Boosting MLLMs' Video Understanding via Counterfactual Video Generation},
  author={Huang, Zhe and Wen, Hao and Hao, Aiming and Song, Bingze and Wu, Meiqi and Wu, Jiahong and Chu, Xiangxiang and Lu, Sheng and Wang, Haoqian},
  journal={arXiv preprint arXiv:2512.24271},
  year={2025}
}

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published