Cite
Notes
Only stored in your browser.
Attribution
HunyuanVideo 1.5 Technical Report
arXiv 2025
Mitigating Hallucinations in Large Vision-Language Models via DPO: On-Policy Data Hold the Key
CVPR 2025 1
Do Not Let Low-Probability Tokens Over-Dominate in RL for LLMs
from 3 papers
Dongqi Han
Dongsheng Li
Xufang Luo
Yunjian Xu
Bing Wu
Bo Peng
Chang Zou
Changlin Li
Coopers Li
Duojun Huang