Chaofan Tao
- Papers: 13
Authored papers (13)
SWE-Lego: Pushing the Limits of Supervised Fine-tuning for Software Issue Resolving
arXiv 2026
Locate, Steer, and Improve: A Practical Survey of Actionable Mechanistic Interpretability in Large Language Models
arXiv 2026
Attention Sink in Transformers: A Survey on Utilization, Interpretation, and Mitigation
arXiv 2026
MMDeepResearch-Bench: A Benchmark for Multimodal Deep Research Agents
arXiv 2026
PhyX: Does Your Model Have the "Wits" for Physical Reasoning?
arXiv 2025
The Synergy Dilemma of Long-CoT SFT and RL: Investigating Post-Training Techniques for Reasoning VLMs
arXiv 2025
LongEmotion: Measuring Emotional Intelligence of Large Language Models in Long-Context Interaction
arXiv 2025
Autoregressive Models in Vision: A Survey
arXiv 2024
Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies
arXiv 2024
Rethinking Kullback-Leibler Divergence in Knowledge Distillation for Large Language Models
arXiv 2024
MEIT: Multi-Modal Electrocardiogram Instruction Tuning on Large Language Models for Report Generation
arXiv 2024
UPop: Unified and Progressive Pruning for Compressing Vision-Language Transformers
arXiv 2023
This Looks Like That: Deep Learning for Interpretable Image Recognition
Frequent co-authors
10 from 13 papers