Haozhe Zhao
- Papers
- 15
Cite
Notes
Only stored in your browser.
Authored papers
15From Context to Skills: Can Language Models Learn from Context Skillfully?
arXiv 2026
LongCat-Next: Lexicalizing Modalities as Discrete Tokens
arXiv 2026
Teaching Large Language Models to Maintain Contextual Faithfulness via Synthetic Tasks and Reinforcement Learning
arXiv 2025
Multimodal Representation Alignment for Image Generation: Text-Image Interleaved Control Is Easier Than You Think
arXiv 2025
MMGR: Multi-Modal Generative Reasoning
arXiv 2025
FaithLens: Detecting and Explaining Faithfulness Hallucination
arXiv 2025
An Image is Worth 1/2 Tokens After Layer 2: Plug-and-Play Inference Acceleration for Large Vision-Language Models
arXiv 2024
Next Token Prediction Towards Multimodal Intelligence: A Comprehensive Survey
arXiv 2024
PCA-Bench: Evaluating Multimodal Large Language Models in Perception-Cognition-Action Chain
arXiv 2024
A Spark of Vision-Language Intelligence: 2-Dimensional Autoregressive Transformer for Efficient Finegrained Image Generation
arXiv 2024
GATEAU: Selecting Influential Samples for Long Context Alignment
arXiv 2024
MMEvalPro: Calibrating Multimodal Benchmarks Towards Trustworthy and Efficient Evaluation
arXiv 2024
UltraEdit: Instruction-based Fine-Grained Image Editing at Scale
arXiv 2024
ML-Bench: Evaluating Large Language Models and Agents for Machine Learning Tasks on Repository-Level Code
arXiv 2023
MMICL: Empowering Vision-language Model with Multi-Modal In-Context Learning
arXiv 2023
Affiliations
Frequent co-authors
10from 15 papers