Minjia Zhang
- Papers
- 12
Cite
Notes
Only stored in your browser.
Authored papers
12Spreadsheet-RL: Advancing Large Language Model Agents on Realistic Spreadsheet Tasks via Reinforcement Learning
arXiv 2026
From Context to Skills: Can Language Models Learn from Context Skillfully?
arXiv 2026
VTool-R1: VLMs Learn to Think with Images via Reinforcement Learning on Multimodal Tool Use
arXiv 2025
Teaching Large Language Models to Maintain Contextual Faithfulness via Synthetic Tasks and Reinforcement Learning
arXiv 2025
Slow-Fast Policy Optimization: Reposition-Before-Update for LLM Reasoning
arXiv 2025
FaithLens: Detecting and Explaining Faithfulness Hallucination
arXiv 2025
MMGR: Multi-Modal Generative Reasoning
arXiv 2025
Next Token Prediction Towards Multimodal Intelligence: A Comprehensive Survey
arXiv 2024
Improving Retrieval-Augmented Generation in Medicine with Iterative Follow-up Questions
arXiv 2024
UltraEdit: Instruction-based Fine-Grained Image Editing at Scale
arXiv 2024
DeepSpeed-VisualChat: Multi-Round Multi-Image Interleave Chat via Multi-Modal Causal Attention
arXiv 2023
Model Tells You What to Discard: Adaptive KV Cache Compression for LLMs
arXiv 2023
Affiliations
Frequent co-authors
10from 12 papers