Zefan Cai
- Papers: 17
Authored papers (17)
BabyVision: Visual Reasoning Beyond Language
arXiv 2026
R-KV: Redundancy-aware KV Cache Compression for Training-Free Reasoning Models Acceleration
arXiv 2025
A Survey on Latent Reasoning
arXiv 2025
HeadInfer: Memory-Efficient LLM Inference by Head-wise Offloading
arXiv 2025
AdaptiveStep: Automatically Dividing Reasoning Step through Model Confidence
arXiv 2025
MMGR: Multi-Modal Generative Reasoning
arXiv 2025
Next Token Prediction Towards Multimodal Intelligence: A Comprehensive Survey
arXiv 2024
Omni-MATH: A Universal Olympiad Level Mathematic Benchmark For Large Language Models
arXiv 2024
Towards a Unified View of Preference Learning for Large Language Models: A Survey
arXiv 2024
PCA-Bench: Evaluating Multimodal Large Language Models in Perception-Cognition-Action Chain
arXiv 2024
A Spark of Vision-Language Intelligence: 2-Dimensional Autoregressive Transformer for Efficient Finegrained Image Generation
arXiv 2024
Not All Heads Matter: A Head-Level KV Cache Compression Method with Integrated Retrieval and Reasoning
arXiv 2024
LLM Critics Help Catch Bugs in Mathematics: Towards a Better Mathematical Verifier with Natural Language Feedback
arXiv 2024
PyramidKV: Dynamic KV Cache Compression based on Pyramidal Information Funneling
arXiv 2024
ML-Bench: Evaluating Large Language Models and Agents for Machine Learning Tasks on Repository-Level Code
arXiv 2023
Large Language Models are not Fair Evaluators
arXiv 2023
MMICL: Empowering Vision-language Model with Multi-Modal In-Context Learning
arXiv 2023
Affiliations
Frequent co-authors
10 from 17 papers