Xuezhi Cao
- Papers: 13
Authored papers: 13
WBench: A Comprehensive Multi-turn Benchmark for Interactive Video World Model Evaluation
arXiv 2026
General365: Benchmarking General Reasoning in Large Language Models Across Diverse and Challenging Tasks
arXiv 2026
LongCat-Next: Lexicalizing Modalities as Discrete Tokens
arXiv 2026
LongCat-Flash-Thinking-2601 Technical Report
arXiv 2026
LARY: A Latent Action Representation Yielding Benchmark for Generalizable Vision-to-Action Alignment
arXiv 2026
LongCat-Flash-Prover: Advancing Native Formal Reasoning via Agentic Tool-Integrated Reinforcement Learning
arXiv 2026
Leveraging Dual Process Theory in Language Agent Framework for Real-time Simultaneous Human-AI Collaboration
arXiv 2025
Q-Eval-100K: Evaluating Visual Quality and Alignment Level for Text-to-Vision Content
CVPR 2025
AMO-Bench: Large Language Models Still Struggle in High School Math Competitions
arXiv 2025
R-Horizon: How Far Can Your Large Reasoning Model Really Go in Breadth and Depth?
arXiv 2025
I2CR: Intra- and Inter-modal Collaborative Reflections for Multimodal Entity Linking
arXiv 2025
Making Mathematical Reasoning Adaptive
arXiv 2025
A Wolf in Sheep's Clothing: Generalized Nested Jailbreak Prompts can Fool Large Language Models Easily
arXiv 2023
Affiliations
Frequent co-authors
10 from 13 papers