Jiaxin Ge
- Papers: 10
Authored papers
VisGym: Diverse, Customizable, Scalable Environments for Multimodal Agents
arXiv 2026
Generate, but Verify: Reducing Hallucination in Vision-Language Models with Retrospective Resampling
arXiv 2025
Chain-of-Visual-Thought: Teaching VLMs to See and Think Better with Continuous Visual Tokens
arXiv 2025
Puzzled by Puzzles: When Vision-Language Models Can't Take a Hint
arXiv 2025
Constantly Improving Image Models Need Constantly Improving Benchmarks
arXiv 2025
AutoPresent: Designing Structured Visuals from Scratch
CVPR 2025 1
EmpathyAgent: Can Embodied Agents Conduct Empathetic Actions?
arXiv 2025
Learning from Mistakes: Iterative Prompt Relabeling for Text-to-Image Diffusion Model Training
arXiv 2023
Natural Language Embedded Programs for Hybrid Language Symbolic Reasoning
arXiv 2023
Entailment as Robust Self-Learner
arXiv 2023