Kaixin Ma
- Papers
- 13
Cite
Notes
Only stored in your browser.
Authored papers
13WebEvolver: Enhancing Web Agent Self-Improvement with Coevolving World Model
arXiv 2025
WebVoyager: Building an End-to-End Web Agent with Large Multimodal Models
arXiv 2024
RepoGraph: Enhancing AI Software Engineering with Repository-level Code Graph
arXiv 2024
Leopard: A Vision Language Model For Text-Rich Multi-Image Tasks
arXiv 2024
DSBench: How Far Are Data Science Agents to Becoming Data Science Experts?
arXiv 2024
OpenWebVoyager: Building Multimodal Web Agents via Iterative Real-World Exploration, Feedback and Optimization
arXiv 2024
DOCBENCH: A Benchmark for Evaluating LLM-based Document Reading Systems
arXiv 2024
Cognitive Kernel: An Open-source Agent System towards Generalist Autopilots
arXiv 2024
MARVEL: Multidimensional Abstraction and Reasoning through Visual Evaluation and Learning
arXiv 2024
DivScene: Benchmarking LVLMs for Object Navigation with Diverse Scenes and Objects
arXiv 2024
COLUMBUS: Evaluating COgnitive Lateral Understanding through Multiple-choice reBUSes
arXiv 2024
Dense X Retrieval: What Retrieval Granularity Should We Use?
arXiv 2023
LASER: LLM Agent with State-Space Exploration for Web Navigation
arXiv 2023
Affiliations
Frequent co-authors
10from 13 papers