Jiaqi Liu
- Papers
- 24
Cite
Notes
Only stored in your browser.
Authored papers
24AutoResearchClaw: Self-Reinforcing Autonomous Research with Human-AI Collaboration
arXiv 2026
MetaClaw: Just Talk -- An Agent That Meta-Learns and Evolves in the Wild
arXiv 2026
SimpleMem: Efficient Lifelong Memory for LLM Agents
arXiv 2026
SkillRL: Evolving Agents via Recursive Skill-Augmented Reinforcement Learning
arXiv 2026
ClawArena: Benchmarking AI Agents in Evolving Information Environments
arXiv 2026
Mixture of Horizons in Action Chunking
arXiv 2025
A Survey of Scientific Large Language Models: From Data Foundations to Agent Frontiers
arXiv 2025
Multi-focal Conditioned Latent Diffusion for Person Image Synthesis
CVPR 2025 1
UQABench: Evaluating User Embedding for Prompting LLMs in Personalized Question Answering
arXiv 2025
Interact, Instruct to Improve: A LLM-Driven Parallel Actor-Reasoner Framework for Enhancing Autonomous Vehicle Interactions
arXiv 2025
Thinking with Images for Multimodal Reasoning: Foundations, Methods, and Future Frontiers
arXiv 2025
Agent0-VL: Exploring Self-Evolving Agent for Tool-Integrated Vision-Language Reasoning
arXiv 2025
Agent0: Unleashing Self-Evolving Agents from Zero Data via Tool-Integrated Reasoning
arXiv 2025
Ovis-Image Technical Report
arXiv 2025
Medal S: Spatio-Textual Prompt Model for Medical Segmentation
arXiv 2025
AgentGym-RL: Training LLM Agents for Long-Horizon Decision Making through Multi-Turn Reinforcement Learning
arXiv 2025
BAPO: Stabilizing Off-Policy Reinforcement Learning for LLMs via Balanced Policy Optimization with Adaptive Clipping
arXiv 2025
BMMR: A Large-Scale Bilingual Multimodal Multi-Discipline Reasoning Dataset
arXiv 2025
SciReasoner: Laying the Scientific Reasoning Ground Across Disciplines
arXiv 2025
Alignment Tipping Process: How Self-Evolution Pushes LLM Agents Off the Rails
arXiv 2025
SciEvalKit: An Open-source Evaluation Toolkit for Scientific General Intelligence
arXiv 2025
Training-free Composite Scene Generation for Layout-to-Image Synthesis
arXiv 2024
MAPPO-PIS: A Multi-Agent Proximal Policy Optimization Method with Prior Intent Sharing for CAVs' Cooperative Decision-Making
arXiv 2024
Deep Industrial Image Anomaly Detection: A Survey
arXiv 2023
Affiliations
Frequent co-authors
10from 24 papers