Yifan Xu
- Papers
- 20
Cite
Notes
Only stored in your browser.
Authored papers
20Toward Cognitive Supersensing in Multimodal Large Language Model
arXiv 2026
AndroidGen: Building an Android Language Agent under Data Scarcity
arXiv 2025
Neural Motion Simulator: Pushing the Limit of World Models in Reinforcement Learning
arXiv 2025
MMGDreamer: Mixed-Modality Graph for Geometry-Controllable 3D Indoor Scene Generation
mmgdreamer-mixed-modality-graph-for-geometry
Spike-driven Transformer V2: Meta Spiking Neural Network Architecture Inspiring the Design of Next-generation Neuromorphic Chips
arXiv 2024
ChatGLM: A Family of Large Language Models from GLM-130B to GLM-4 All Tools
arXiv 2024
Libra: Building Decoupled Vision System on Large Language Models
arXiv 2024
3DMIT: 3D Multi-modal Instruction Tuning for Scene Understanding
arXiv 2024
ChatGLM-Math: Improving Math Problem-Solving in Large Language Models with a Self-Critique Pipeline
arXiv 2024
AndroidLab: Training and Systematic Benchmarking of Android Autonomous Agents
arXiv 2024
VisualAgentBench: Towards Large Multimodal Models as Visual Foundation Agents
arXiv 2024
AgentBench: Evaluating LLMs as Agents
arXiv 2023
GOAL: A Challenging Knowledge-grounded Video Captioning Benchmark for Real-time Soccer Commentary Generation
arXiv 2023
WebGLM: Towards An Efficient Web-Enhanced Question Answering System with Human Preferences
arXiv 2023
AlignBench: Benchmarking Chinese Alignment of Large Language Models
arXiv 2023
BLIVA: A Simple Multimodal LLM for Better Handling of Text-Rich Visual Questions
arXiv 2023
GLM-130B: An Open Bilingual Pre-trained Model
arXiv 2022
On the Feasibility of Cross-Task Transfer with Model-Based Reinforcement Learning
arXiv 2022
Pose Recognition with Cascade Transformers
CVPR 2021 1
Co-Scale Conv-Attentional Image Transformers
ICCV 2021 10
Affiliations
Frequent co-authors
10from 20 papers