Qi Wang
- Papers: 21
Authored papers
World Action Models are Zero-shot Policies
arXiv 2026
MoCapAnything V2: End-to-End Motion Capture for Arbitrary Skeletons
arXiv 2026
DreamGen: Unlocking Generalization in Robot Learning through Video World Models
arXiv 2025
ScreenExplorer: Training a Vision-Language Model for Diverse Exploration in Open GUI World
arXiv 2025
UniMoE-Audio: Unified Speech and Music Generation with Dynamic-Capacity MoE
arXiv 2025
From One to More: Contextual Part Latents for 3D Generation
ICCV 2025
RLEP: Reinforcement Learning with Experience Replay for LLM Reasoning
arXiv 2025
Scale Efficient Training for Large Datasets
Leanabell-Prover: Posttraining Scaling in Formal Reasoning
arXiv 2025
Capybara-OMNI: An Efficient Paradigm for Building Omni-Modal Language Models
arXiv 2025
GR00T N1: An Open Foundation Model for Generalist Humanoid Robots
arXiv 2025
The Unanticipated Asymmetry Between Perceptual Optimization and Assessment
arXiv 2025
ScreenAgent: A Vision Language Model-driven Computer Control Agent
arXiv 2024
Towards Next-Generation LLM-based Recommender Systems: A Survey and Beyond
arXiv 2024
DailyDVS-200: A Comprehensive Benchmark Dataset for Event-Based Action Recognition
arXiv 2024
LDM: Large Tensorial SDF Model for Textured Mesh Generation
arXiv 2024
MuChin: A Chinese Colloquial Description Benchmark for Evaluating Language Models in the Field of Music
arXiv 2024
STAR: A First-Ever Dataset and A Large-Scale Benchmark for Scene Graph Generation in Large-Size Satellite Imagery
arXiv 2024
Pretraining is All You Need: A Multi-Atlas Enhanced Transformer Framework for Autism Spectrum Disorder Classification
arXiv 2023
DISGAN: Wavelet-informed Discriminator Guides GAN to MRI Super-resolution with Noise Cleaning
arXiv 2023
RenderIH: A Large-scale Synthetic Dataset for 3D Interacting Hand Pose Estimation
ICCV 2023
Frequent co-authors
10 from 21 papers