0

Qi Wang

Papers
21

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile
Semantic Scholar
Attribution policy →
21papers

Authored papers

21

World Action Models are Zero-shot Policies

arXiv 2026

2026

MoCapAnything V2: End-to-End Motion Capture for Arbitrary Skeletons

arXiv 2026

2026

DreamGen: Unlocking Generalization in Robot Learning through Video World Models

arXiv 2025

2025

ScreenExplorer: Training a Vision-Language Model for Diverse Exploration in Open GUI World

arXiv 2025

2025

UniMoE-Audio: Unified Speech and Music Generation with Dynamic-Capacity MoE

arXiv 2025

2025

From One to More: Contextual Part Latents for 3D Generation

ICCV 2025

2025

RLEP: Reinforcement Learning with Experience Replay for LLM Reasoning

arXiv 2025

2025

Scale Efficient Training for Large Datasets

scale-efficient-training-for-large-datasets

2025

Leanabell-Prover: Posttraining Scaling in Formal Reasoning

arXiv 2025

2025

Capybara-OMNI: An Efficient Paradigm for Building Omni-Modal Language Models

arXiv 2025

2025

GR00T N1: An Open Foundation Model for Generalist Humanoid Robots

arXiv 2025

2025

The Unanticipated Asymmetry Between Perceptual Optimization and Assessment

arXiv 2025

2025

ScreenAgent: A Vision Language Model-driven Computer Control Agent

arXiv 2024

2024

Towards Next-Generation LLM-based Recommender Systems: A Survey and Beyond

arXiv 2024

2024

DailyDVS-200: A Comprehensive Benchmark Dataset for Event-Based Action Recognition

arXiv 2024

2024

LDM: Large Tensorial SDF Model for Textured Mesh Generation

arXiv 2024

2024

MuChin: A Chinese Colloquial Description Benchmark for Evaluating Language Models in the Field of Music

arXiv 2024

2024

STAR: A First-Ever Dataset and A Large-Scale Benchmark for Scene Graph Generation in Large-Size Satellite Imagery

arXiv 2024

2024

Pretraining is All You Need: A Multi-Atlas Enhanced Transformer Framework for Autism Spectrum Disorder Classification

arXiv 2023

2023

DISGAN: Wavelet-informed Discriminator Guides GAN to MRI Super-resolution with Noise Cleaning

arXiv 2023

2023

RenderIH: A Large-scale Synthetic Dataset for 3D Interacting Hand Pose Estimation

ICCV 2023 1

2023

Affiliations

No known affiliations.

Frequent co-authors

10

from 21 papers