Jing Wang
- Papers: 19
Authored papers (19)
World Action Models are Zero-shot Policies
arXiv 2026
SAMA: Factorized Semantic Anchoring and Motion Alignment for Instruction-Guided Video Editing
arXiv 2026
DreamGen: Unlocking Generalization in Robot Learning through Video World Models
arXiv 2025
RepText: Rendering Visual Text via Replicating
arXiv 2025
Kwai Keye-VL 1.5 Technical Report
arXiv 2025
IF-VidCap: Can Video Caption Models Follow Instructions?
arXiv 2025
MMR1: Enhancing Multimodal Reasoning with Variance-Aware Sampling and Open Resources
arXiv 2025
GR00T N1: An Open Foundation Model for Generalist Humanoid Robots
arXiv 2025
FancyVideo: Towards Dynamic and Consistent Video Generation via Cross-frame Textual Guidance
arXiv 2024
Qihoo-T2X: An Efficient Proxy-Tokenized Diffusion Transformer for Text-to-Any-Task
arXiv 2024
MusicMamba: A Dual-Feature Modeling Approach for Generating Chinese Traditional Music with Modal Precision
arXiv 2024
KIND: Knowledge Integration and Diversion for Training Decomposable Models
arXiv 2024
Model Editing for LLMs4Code: How Far are We?
arXiv 2024
VideoEval: Comprehensive Benchmark Suite for Low-Cost Evaluation of Video Foundation Model
arXiv 2024
TVR-Ranking: A Dataset for Ranked Video Moment Retrieval with Imprecise Queries
arXiv 2024
Towards Coarse-to-Fine Evaluation of Inference Efficiency for Large Language Models
arXiv 2024
MS-DETR: Natural Language Video Localization with Sampling Moment-Moment Interaction
arXiv 2023
Unsupervised Contrast-Consistent Ranking with Language Models
arXiv 2023
LogoDet-3K: A Large-Scale Image Dataset for Logo Detection
arXiv 2020
Affiliations
Frequent co-authors
10 from 19 papers