0

Jing Wang

Papers
19

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile
Semantic Scholar
Attribution policy →
19papers

Authored papers

19

World Action Models are Zero-shot Policies

arXiv 2026

2026

SAMA: Factorized Semantic Anchoring and Motion Alignment for Instruction-Guided Video Editing

arXiv 2026

2026

DreamGen: Unlocking Generalization in Robot Learning through Video World Models

arXiv 2025

2025

RepText: Rendering Visual Text via Replicating

arXiv 2025

2025

Kwai Keye-VL 1.5 Technical Report

arXiv 2025

2025

IF-VidCap: Can Video Caption Models Follow Instructions?

arXiv 2025

2025

MMR1: Enhancing Multimodal Reasoning with Variance-Aware Sampling and Open Resources

arXiv 2025

2025

GR00T N1: An Open Foundation Model for Generalist Humanoid Robots

arXiv 2025

2025

FancyVideo: Towards Dynamic and Consistent Video Generation via Cross-frame Textual Guidance

arXiv 2024

2024

Qihoo-T2X: An Efficient Proxy-Tokenized Diffusion Transformer for Text-to-Any-Task

arXiv 2024

2024

MusicMamba: A Dual-Feature Modeling Approach for Generating Chinese Traditional Music with Modal Precision

arXiv 2024

2024

KIND: Knowledge Integration and Diversion for Training Decomposable Models

arXiv 2024

2024

Model Editing for LLMs4Code: How Far are We?

arXiv 2024

2024

VideoEval: Comprehensive Benchmark Suite for Low-Cost Evaluation of Video Foundation Model

arXiv 2024

2024

TVR-Ranking: A Dataset for Ranked Video Moment Retrieval with Imprecise Queries

arXiv 2024

2024

Towards Coarse-to-Fine Evaluation of Inference Efficiency for Large Language Models

arXiv 2024

2024

MS-DETR: Natural Language Video Localization with Sampling Moment-Moment Interaction

arXiv 2023

2023

Unsupervised Contrast-Consistent Ranking with Language Models

arXiv 2023

2023

LogoDet-3K: A Large-Scale Image Dataset for Logo Detection

arXiv 2020

2020

Affiliations

No known affiliations.

Frequent co-authors

10

from 19 papers