0

Jun Zhou

Papers
40

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile
Semantic Scholar
Attribution policy →
40papers

Authored papers

40

Can LLMs Clean Up Your Mess? A Survey of Application-Ready Data Preparation with LLMs

arXiv 2026

2026

KVPO: ODE-Native GRPO for Autoregressive Video Alignment via KV Semantic Exploration

arXiv 2026

2026

SkillNet: Create, Evaluate, and Connect AI Skills

arXiv 2026

2026

LLaDA2.1: Speeding Up Text Diffusion via Token Editing

arXiv 2026

2026

LLaDA-o: An Effective and Length-Adaptive Omni Diffusion Model

arXiv 2026

2026

InftyThink+: Effective and Efficient Infinite-Horizon Reasoning via Reinforcement Learning

arXiv 2026

2026

LLaDA-V: Large Language Diffusion Models with Visual Instruction Tuning

arXiv 2025

2025

STAR: Spatial-Temporal Augmentation with Text-to-Video Models for Real-World Video Super-Resolution

ICCV 2025

2025

Ming-Omni: A Unified Multimodal Model for Perception and Generation

arXiv 2025

2025

Audio-visual Controlled Video Diffusion with Masked Selective State Spaces Modeling for Natural Talking Head Generation

ICCV 2025

2025

Effective and Efficient Masked Image Generation Models

arXiv 2025

2025

MVInverse: Feed-forward Multi-view Inverse Rendering in Seconds

arXiv 2025

2025

LLaDA2.0: Scaling Up Diffusion Language Models to 100B

arXiv 2025

2025

MIND-V: Hierarchical Video Generation for Long-Horizon Robotic Manipulation with RL-based Physical Alignment

arXiv 2025

2025

M2-omni: Advancing Omni-MLLM for Comprehensive Modality Support with Competitive Performance

arXiv 2025

2025

Refining Sentence Embedding Model through Ranking Sentences Generation with Large Language Models

arXiv 2025

2025

LookAhead Tuning: Safer Language Models via Partial Answer Previews

arXiv 2025

2025

Controllable Layer Decomposition for Reversible Multi-Layer Image Generation

arXiv 2025

2025

Every Step Evolves: Scaling Reinforcement Learning for Trillion-Scale Thinking Model

arXiv 2025

2025

Ming-UniVision: Joint Image Understanding and Generation with a Unified Continuous Tokenizer

arXiv 2025

2025

M2-Reasoning: Empowering MLLMs with Unified General and Spatial Reasoning

arXiv 2025

2025

Every Activation Boosted: Scaling General Reasoner to 1 Trillion Open Language Foundation

arXiv 2025

2025

Every Sample Matters: Leveraging Mixture-of-Experts and High-Quality Data for Efficient and Accurate Code LLM

arXiv 2025

2025

Robust Preference Optimization via Dynamic Target Margins

arXiv 2025

2025

Ring-lite: Scalable Reasoning via C3PO-Stabilized Reinforcement Learning for LLMs

arXiv 2025

2025

KAG: Boosting LLMs in Professional Domains via Knowledge Augmented Generation

arXiv 2024

2024

TimeMixer: Decomposable Multiscale Mixing for Time Series Forecasting

timemixer-decomposable-multiscale-mixing-for

2024

ConsistentID: Portrait Generation with Multimodal Fine-Grained Identity Preserving

arXiv 2024

2024

LLMOPT: Learning to Define and Solve General Optimization Problems from Scratch

arXiv 2024

2024

TalkingGaussian: Structure-Persistent 3D Talking Head Synthesis via Gaussian Splatting

arXiv 2024

2024

OneKE: A Dockerized Schema-Guided LLM Agent-based Knowledge Extraction System

arXiv 2024

2024

OneGen: Efficient One-Pass Unified Generation and Retrieval for LLMs

arXiv 2024

2024

Learning to Plan for Retrieval-Augmented Large Language Models from Knowledge Graphs

arXiv 2024

2024

CMNER: A Chinese Multimodal NER Dataset based on Social Media

arXiv 2024

2024

From Generalist to Specialist: Adapting Vision Language Models via Task-Specific Visual Instruction Tuning

arXiv 2024

2024

RemoteCLIP: A Vision Language Foundation Model for Remote Sensing

arXiv 2023

2023

Efficient Region-Aware Neural Radiance Fields for High-Fidelity Talking Portrait Synthesis

ICCV 2023 1

2023

EasyTPP: Towards Open Benchmarking Temporal Point Processes

arXiv 2023

2023

Deep Fusion Transformer Network with Weighted Vector-Wise Keypoints Voting for Robust 6D Object Pose Estimation

ICCV 2023 1

2023

Towards Anytime Fine-tuning: Continually Pre-trained Language Models with Hypernetwork Prompt

arXiv 2023

2023

Affiliations

No known affiliations.

Frequent co-authors

10

from 40 papers