0

Tao Wang

Papers
26

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile
Semantic Scholar
Attribution policy →
26papers

Authored papers

26

DrawMotion: Generating 3D Human Motions by Freehand Drawing

arXiv 2026

2026

PRBench: End-to-end Paper Reproduction in Physics Research

arXiv 2026

2026

MedAgentGym: Training LLM Agents for Code-Based Medical Reasoning at Scale

arXiv 2025

2025

ALTo: Adaptive-Length Tokenizer for Autoregressive Mask Generation

arXiv 2025

2025

Pseudo-Knowledge Graph: Meta-Path Guided Retrieval and In-Graph Text for RAG-Equipped LLM

arXiv 2025

2025

T2AV-Compass: Towards Unified Evaluation for Text-to-Audio-Video Generation

arXiv 2025

2025

Let It Flow: Agentic Crafting on Rock and Roll, Building the ROME Model within an Open Agentic Learning Ecosystem

arXiv 2025

2025

ToReMi: Topic-Aware Data Reweighting for Dynamic Pre-Training Data Selection

arXiv 2025

2025

HiMTok: Learning Hierarchical Mask Tokens for Image Segmentation with Large Multimodal Model

ICCV 2025

2025

MaterialMVP: Illumination-Invariant Material Generation via Multi-view PBR Diffusion

ICCV 2025

2025

Visual Style Prompt Learning Using Diffusion Models for Blind Face Restoration

arXiv 2024

2024

OMG: Occlusion-friendly Personalized Multi-concept Generation in Diffusion Models

arXiv 2024

2024

GroundingGPT:Language Enhanced Multi-modal Grounding Model

arXiv 2024

2024

UnifiedMLLM: Enabling Unified Representation for Multi-modal Multi-tasks With Large Language Model

arXiv 2024

2024

WMCodec: End-to-End Neural Speech Codec with Deep Watermarking for Authenticity Verification

arXiv 2024

2024

An Intelligent Remote Sensing Image Quality Inspection System

arXiv 2023

2023

Student Classroom Behavior Detection based on YOLOv7-BRA and Multi-Model Fusion

arXiv 2023

2023

Valley: Video Assistant with Large Language model Enhanced abilitY

arXiv 2023

2023

Towards Real-World Blind Face Restoration with Generative Diffusion Prior

arXiv 2023

2023

Fewer-token Neural Speech Codec with Time-invariant Codes

arXiv 2023

2023

GridFormer: Residual Dense Transformer with Grid Structure for Image Restoration in Adverse Weather Conditions

arXiv 2023

2023

Deep Face Restoration: A Survey

arXiv 2022

2022

Ultra-High-Definition Low-Light Image Enhancement: A Benchmark and Transformer-Based Method

arXiv 2022

2022

Tokens-to-Token ViT: Training Vision Transformers from Scratch on ImageNet

ICCV 2021 10

2021

MC-Blur: A Comprehensive Benchmark for Image Deblurring

arXiv 2021

2021

Automated Concatenation of Embeddings for Structured Prediction

automated-concatenation-of-embeddings-for

2020

Affiliations

No known affiliations.

Frequent co-authors

10

from 26 papers