0

Tao Chen

Papers
31

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile
Semantic Scholar
Attribution policy →
31papers

Authored papers

31

InternAgent-1.5: A Unified Agentic Framework for Long-Horizon Autonomous Scientific Discovery

arXiv 2026

2026

Reward Hacking in the Era of Large Models: Mechanisms, Emergent Misalignment, Challenges

arXiv 2026

2026

CurveStream: Boosting Streaming Video Understanding in MLLMs via Curvature-Aware Hierarchical Visual Memory Management

arXiv 2026

2026

SIGMA-PPG: Statistical-prior Informed Generative Masking Architecture for PPG Foundation Model

arXiv 2026

2026

OmniCaptioner: One Captioner to Rule Them All

arXiv 2025

2025

Dolphin: Closed-loop Open-ended Auto-research through Thinking, Practice, and Feedback

arXiv 2025

2025

MME-Reasoning: A Comprehensive Benchmark for Logical Reasoning in MLLMs

arXiv 2025

2025

Sparse-vDiT: Unleashing the Power of Sparse Attention to Accelerate Video Diffusion Transformers

arXiv 2025

2025

Medal S: Spatio-Textual Prompt Model for Medical Segmentation

arXiv 2025

2025

RegionE: Adaptive Region-Aware Generation for Efficient Image Editing

arXiv 2025

2025

PRING: Rethinking Protein-Protein Interaction Prediction from Pairs to Graphs

arXiv 2025

2025

Training-Free Adaptive Diffusion with Bounded Difference Approximation Strategy

arXiv 2024

2024

MeshXL: Neural Coordinate Field for Generative 3D Foundation Models

arXiv 2024

2024

MovieLLM: Enhancing Long Video Understanding with AI-Generated Movies

arXiv 2024

2024

Biology Instructions: A Dataset and Benchmark for Multi-Omics Sequence Understanding Capability of Large Language Models

arXiv 2024

2024

GeoX: Geometric Problem Solving Through Unified Formalized Vision-Language Pre-training

arXiv 2024

2024

One-Shot Learning for Pose-Guided Person Image Synthesis in the Wild

arXiv 2024

2024

Once for Both: Single Stage of Importance and Sparsity Search for Vision Transformer Compression

CVPR 2024 1

2024

MotionGPT: Human Motion as a Foreign Language

NeurIPS 2023 11

2023

ReSimAD: Zero-Shot 3D Domain Transfer for Autonomous Driving with Source Reconstruction and Target Simulation

arXiv 2023

2023

LL3DA: Visual Interactive Instruction Tuning for Omni-3D Understanding, Reasoning, and Planning

arXiv 2023

2023

Michelangelo: Conditional 3D Shape Generation based on Shape-Image-Text Aligned Latent Representation

michelangelo-conditional-3d-shape-generation

2023

Make-A-Character: High Quality Text-to-3D Character Generation within Minutes

arXiv 2023

2023

QuadSwarm: A Modular Multi-Quadrotor Simulator for Deep Reinforcement Learning with Direct Thrust Control

arXiv 2023

2023

Performance-aware Approximation of Global Channel Pruning for Multitask CNNs

arXiv 2023

2023

MMICT: Boosting Multi-Modal Fine-Tuning with In-Context Examples

arXiv 2023

2023

StyleAvatar3D: Leveraging Image-Text Diffusion Models for High-Fidelity 3D Avatar Generation

arXiv 2023

2023

M3DBench: Let's Instruct Large Models with Multi-modal 3D Prompts

arXiv 2023

2023

Visual Dexterity: In-Hand Reorientation of Novel and Complex Object Shapes

arXiv 2022

2022

UniKeyphrase: A Unified Extraction and Generation Framework for Keyphrase Prediction

Findings (ACL) 2021 8

2021

SParC: Cross-Domain Semantic Parsing in Context

sparc-cross-domain-semantic-parsing-in-1

2019

Affiliations

No known affiliations.

Frequent co-authors

10

from 31 papers