Xin Chen
- Papers
- 35
Cite
Notes
Only stored in your browser.
Authored papers
35Fish Audio S2 Technical Report
arXiv 2026
Project Imaging-X: A Survey of 1000+ Open-Access Medical Imaging Datasets for Foundation Model Development
arXiv 2026
LongCat-Flash-Thinking-2601 Technical Report
arXiv 2026
MASQuant: Modality-Aware Smoothing Quantization for Multimodal Large Language Models
arXiv 2026
OpenVLThinkerV2: A Generalist Multimodal Reasoning Model for Multi-domain Visual Tasks
arXiv 2026
AgentArk: Distilling Multi-Agent Intelligence into a Single LLM Agent
arXiv 2026
BAPO: Boundary-Aware Policy Optimization for Reliable Agentic Search
arXiv 2026
Video-As-Prompt: Unified Semantic Control for Video Generation
arXiv 2025
SciReasoner: Laying the Scientific Reasoning Ground Across Disciplines
arXiv 2025
MTSQL-R1: Towards Long-Horizon Multi-Turn Text-to-SQL via Agentic Training
arXiv 2025
Baichuan-Omni-1.5 Technical Report
arXiv 2025
Bridging Your Imagination with Audio-Video Generation via a Unified Director
arXiv 2025
Probing Scientific General Intelligence of LLMs with Scientist-Aligned Workflows
arXiv 2025
Baichuan-M1: Pushing the Medical Capability of Large Language Models
arXiv 2025
MeshAnything: Artist-Created Mesh Generation with Autoregressive Transformers
arXiv 2024
MeshXL: Neural Coordinate Field for Generative 3D Foundation Models
arXiv 2024
MVPaint: Synchronized Multi-View Diffusion for Painting Anything 3D
CVPR 2025 1
Content-Based Collaborative Generation for Recommender Systems
arXiv 2024
Defensive Unlearning with Adversarial Training for Robust Concept Erasure in Diffusion Models
arXiv 2024
GraphHash: Graph Clustering Enables Parameter Efficiency in Recommender Systems
arXiv 2024
AppAgent: Multimodal Agents as Smartphone Users
arXiv 2023
MotionGPT: Human Motion as a Foreign Language
NeurIPS 2023 11
Michelangelo: Conditional 3D Shape Generation based on Shape-Image-Text Aligned Latent Representation
michelangelo-conditional-3d-shape-generation
Sketched Ridgeless Linear Regression: The Role of Downsampling
arXiv 2023
Paint3D: Paint Anything 3D with Lighting-Less Texture Diffusion Models
CVPR 2024 1
LL3DA: Visual Interactive Instruction Tuning for Omni-3D Understanding, Reasoning, and Planning
arXiv 2023
Exploring Lightweight Hierarchical Vision Transformers for Efficient Visual Tracking
ICCV 2023 1
QA-LoRA: Quantization-Aware Low-Rank Adaptation of Large Language Models
arXiv 2023
To Generate or Not? Safety-Driven Unlearned Diffusion Models Are Still Easy To Generate Unsafe Images ... For Now
arXiv 2023
M3DBench: Let's Instruct Large Models with Multi-modal 3D Prompts
arXiv 2023
Text-Visual Prompting for Efficient 2D Temporal Video Grounding
CVPR 2023 1
Pangu-Weather: A 3D High-Resolution Model for Fast and Accurate Global Weather Forecast
arXiv 2022
Contrastive Deep Supervision
arXiv 2022
PC-DARTS: Partial Channel Connections for Memory-Efficient Architecture Search
ICLR 2020 1
Beyond saliency: understanding convolutional neural networks from saliency prediction on layer-wise relevance propagation
arXiv 2017
Affiliations
Frequent co-authors
10from 35 papers