Qi Chen
- Papers
- 27
Cite
Notes
Only stored in your browser.
Authored papers
27AcademiClaw: When Students Set Challenges for AI Agents
arXiv 2026
MOVA: Towards Scalable and Synchronized Video-Audio Generation
arXiv 2026
HY-World 2.0: A Multi-Modal World Model for Reconstructing, Generating, and Simulating 3D Worlds
arXiv 2026
RetroInfer: A Vector-Storage Approach for Scalable Long-Context LLM Inference
arXiv 2025
ViSAudio: End-to-End Video-Driven Binaural Spatial Audio Generation
arXiv 2025
InMind: Evaluating LLMs in Capturing and Applying Individual Human Reasoning Styles
arXiv 2025
Efficient Response Generation Method Selection for Fine-Tuning Large Language Models
arXiv 2025
EpiCoder: Encompassing Diversity and Complexity in Code Generation
arXiv 2025
Beyond Prompt Content: Enhancing LLM Performance via Content-Format Integrated Prompt Optimization
arXiv 2025
ScaleMAI: Accelerating the Development of Trusted Datasets and AI Models
arXiv 2025
AniTalker: Animate Vivid and Diverse Talking Faces through Identity-Decoupled Facial Motion Encoding
arXiv 2024
MS MARCO Web Search: a Large-scale Information-rich Web Dataset with Millions of Real Click Labels
arXiv 2024
KMM: Key Frame Mask Mamba for Extended Motion Generation
arXiv 2024
Attention-driven GUI Grounding: Leveraging Pretrained Multimodal Large Language Models without Fine-Tuning
arXiv 2024
InfiniMotion: Mamba Boosts Memory in Transformer for Arbitrary Long Motion Generation
arXiv 2024
MMCLIP: Cross-modal Attention Masked Modelling for Medical Language-Image Pre-Training
arXiv 2024
Text-Driven Tumor Synthesis
arXiv 2024
RetrievalAttention: Accelerating Long-Context LLM Inference via Vector Retrieval
arXiv 2024
A Survey of Medical Vision-and-Language Applications and Their Techniques
arXiv 2024
Integrative Decoding: Improve Factuality via Implicit Self-consistency
arXiv 2024
Likelihood-Based Text-to-Image Evaluation with Patch-Level Perceptual and Semantic Credit Assignment
arXiv 2023
WebVLN: Vision-and-Language Navigation on Websites
arXiv 2023
Prompt Switch: Efficient CLIP Adaptation for Text-Video Retrieval
ICCV 2023 1
IRGen: Generative Modeling for Image Retrieval
arXiv 2023
Distill-VQ: Learning Retrieval Oriented Vector Quantization By Distilling Knowledge from Dense Embeddings
arXiv 2022
SPANN: Highly-efficient Billion-scale Approximate Nearest Neighbor Search
spann-highly-efficient-billion-scale-1
MonaLog: a Lightweight System for Natural Language Inference Based on Monotonicity
SCiL 2020 1
Affiliations
Frequent co-authors
10from 27 papers