Deng Cai
- Papers
- 27
Cite
Notes
Only stored in your browser.
Authored papers
27The End of Manual Decoding: Towards Truly End-to-End Language Models
arXiv 2025
SeqPE: Transformer with Sequential Position Encoding
arXiv 2025
MagicID: Hybrid Preference Optimization for ID-Consistent and Dynamic-Preserved Video Customization
ICCV 2025
Retrieval is Accurate Generation
arXiv 2024
A Thorough Examination of Decoding Methods in the Era of LLMs
arXiv 2024
Disperse-Then-Merge: Pushing the Limits of Instruction Tuning via Alignment Tax Reduction
arXiv 2024
Consecutive Batch Model Editing with HooK Layers
arXiv 2024
Inferflow: an Efficient and Highly Configurable Inference Engine for Large Language Models
arXiv 2024
ChartMimic: Evaluating LMM's Cross-Modal Reasoning Capability via Chart-to-Code Generation
arXiv 2024
FlexCAD: Unified and Versatile Controllable CAD Generation with Fine-tuned Large Language Models
arXiv 2024
LoRA-Composer: Leveraging Low-Rank Adaptation for Multi-Concept Customization in Training-Free Diffusion Models
arXiv 2024
A Survey on the Honesty of Large Language Models
arXiv 2024
DOCBENCH: A Benchmark for Evaluating LLM-based Document Reading Systems
arXiv 2024
Unchosen Experts Can Contribute Too: Unleashing MoE Models' Power by Self-Contrast
arXiv 2024
Siren's Song in the AI Ocean: A Survey on Hallucination in Large Language Models
arXiv 2023
StrategyLLM: Large Language Models as Strategy Generators, Executors, Optimizers, and Evaluators for Problem Solving
arXiv 2023
A Frustratingly Simple Decoding Method for Neural Text Generation
arXiv 2023
TeCH: Text-guided Reconstruction of Lifelike Clothed Humans
arXiv 2023
MonoNeRD: NeRF-like Representations for Monocular 3D Object Detection
ICCV 2023 1
Reasons to Reject? Aligning Language Models with Judgments
arXiv 2023
One-shot Implicit Animatable Avatars with Model-based Priors
ICCV 2023 1
Large Language Models Meet Harry Potter: A Bilingual Dataset for Aligning Dialogue Agents with Characters
arXiv 2022
Learning to Break the Loop: Analyzing and Mitigating Repetitions for Neural Text Generation
arXiv 2022
CrossFormer: A Versatile Vision Transformer Hinging on Cross-scale Attention
crossformer-a-versatile-vision-transformer-1
Exploiting Reasoning Chains for Multi-hop Science Question Answering
Findings (EMNLP) 2021 11
Multi-Task Pre-Training for Plug-and-Play Task-Oriented Dialogue System
ACL 2022 5
Adversarial Mutual Information for Text Generation
ICML 2020 1
Affiliations
Frequent co-authors
10from 27 papers