Deng Cai

Papers: 27

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile: Semantic Scholar

Attribution policy →

27papers

Authored papers

SeqPE: Transformer with Sequential Position Encoding

arXiv 2025

2025

The End of Manual Decoding: Towards Truly End-to-End Language Models

arXiv 2025

2025

MagicID: Hybrid Preference Optimization for ID-Consistent and Dynamic-Preserved Video Customization

ICCV 2025

2025

Retrieval is Accurate Generation

arXiv 2024

2024

Inferflow: an Efficient and Highly Configurable Inference Engine for Large Language Models

arXiv 2024

2024

ChartMimic: Evaluating LMM's Cross-Modal Reasoning Capability via Chart-to-Code Generation

arXiv 2024

2024

A Thorough Examination of Decoding Methods in the Era of LLMs

arXiv 2024

2024

FlexCAD: Unified and Versatile Controllable CAD Generation with Fine-tuned Large Language Models

arXiv 2024

2024

LoRA-Composer: Leveraging Low-Rank Adaptation for Multi-Concept Customization in Training-Free Diffusion Models

arXiv 2024

2024

A Survey on the Honesty of Large Language Models

arXiv 2024

2024

DOCBENCH: A Benchmark for Evaluating LLM-based Document Reading Systems

arXiv 2024

2024

Unchosen Experts Can Contribute Too: Unleashing MoE Models' Power by Self-Contrast

arXiv 2024

2024

Disperse-Then-Merge: Pushing the Limits of Instruction Tuning via Alignment Tax Reduction

arXiv 2024

2024

Consecutive Batch Model Editing with HooK Layers

arXiv 2024

2024

Siren's Song in the AI Ocean: A Survey on Hallucination in Large Language Models

arXiv 2023

2023

TeCH: Text-guided Reconstruction of Lifelike Clothed Humans

arXiv 2023

2023

MonoNeRD: NeRF-like Representations for Monocular 3D Object Detection

ICCV 2023 1

2023

StrategyLLM: Large Language Models as Strategy Generators, Executors, Optimizers, and Evaluators for Problem Solving

arXiv 2023

2023

A Frustratingly Simple Decoding Method for Neural Text Generation

arXiv 2023

2023

Reasons to Reject? Aligning Language Models with Judgments

arXiv 2023

2023

One-shot Implicit Animatable Avatars with Model-based Priors

ICCV 2023 1

2022

Large Language Models Meet Harry Potter: A Bilingual Dataset for Aligning Dialogue Agents with Characters

arXiv 2022

2022

Learning to Break the Loop: Analyzing and Mitigating Repetitions for Neural Text Generation

arXiv 2022

2022

CrossFormer: A Versatile Vision Transformer Hinging on Cross-scale Attention

crossformer-a-versatile-vision-transformer-1

2021

Multi-Task Pre-Training for Plug-and-Play Task-Oriented Dialogue System

ACL 2022 5

2021

Exploiting Reasoning Chains for Multi-hop Science Question Answering

Findings (EMNLP) 2021 11

2021

Adversarial Mutual Information for Text Generation

ICML 2020 1

2020

Affiliations

No known affiliations.

Frequent co-authors

from 27 papers

Shuming Shi

Wai Lam

Chufan Shi

Huayang Li

Leyang Cui

Wei Bi

Wenxiao Wang

Yujiu Yang

Boxi Wu

Cheng Yang