0

Shuming Shi

Papers
26

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile
Semantic Scholar
Attribution policy →
26papers

Authored papers

26

Quantitative Analysis of Performance Drop in DeepSeek Model Quantization

arXiv 2025

2025

Retrieval is Accurate Generation

arXiv 2024

2024

Knowledge Verification to Nip Hallucination in the Bud

arXiv 2024

2024

Salute the Classic: Revisiting Challenges of Machine Translation in the Age of Large Language Models

arXiv 2024

2024

Disperse-Then-Merge: Pushing the Limits of Instruction Tuning via Alignment Tax Reduction

arXiv 2024

2024

Benchmarking LLMs via Uncertainty Quantification

arXiv 2024

2024

Inferflow: an Efficient and Highly Configurable Inference Engine for Large Language Models

arXiv 2024

2024

StrategyLLM: Large Language Models as Strategy Generators, Executors, Optimizers, and Evaluators for Problem Solving

arXiv 2023

2023

A Frustratingly Simple Decoding Method for Neural Text Generation

arXiv 2023

2023

Leveraging Word Guessing Games to Assess the Intelligence of Large Language Models

arXiv 2023

2023

IMTLab: An Open-Source Platform for Building, Evaluating, and Diagnosing Interactive Machine Translation Systems

arXiv 2023

2023

DoG-Instruct: Towards Premium Instruction-Tuning Data via Text-Grounded Instruction Wrapping

arXiv 2023

2023

GPT-4 Is Too Smart To Be Safe: Stealthy Chat with LLMs via Cipher

arXiv 2023

2023

Macaw-LLM: Multi-Modal Language Modeling with Image, Audio, Video, and Text Integration

arXiv 2023

2023

Siren's Song in the AI Ocean: A Survey on Hallucination in Large Language Models

arXiv 2023

2023

Encouraging Divergent Thinking in Large Language Models through Multi-Agent Debate

arXiv 2023

2023

Is ChatGPT A Good Translator? Yes With GPT-4 As The Engine

arXiv 2023

2023

MAGE: Machine-generated Text Detection in the Wild

arXiv 2023

2023

Exploring Human-Like Translation Strategy with Large Language Models

arXiv 2023

2023

Reasons to Reject? Aligning Language Models with Judgments

arXiv 2023

2023

Explore-Instruct: Enhancing Domain-Specific Instruction Coverage through Active Exploration

arXiv 2023

2023

On the Evaluation Metrics for Paraphrase Generation

arXiv 2022

2022

Bridging the Data Gap between Training and Inference for Unsupervised Neural Machine Translation

ACL 2022 5

2022

Exploring and Adapting Chinese GPT to Pinyin Input Method

ACL 2022 5

2022

On the Complementarity between Pre-Training and Back-Translation for Neural Machine Translation

Findings (EMNLP) 2021 11

2021

On the Copying Behaviors of Pre-Training for Neural Machine Translation

Findings (ACL) 2021 8

2021

Affiliations

No known affiliations.

Frequent co-authors

10

from 26 papers