0

Keming Lu

Papers
19

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile
Semantic Scholar
Attribution policy →
19papers

Authored papers

19

WorldPM: Scaling Human Preference Modeling

arXiv 2025

2025

MARGE: Improving Math Reasoning for LLMs with Guided Exploration

arXiv 2025

2025

Qwen2.5 Technical Report

arXiv 2024

2024

Qwen2 Technical Report

arXiv 2024

2024

Online Merging Optimizers for Boosting Rewards and Mitigating Tax in Alignment

arXiv 2024

2024

LLM Critics Help Catch Bugs in Mathematics: Towards a Better Mathematical Verifier with Natural Language Feedback

arXiv 2024

2024

Aligning Large Language Models via Self-Steering Optimization

arXiv 2024

2024

PyramidKV: Dynamic KV Cache Compression based on Pyramidal Information Funneling

arXiv 2024

2024

Self-play with Execution Feedback: Improving Instruction-following Capabilities of Large Language Models

arXiv 2024

2024

Large Language Models are Superpositions of All Characters: Attaining Arbitrary Role-play via Self-Alignment

arXiv 2024

2024

Towards a Unified View of Preference Learning for Large Language Models: A Survey

arXiv 2024

2024

ProcessBench: Identifying Process Errors in Mathematical Reasoning

arXiv 2024

2024

Predicting Rewards Alongside Tokens: Non-disruptive Parameter Insertion for Efficient Inference Intervention in Large Language Model

arXiv 2024

2024

Qwen Technical Report

arXiv 2023

2023

MuggleMath: Assessing the Impact of Query and Response Augmentation on Math Reasoning

arXiv 2023

2023

Self-Evolved Diverse Data Sampling for Efficient Instruction Tuning

arXiv 2023

2023

#InsTag: Instruction Tagging for Analyzing Supervised Fine-tuning of Large Language Models

arXiv 2023

2023

Summarization as Indirect Supervision for Relation Extraction

arXiv 2022

2022

Multi-hop Evidence Retrieval for Cross-document Relation Extraction

arXiv 2022

2022

Affiliations

No known affiliations.

Frequent co-authors

10

from 19 papers