Keming Lu
- Papers
- 19
Cite
Notes
Only stored in your browser.
Authored papers
19WorldPM: Scaling Human Preference Modeling
arXiv 2025
MARGE: Improving Math Reasoning for LLMs with Guided Exploration
arXiv 2025
Qwen2.5 Technical Report
arXiv 2024
Qwen2 Technical Report
arXiv 2024
Online Merging Optimizers for Boosting Rewards and Mitigating Tax in Alignment
arXiv 2024
LLM Critics Help Catch Bugs in Mathematics: Towards a Better Mathematical Verifier with Natural Language Feedback
arXiv 2024
Aligning Large Language Models via Self-Steering Optimization
arXiv 2024
PyramidKV: Dynamic KV Cache Compression based on Pyramidal Information Funneling
arXiv 2024
Self-play with Execution Feedback: Improving Instruction-following Capabilities of Large Language Models
arXiv 2024
Large Language Models are Superpositions of All Characters: Attaining Arbitrary Role-play via Self-Alignment
arXiv 2024
Towards a Unified View of Preference Learning for Large Language Models: A Survey
arXiv 2024
ProcessBench: Identifying Process Errors in Mathematical Reasoning
arXiv 2024
Predicting Rewards Alongside Tokens: Non-disruptive Parameter Insertion for Efficient Inference Intervention in Large Language Model
arXiv 2024
Qwen Technical Report
arXiv 2023
MuggleMath: Assessing the Impact of Query and Response Augmentation on Math Reasoning
arXiv 2023
Self-Evolved Diverse Data Sampling for Efficient Instruction Tuning
arXiv 2023
#InsTag: Instruction Tagging for Analyzing Supervised Fine-tuning of Large Language Models
arXiv 2023
Summarization as Indirect Supervision for Relation Extraction
arXiv 2022
Multi-hop Evidence Retrieval for Cross-document Relation Extraction
arXiv 2022
Affiliations
Frequent co-authors
10from 19 papers