Yiming Yang
CMU professor working on machine learning, NLP, and information retrieval; senior author on many LLM-evaluation and post-training papers.
- Role: professor
- Currently at: Carnegie Mellon University
- Scholar: scholar.google.com/citations
- Papers: 31
Authored papers (31)
TFG-Flow: Training-free Guidance in Multimodal Generative Flow
arXiv 2025
Towards Community-Driven Agents for Machine Learning Engineering
arXiv 2025
Improving Retrieval-Augmented Generation through Multi-Agent Reinforcement Learning
arXiv 2025
CO-Bench: Benchmarking Language Model Agents in Algorithm Search for Combinatorial Optimization
arXiv 2025
Enhancing Training Data Attribution with Representational Optimization
arXiv 2025
CodePDE: An Inference Framework for LLM-driven PDE Solver Generation
arXiv 2025
Deep Research: A Systematic Survey
arXiv 2025
Regularized Langevin Dynamics for Combinatorial Optimization
arXiv 2025
Agentic-R1: Distilled Dual-Strategy Reasoning
arXiv 2025
Improve Vision Language Model Chain-of-thought Reasoning
arXiv 2024
Self-Play Preference Optimization for Language Model Alignment
arXiv 2024
Direct Preference Optimization of Video Large Multimodal Models from Language Model Reward
arXiv 2024
Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision
arXiv 2024
Inference Scaling Laws: An Empirical Analysis of Compute-Optimal Inference for Problem-Solving with Language Models
arXiv 2024
HaluEval-Wild: Evaluating Hallucinations of Language Models in the Wild
arXiv 2024
Active Retrieval Augmented Generation
arXiv 2023
Principle-Driven Self-Alignment of Language Models from Scratch with Minimal Human Supervision
SALMON: Self-Alignment with Instructable Reward Models
arXiv 2023
AutoMix: Automatically Mixing Language Models
arXiv 2023
Let's Sample Step by Step: Adaptive-Consistency for Efficient Reasoning and Coding with LLMs
arXiv 2023
A Neural PDE Solver with Temporal Stencil Modeling
arXiv 2023
Recitation-Augmented Language Models
arXiv 2022
Memory-assisted prompt editing to improve GPT-3 after deployment
arXiv 2022
Language Models of Code are Few-Shot Commonsense Learners
arXiv 2022
MobileBERT: a Compact Task-Agnostic BERT for Resource-Limited Devices
Funnel-Transformer: Filtering out Sequential Redundancy for Efficient Language Processing
NeurIPS 2020
VIOLIN: A Large-Scale Dataset for Video-and-Language Inference
Transformer-XL: Attentive Language Models Beyond a Fixed-Length Context
XLNet: Generalized Autoregressive Pretraining for Language Understanding
DARTS: Differentiable Architecture Search
Modeling Long- and Short-Term Temporal Patterns with Deep Neural Networks
arXiv 2017
Frequent co-authors
10 from 31 papers