Yiming Yang

CMU professor working on machine learning, NLP, and information retrieval; senior author on many LLM-evaluation and post-training papers.

Role: professor
Currently at: Carnegie Mellon University
Scholar: scholar.google.com/citations
Papers: 31

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile: scholar.google.com/citations

Attribution policy →

31papers

Authored papers

31

CodePDE: An Inference Framework for LLM-driven PDE Solver Generation

arXiv 2025

Deep Research: A Systematic Survey

arXiv 2025

Towards Community-Driven Agents for Machine Learning Engineering

arXiv 2025

Improving Retrieval-Augmented Generation through Multi-Agent Reinforcement Learning

arXiv 2025

CO-Bench: Benchmarking Language Model Agents in Algorithm Search for Combinatorial Optimization

arXiv 2025

Agentic-R1: Distilled Dual-Strategy Reasoning

arXiv 2025

TFG-Flow: Training-free Guidance in Multimodal Generative Flow

arXiv 2025

Enhancing Training Data Attribution with Representational Optimization

arXiv 2025

Regularized Langevin Dynamics for Combinatorial Optimization

arXiv 2025

Improve Vision Language Model Chain-of-thought Reasoning

arXiv 2024

Self-Play Preference Optimization for Language Model Alignment

arXiv 2024

Direct Preference Optimization of Video Large Multimodal Models from Language Model Reward

arXiv 2024

Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision

arXiv 2024

Inference Scaling Laws: An Empirical Analysis of Compute-Optimal Inference for Problem-Solving with Language Models

arXiv 2024

HaluEval-Wild: Evaluating Hallucinations of Language Models in the Wild

arXiv 2024

Principle-Driven Self-Alignment of Language Models from Scratch with Minimal Human Supervision

principle-driven-self-alignment-of-language

Let's Sample Step by Step: Adaptive-Consistency for Efficient Reasoning and Coding with LLMs

arXiv 2023

Active Retrieval Augmented Generation

arXiv 2023

SALMON: Self-Alignment with Instructable Reward Models

arXiv 2023

AutoMix: Automatically Mixing Language Models

arXiv 2023

A Neural PDE Solver with Temporal Stencil Modeling

arXiv 2023

Recitation-Augmented Language Models

arXiv 2022

Memory-assisted prompt editing to improve GPT-3 after deployment

arXiv 2022

Language Models of Code are Few-Shot Commonsense Learners

arXiv 2022

MobileBERT: a Compact Task-Agnostic BERT for Resource-Limited Devices

mobilebert-a-compact-task-agnostic-bert-for-1

Funnel-Transformer: Filtering out Sequential Redundancy for Efficient Language Processing

NeurIPS 2020 12

VIOLIN: A Large-Scale Dataset for Video-and-Language Inference

violin-a-large-scale-dataset-for-video-and-1

Transformer-XL: Attentive Language Models Beyond a Fixed-Length Context

transformer-xl-attentive-language-models-1

XLNet: Generalized Autoregressive Pretraining for Language Understanding

xlnet-generalized-autoregressive-pretraining-1

DARTS: Differentiable Architecture Search

darts-differentiable-architecture-search-1

Modeling Long- and Short-Term Temporal Patterns with Deep Neural Networks

arXiv 2017

Affiliations

Currently at

Carnegie Mellon University

professor · university lab

Frequent co-authors

10

from 31 papers

Zhiqing Sun

researcher

12 shared papers

Weiwei Sun

6 shared papers

Shanda Li

5 shared papers

Aman Madaan

4 shared papers

Chuang Gan

3 shared papers

Pranjal Aggarwal

3 shared papers

Quoc V. Le

3 shared papers

Sean Welleck

3 shared papers

Yikang Shen

3 shared papers

Zihang Dai

3 shared papers