Yiming Yang

CMU professor working on machine learning, NLP, and information retrieval; senior author on many LLM-evaluation and post-training papers.

Role
professor
Papers
31

31 papers

Authored papers

31

TFG-Flow: Training-free Guidance in Multimodal Generative Flow

arXiv 2025

2025

Towards Community-Driven Agents for Machine Learning Engineering

arXiv 2025

2025

Improving Retrieval-Augmented Generation through Multi-Agent Reinforcement Learning

arXiv 2025

2025

CO-Bench: Benchmarking Language Model Agents in Algorithm Search for Combinatorial Optimization

arXiv 2025

2025

Enhancing Training Data Attribution with Representational Optimization

arXiv 2025

2025

CodePDE: An Inference Framework for LLM-driven PDE Solver Generation

arXiv 2025

2025

Deep Research: A Systematic Survey

arXiv 2025

2025

Regularized Langevin Dynamics for Combinatorial Optimization

arXiv 2025

2025

Agentic-R1: Distilled Dual-Strategy Reasoning

arXiv 2025

2025

Improve Vision Language Model Chain-of-thought Reasoning

arXiv 2024

2024

Self-Play Preference Optimization for Language Model Alignment

arXiv 2024

2024

Direct Preference Optimization of Video Large Multimodal Models from Language Model Reward

arXiv 2024

2024

Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision

arXiv 2024

2024

Inference Scaling Laws: An Empirical Analysis of Compute-Optimal Inference for Problem-Solving with Language Models

arXiv 2024

2024

HaluEval-Wild: Evaluating Hallucinations of Language Models in the Wild

arXiv 2024

2024

Active Retrieval Augmented Generation

arXiv 2023

2023

Principle-Driven Self-Alignment of Language Models from Scratch with Minimal Human Supervision

2023

SALMON: Self-Alignment with Instructable Reward Models

arXiv 2023

2023

AutoMix: Automatically Mixing Language Models

arXiv 2023

2023

Let's Sample Step by Step: Adaptive-Consistency for Efficient Reasoning and Coding with LLMs

arXiv 2023

2023

A Neural PDE Solver with Temporal Stencil Modeling

arXiv 2023

2023

Recitation-Augmented Language Models

arXiv 2022

2022

Memory-assisted prompt editing to improve GPT-3 after deployment

arXiv 2022

2022

Language Models of Code are Few-Shot Commonsense Learners

arXiv 2022

2022

MobileBERT: a Compact Task-Agnostic BERT for Resource-Limited Devices

2020

Funnel-Transformer: Filtering out Sequential Redundancy for Efficient Language Processing

NeurIPS 2020 12

2020

VIOLIN: A Large-Scale Dataset for Video-and-Language Inference

2020

Transformer-XL: Attentive Language Models Beyond a Fixed-Length Context

2019

XLNet: Generalized Autoregressive Pretraining for Language Understanding

2019

DARTS: Differentiable Architecture Search

2018

Modeling Long- and Short-Term Temporal Patterns with Deep Neural Networks

arXiv 2017

2017

Affiliations

Currently at

Carnegie Mellon University

professor · university lab

Frequent co-authors

10

from 31 papers