Baolin Peng

Papers: 22

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile: Semantic Scholar

Attribution policy →

22papers

Authored papers

Orchard: An Open-Source Agentic Modeling Framework

arXiv 2026

2026

Magma: A Foundation Model for Multimodal AI Agents

CVPR 2025 1

2025

Reinforcement Learning for Reasoning in Large Language Models with One Training Example

arXiv 2025

2025

GUI-Actor: Coordinate-Free Visual Grounding for GUI Agents

arXiv 2025

2025

ThetaEvolve: Test-time Learning on Open Problems

arXiv 2025

2025

Adapting Web Agents with Synthetic Supervision

arXiv 2025

2025

On the Emergence of Thinking in LLMs I: Searching for the Right Intuition

arXiv 2025

2025

Decoder-Hybrid-Decoder Architecture for Efficient Reasoning with Long Generation

arXiv 2025

2025

ExACT: Teaching AI Agents to Explore with Reflective-MCTS and Exploratory Learning

arXiv 2024

2024

Toward Self-Improvement of LLMs via Imagination, Searching, and Criticizing

arXiv 2024

2024

Self-Tuning: Instructing LLMs to Effectively Acquire New Knowledge through Self-Teaching

arXiv 2024

2024

Guiding Large Language Models via Directional Stimulus Prompting

guiding-large-language-models-via-directional

2023

Sub-Sentence Encoder: Contrastive Learning of Propositional Semantic Representations

arXiv 2023

2023

The Trickle-down Impact of Reward (In-)consistency on RLHF

arXiv 2023

2023

Instruction Tuning with GPT-4

arXiv 2023

2023

Teaching Language Models to Self-Improve through Interactive Demonstrations

arXiv 2023

2023

Self-Checker: Plug-and-Play Modules for Fact-Checking with Large Language Models

arXiv 2023

2023

Evaluating the Instruction-Following Robustness of Large Language Models to Prompt Injection

arXiv 2023

2023

Z-Code++: A Pre-trained Language Model Optimized for Abstractive Summarization

arXiv 2022

2022

GODEL: Large-Scale Pre-Training for Goal-Directed Dialog

arXiv 2022

2022

ConvLab-3: A Flexible Dialogue System Toolkit Based on a Unified Data Format

arXiv 2022

2022

Few-shot Natural Language Generation for Task-Oriented Dialog

Findings of the Association for Computational Linguistics 2020

2020

Affiliations

No known affiliations.

Frequent co-authors

from 22 papers

Jianfeng Gao

Pengcheng He

Michel Galley

Hao Cheng

Qianhui Wu

Yelong Shen

Zhou Yu

Dong Yu

Haitao Mi

Lars Liden