Baolin Peng
- Papers: 22
Authored papers (22)
Orchard: An Open-Source Agentic Modeling Framework
arXiv 2026
Magma: A Foundation Model for Multimodal AI Agents
CVPR 2025
Reinforcement Learning for Reasoning in Large Language Models with One Training Example
arXiv 2025
GUI-Actor: Coordinate-Free Visual Grounding for GUI Agents
arXiv 2025
ThetaEvolve: Test-time Learning on Open Problems
arXiv 2025
Adapting Web Agents with Synthetic Supervision
arXiv 2025
Decoder-Hybrid-Decoder Architecture for Efficient Reasoning with Long Generation
arXiv 2025
On the Emergence of Thinking in LLMs I: Searching for the Right Intuition
arXiv 2025
ExACT: Teaching AI Agents to Explore with Reflective-MCTS and Exploratory Learning
arXiv 2024
Toward Self-Improvement of LLMs via Imagination, Searching, and Criticizing
arXiv 2024
Self-Tuning: Instructing LLMs to Effectively Acquire New Knowledge through Self-Teaching
arXiv 2024
Instruction Tuning with GPT-4
arXiv 2023
Guiding Large Language Models via Directional Stimulus Prompting
Sub-Sentence Encoder: Contrastive Learning of Propositional Semantic Representations
arXiv 2023
Self-Checker: Plug-and-Play Modules for Fact-Checking with Large Language Models
arXiv 2023
Evaluating the Instruction-Following Robustness of Large Language Models to Prompt Injection
arXiv 2023
The Trickle-down Impact of Reward (In-)consistency on RLHF
arXiv 2023
Teaching Language Models to Self-Improve through Interactive Demonstrations
arXiv 2023
Z-Code++: A Pre-trained Language Model Optimized for Abstractive Summarization
arXiv 2022
GODEL: Large-Scale Pre-Training for Goal-Directed Dialog
arXiv 2022
ConvLab-3: A Flexible Dialogue System Toolkit Based on a Unified Data Format
arXiv 2022
Few-shot Natural Language Generation for Task-Oriented Dialog
Findings of the Association for Computational Linguistics 2020
Affiliations
Frequent co-authors
10 from 22 papers