Hua Wu
- Papers
- 25
Cite
Notes
Only stored in your browser.
Authored papers
25KnowRL: Boosting LLM Reasoning via Reinforcement Learning with Minimal-Sufficient Knowledge Guidance
arXiv 2026
MEnvAgent: Scalable Polyglot Environment Construction for Verifiable Software Engineering
arXiv 2026
Curiosity-Driven Reinforcement Learning from Human Feedback
arXiv 2025
MA-RLHF: Reinforcement Learning from Human Feedback with Macro Actions
arXiv 2024
Autoregressive Pre-Training on Pixels and Texts
arXiv 2024
Towards Boosting Many-to-Many Multilingual Machine Translation with Large Language Models
arXiv 2024
Investigating the Factual Knowledge Boundary of Large Language Models with Retrieval Augmentation
arXiv 2023
Tool-Augmented Reward Modeling
arXiv 2023
ERNIE-Code: Beyond English-Centric Cross-lingual Pretraining for Programming Languages
arXiv 2022
ERNIE-Layout: Layout Knowledge Enhanced Pre-training for Visually-rich Document Understanding
arXiv 2022
Long Time No See! Open-Domain Conversation with Long-Term Persona Memory
Findings (ACL) 2022 5
DuReader_retrieval: A Large-scale Chinese Benchmark for Passage Retrieval from Web Search Engine
arXiv 2022
Q-TOD: A Query-driven Task-oriented Dialogue System
arXiv 2022
CDConv: A Benchmark for Contradiction Detection in Chinese Conversations
arXiv 2022
ERNIE 3.0 Titan: Exploring Larger-scale Knowledge Enhanced Pre-training for Language Understanding and Generation
arXiv 2021
Building Chinese Biomedical Language Models via Multi-Level Text Discrimination
arXiv 2021
DuRecDial 2.0: A Bilingual Parallel Corpus for Conversational Recommendation
EMNLP 2021 11
UNIMO: Towards Unified-Modal Understanding and Generation via Cross-Modal Contrastive Learning
ACL 2021 5
ERNIE-Gram: Pre-Training with Explicitly N-Gram Masked Language Modeling for Natural Language Understanding
NAACL 2021 4
ERNIE-Doc: A Retrospective Long-Document Modeling Transformer
ACL 2021 5
SKEP: Sentiment Knowledge Enhanced Pre-training for Sentiment Analysis
skep-sentiment-knowledge-enhanced-pre-1
ERNIE: Enhanced Representation through Knowledge Integration
arXiv 2019
ERNIE 2.0: A Continual Pre-training Framework for Language Understanding
arXiv 2019
PLATO: Pre-trained Dialogue Generation Model with Discrete Latent Variable
plato-pre-trained-dialogue-generation-model-1
Know More about Each Other: Evolving Dialogue Strategy via Compound Assessment
know-more-about-each-other-evolving-dialogue
Affiliations
Frequent co-authors
10from 25 papers