Hua Wu

Papers
25

Authored papers

25

KnowRL: Boosting LLM Reasoning via Reinforcement Learning with Minimal-Sufficient Knowledge Guidance

arXiv 2026

2026

MEnvAgent: Scalable Polyglot Environment Construction for Verifiable Software Engineering

arXiv 2026

2026

Curiosity-Driven Reinforcement Learning from Human Feedback

arXiv 2025

2025

MA-RLHF: Reinforcement Learning from Human Feedback with Macro Actions

arXiv 2024

2024

Autoregressive Pre-Training on Pixels and Texts

arXiv 2024

2024

Towards Boosting Many-to-Many Multilingual Machine Translation with Large Language Models

arXiv 2024

2024

Investigating the Factual Knowledge Boundary of Large Language Models with Retrieval Augmentation

arXiv 2023

2023

Tool-Augmented Reward Modeling

arXiv 2023

2023

ERNIE-Code: Beyond English-Centric Cross-lingual Pretraining for Programming Languages

arXiv 2022

2022

ERNIE-Layout: Layout Knowledge Enhanced Pre-training for Visually-rich Document Understanding

arXiv 2022

2022

Long Time No See! Open-Domain Conversation with Long-Term Persona Memory

Findings (ACL) 2022 5

2022

DuReader_retrieval: A Large-scale Chinese Benchmark for Passage Retrieval from Web Search Engine

arXiv 2022

2022

Q-TOD: A Query-driven Task-oriented Dialogue System

arXiv 2022

2022

CDConv: A Benchmark for Contradiction Detection in Chinese Conversations

arXiv 2022

2022

ERNIE 3.0 Titan: Exploring Larger-scale Knowledge Enhanced Pre-training for Language Understanding and Generation

arXiv 2021

2021

Building Chinese Biomedical Language Models via Multi-Level Text Discrimination

arXiv 2021

2021

DuRecDial 2.0: A Bilingual Parallel Corpus for Conversational Recommendation

EMNLP 2021 11

2021

UNIMO: Towards Unified-Modal Understanding and Generation via Cross-Modal Contrastive Learning

ACL 2021 5

2020

ERNIE-Gram: Pre-Training with Explicitly N-Gram Masked Language Modeling for Natural Language Understanding

NAACL 2021 4

2020

ERNIE-Doc: A Retrospective Long-Document Modeling Transformer

ACL 2021 5

2020

SKEP: Sentiment Knowledge Enhanced Pre-training for Sentiment Analysis

2020

ERNIE: Enhanced Representation through Knowledge Integration

arXiv 2019

2019

ERNIE 2.0: A Continual Pre-training Framework for Language Understanding

arXiv 2019

2019

PLATO: Pre-trained Dialogue Generation Model with Discrete Latent Variable

2019

Know More about Each Other: Evolving Dialogue Strategy via Compound Assessment

2019

Affiliations

No known affiliations.

Frequent co-authors

10

from 25 papers