Yu Sun
- Papers: 28
Authored papers
Learning to Discover at Test Time
arXiv 2026
CLEAR: Unlocking Generative Potential for Degraded Image Understanding in Unified Multimodal Models
arXiv 2026
KnowRL: Boosting LLM Reasoning via Reinforcement Learning with Minimal-Sufficient Knowledge Guidance
arXiv 2026
End-to-End Test-Time Training for Long Context
arXiv 2025
CritiQ: Mining Data Quality Criteria from Human Preferences
arXiv 2025
Curiosity-Driven Reinforcement Learning from Human Feedback
arXiv 2025
One-Minute Video Generation with Test-Time Training
CVPR 2025 1
ReasonMed: A 370K Multi-Agent Generated Dataset for Advancing Medical Reasoning
arXiv 2025
Learning to (Learn at Test Time): RNNs with Expressive Hidden States
arXiv 2024
MA-RLHF: Reinforcement Learning from Human Feedback with Macro Actions
arXiv 2024
F-Eval: Assessing Fundamental Abilities with Refined Evaluation Methods
arXiv 2024
Autoregressive Pre-Training on Pixels and Texts
arXiv 2024
TokenHMR: Advancing Human Mesh Recovery with a Tokenized Pose Representation
CVPR 2024 1
CoLLiE: Collaborative Training of Large Language Models in an Efficient Way
arXiv 2023
Learning to (Learn at Test Time)
arXiv 2023
Tool-Augmented Reward Modeling
arXiv 2023
Test-Time Training on Nearest Neighbors for Large Language Models
arXiv 2023
ERNIE-Code: Beyond English-Centric Cross-lingual Pretraining for Programming Languages
arXiv 2022
ERNIE-Layout: Layout Knowledge Enhanced Pre-training for Visually-rich Document Understanding
arXiv 2022
ERNIE 3.0 Titan: Exploring Larger-scale Knowledge Enhanced Pre-training for Language Understanding and Generation
arXiv 2021
Putting People in their Place: Monocular Regression of 3D People in Depth
CVPR 2022 1
ERNIE-Gram: Pre-Training with Explicitly N-Gram Masked Language Modeling for Natural Language Understanding
NAACL 2021 4
ERNIE-Doc: A Retrospective Long-Document Modeling Transformer
ACL 2021 5
ERNIE: Enhanced Representation through Knowledge Integration
arXiv 2019
ERNIE 2.0: A Continual Pre-training Framework for Language Understanding
arXiv 2019
Test-Time Training with Self-Supervision for Generalization under Distribution Shifts
arXiv 2019
On Calibration of Modern Neural Networks
ICML 2017
Deep Networks with Stochastic Depth
arXiv 2016
Affiliations
Frequent co-authors
10 from 28 papers