Dian Yu
- Papers
- 14
Cite
Notes
Only stored in your browser.
Authored papers
14Self-Rewarding Vision-Language Model via Reasoning Decomposition
arXiv 2025
DeepMath-103K: A Large-Scale, Challenging, Decontaminated, and Verifiable Mathematical Dataset for Advancing Reasoning
arXiv 2025
Evolving Language Models without Labels: Majority Drives Selection, Novelty Promotes Variation
arXiv 2025
Don't Get Lost in the Trees: Streamlining LLM Reasoning by Overcoming Tree Search Exploration Pitfalls
arXiv 2025
Toward Self-Improvement of LLMs via Imagination, Searching, and Criticizing
arXiv 2024
MathChat: Benchmarking Mathematical Reasoning and Instruction Following in Multi-Turn Interactions
arXiv 2024
DOTS: Learning to Reason Dynamically in LLMs via Optimal Reasoning Trajectories Search
arXiv 2024
Scaling Synthetic Data Creation with 1,000,000,000 Personas
arXiv 2024
Learn Beyond The Answer: Training Language Models with Reflection for Mathematical Reasoning
arXiv 2024
Salute the Classic: Revisiting Challenges of Machine Translation in the Age of Large Language Models
arXiv 2024
Tree of Thoughts: Deliberate Problem Solving with Large Language Models
tree-of-thoughts-deliberate-problem-solving
Sub-Sentence Encoder: Contrastive Learning of Propositional Semantic Representations
arXiv 2023
ReAct: Synergizing Reasoning and Acting in Language Models
arXiv 2022
CLUE: A Chinese Language Understanding Evaluation Benchmark
COLING 2020 8
Affiliations
Frequent co-authors
10from 14 papers