Zhijiang Guo
- Papers: 21
Authored papers (21)
EnvFactory: Scaling Tool-Use Agents via Executable Environments Synthesis and Robust RL
arXiv 2026
CodeScaler: Scaling Code LLM Training and Test-Time Inference via Execution-Free Reward Models
arXiv 2026
TIME: A Multi-level Benchmark for Temporal Reasoning of LLMs in Real-World Scenarios
arXiv 2025
Beyond Pass@1: Self-Play with Variational Problem Synthesis Sustains RLVR
arXiv 2025
TL;DR: Too Long, Do Re-weighting for Efficient LLM Reasoning Compression
arXiv 2025
From System 1 to System 2: A Survey of Reasoning Large Language Models
arXiv 2025
TreeRPO: Tree Relative Policy Optimization
arXiv 2025
ACE: Attribution-Controlled Knowledge Editing for Multi-hop Factual Recall
arXiv 2025
Knowledge Conflicts for LLMs: A Survey
arXiv 2024
Process-Driven Autoformalization in Lean 4
arXiv 2024
OptiBench Meets ReSocratic: Measure and Improve LLMs for Optimization Modeling
arXiv 2024
EffiCoder: Enhancing Code Generation in Large Language Models through Efficiency-Aware Fine-tuning
arXiv 2024
Learning From Correctness Without Prompting Makes LLM Efficient Reasoner
arXiv 2024
Aligning with Human Judgement: The Role of Pairwise Preference in Large Language Model Evaluators
arXiv 2024
MHPP: Exploring the Capabilities and Limitations of Language Models Beyond Basic Code Generation
arXiv 2024
PROXYQA: An Alternative Framework for Evaluating Long-Form Text Generation with Large Language Models
arXiv 2024
EffiLearner: Enhancing Efficiency of Generated Code via Self-Optimization
arXiv 2024
AVeriTeC: A Dataset for Real-world Claim Verification with Evidence from the Web
TRIGO: Benchmarking Formal Mathematical Proof Reduction for Generative Language Models
arXiv 2023
DQ-LoRe: Dual Queries with Low Rank Approximation Re-ranking for In-Context Learning
arXiv 2023
FEVEROUS: Fact Extraction and VERification Over Unstructured and Structured information
arXiv 2021
Frequent co-authors
10, from 21 papers