Zhijiang Guo

Authored papers

21

EnvFactory: Scaling Tool-Use Agents via Executable Environments Synthesis and Robust RL

arXiv 2026

2026

CodeScaler: Scaling Code LLM Training and Test-Time Inference via Execution-Free Reward Models

arXiv 2026

2026

TIME: A Multi-level Benchmark for Temporal Reasoning of LLMs in Real-World Scenarios

arXiv 2025

2025

Beyond Pass@1: Self-Play with Variational Problem Synthesis Sustains RLVR

arXiv 2025

2025

TL;DR: Too Long, Do Re-weighting for Efficient LLM Reasoning Compression

arXiv 2025

2025

From System 1 to System 2: A Survey of Reasoning Large Language Models

arXiv 2025

2025

TreeRPO: Tree Relative Policy Optimization

arXiv 2025

2025

ACE: Attribution-Controlled Knowledge Editing for Multi-hop Factual Recall

arXiv 2025

2025

Knowledge Conflicts for LLMs: A Survey

arXiv 2024

2024

Process-Driven Autoformalization in Lean 4

arXiv 2024

2024

OptiBench Meets ReSocratic: Measure and Improve LLMs for Optimization Modeling

arXiv 2024

2024

EffiCoder: Enhancing Code Generation in Large Language Models through Efficiency-Aware Fine-tuning

arXiv 2024

2024

Learning From Correctness Without Prompting Makes LLM Efficient Reasoner

arXiv 2024

2024

Aligning with Human Judgement: The Role of Pairwise Preference in Large Language Model Evaluators

arXiv 2024

2024

MHPP: Exploring the Capabilities and Limitations of Language Models Beyond Basic Code Generation

arXiv 2024

2024

PROXYQA: An Alternative Framework for Evaluating Long-Form Text Generation with Large Language Models

arXiv 2024

2024

EffiLearner: Enhancing Efficiency of Generated Code via Self-Optimization

arXiv 2024

2024

AVeriTeC: A Dataset for Real-world Claim Verification with Evidence from the Web

2023

TRIGO: Benchmarking Formal Mathematical Proof Reduction for Generative Language Models

arXiv 2023

2023

DQ-LoRe: Dual Queries with Low Rank Approximation Re-ranking for In-Context Learning

arXiv 2023

2023

FEVEROUS: Fact Extraction and VERification Over Unstructured and Structured information

arXiv 2021

2021

Affiliations

No known affiliations.

Frequent co-authors

10

from 21 papers