Jia Li
Researcher associated with code-LLM and reasoning evaluation work (multiple individuals share this name in AI).
- Role
- researcher
- Scholar
- scholar.google.com/citations
- Papers
- 39
Cite
Notes
Only stored in your browser.
Authored papers
39Training LLM Agents for Spontaneous, Reward-Free Self-Evolution via World Knowledge Exploration
arXiv 2026
Numina-Lean-Agent: An Open and General Agentic Reasoning System for Formal Mathematics
arXiv 2026
Scaling the Scaling Logic: Agentic Meta-Synthesis of Logic Reasoning
arXiv 2026
Kimi K2.5: Visual Agentic Intelligence
arXiv 2026
SATURN: SAT-based Reinforcement Learning to Unleash Language Model Reasoning
arXiv 2025
ForCenNet: Foreground-Centric Network for Document Image Rectification
ICCV 2025
Every Activation Boosted: Scaling General Reasoner to 1 Trillion Open Language Foundation
arXiv 2025
BatteryLife: A Comprehensive Dataset and Benchmark for Battery Life Prediction
arXiv 2025
Baichuan-Omni-1.5 Technical Report
arXiv 2025
S$^2$R: Teaching LLMs to Self-verify and Self-correct via Reinforcement Learning
arXiv 2025
CodeSwift: Accelerating LLM Inference for Efficient Code Generation
arXiv 2025
Think-on-Graph 3.0: Efficient and Adaptive LLM Reasoning on Heterogeneous Graphs via Multi-Agent Dual-Evolving Context Retrieval
arXiv 2025
Goedel-Prover: A Frontier Model for Open-Source Automated Theorem Proving
arXiv 2025
Kimina-Prover Preview: Towards Large Formal Reasoning Models with Reinforcement Learning
arXiv 2025
FANformer: Improving Large Language Models Through Effective Periodicity Modeling
arXiv 2025
DEER: Draft with Diffusion, Verify with Autoregressive Models
arXiv 2025
VAEmo: Efficient Representation Learning for Visual-Audio Emotion with Knowledge Injection
arXiv 2025
NuminaMath: The Largest Public Dataset in AI4Maths with 860k Pairs of Competition Math Problems and Solutions
blog
Parameter-Efficient Fine-Tuning with Discrete Fourier Transform
arXiv 2024
FAN: Fourier Analysis Networks
arXiv 2024
The Oscars of AI Theater: A Survey on Role-Playing with Language Models
arXiv 2024
GraphWiz: An Instruction-Following Language Model for Graph Problems
arXiv 2024
EvoCodeBench: An Evolving Code Generation Benchmark Aligned with Real-World Code Repositories
arXiv 2024
Sifting through the Chaff: On Utilizing Execution Feedback for Ranking the Generated Code Candidates
arXiv 2024
DevEval: A Manually-Annotated Code Generation Benchmark Aligned with Real-World Code Repositories
arXiv 2024
Compress to Impress: Unleashing the Potential of Compressive Memory in Real-World Long-Term Conversations
arXiv 2024
4-bit Shampoo for Memory-Efficient Network Training
arXiv 2024
EventRPG: Event Data Augmentation with Relevance Propagation Guidance
arXiv 2024
Protein Multimer Structure Prediction via Prompt Learning
arXiv 2024
Golden Touchstone: A Comprehensive Bilingual Benchmark for Evaluating Financial Large Language Models
arXiv 2024
SantaCoder: don't reach for the stars!
arXiv 2023
Breaking Language Barriers in Multilingual Mathematical Reasoning: Insights and Observations
arXiv 2023
Graph Prompt Learning: A Comprehensive Survey and Beyond
arXiv 2023
Is Bigger and Deeper Always Better? Probing LLaMA Across Scales and Layers
arXiv 2023
TriDet: Temporal Action Detection with Relative Boundary Modeling
CVPR 2023 1
SkCoder: A Sketch-based Approach for Automatic Code Generation
arXiv 2023
Towards Efficient and Scale-Robust Ultra-High-Definition Image Demoireing
arXiv 2022
Large Language Models Meet Harry Potter: A Bilingual Dataset for Aligning Dialogue Agents with Characters
arXiv 2022
ReAct: Temporal Action Detection with Relational Queries
arXiv 2022
Tool contributions
1Affiliations
Frequent co-authors
10from 39 papers