Jia Li

Researcher associated with code-LLM and reasoning evaluation work (multiple individuals share this name in AI).

Role: researcher
Scholar: scholar.google.com/citations
Papers: 39

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile: scholar.google.com/citations

Attribution policy →

39papers·1tool contribs

Authored papers

39

Training LLM Agents for Spontaneous, Reward-Free Self-Evolution via World Knowledge Exploration

arXiv 2026

Scaling the Scaling Logic: Agentic Meta-Synthesis of Logic Reasoning

arXiv 2026

Numina-Lean-Agent: An Open and General Agentic Reasoning System for Formal Mathematics

arXiv 2026

Kimi K2.5: Visual Agentic Intelligence

arXiv 2026

Goedel-Prover: A Frontier Model for Open-Source Automated Theorem Proving

arXiv 2025

Kimina-Prover Preview: Towards Large Formal Reasoning Models with Reinforcement Learning

arXiv 2025

ForCenNet: Foreground-Centric Network for Document Image Rectification

ICCV 2025

Every Activation Boosted: Scaling General Reasoner to 1 Trillion Open Language Foundation

arXiv 2025

BatteryLife: A Comprehensive Dataset and Benchmark for Battery Life Prediction

arXiv 2025

Baichuan-Omni-1.5 Technical Report

arXiv 2025

DEER: Draft with Diffusion, Verify with Autoregressive Models

arXiv 2025

VAEmo: Efficient Representation Learning for Visual-Audio Emotion with Knowledge Injection

arXiv 2025

SATURN: SAT-based Reinforcement Learning to Unleash Language Model Reasoning

arXiv 2025

S$^2$R: Teaching LLMs to Self-verify and Self-correct via Reinforcement Learning

arXiv 2025

FANformer: Improving Large Language Models Through Effective Periodicity Modeling

arXiv 2025

CodeSwift: Accelerating LLM Inference for Efficient Code Generation

arXiv 2025

Think-on-Graph 3.0: Efficient and Adaptive LLM Reasoning on Heterogeneous Graphs via Multi-Agent Dual-Evolving Context Retrieval

arXiv 2025

NuminaMath: The Largest Public Dataset in AI4Maths with 860k Pairs of Competition Math Problems and Solutions

blog

Parameter-Efficient Fine-Tuning with Discrete Fourier Transform

arXiv 2024

FAN: Fourier Analysis Networks

arXiv 2024

The Oscars of AI Theater: A Survey on Role-Playing with Language Models

arXiv 2024

Protein Multimer Structure Prediction via Prompt Learning

arXiv 2024

Golden Touchstone: A Comprehensive Bilingual Benchmark for Evaluating Financial Large Language Models

arXiv 2024

Sifting through the Chaff: On Utilizing Execution Feedback for Ranking the Generated Code Candidates

arXiv 2024

GraphWiz: An Instruction-Following Language Model for Graph Problems

arXiv 2024

EvoCodeBench: An Evolving Code Generation Benchmark Aligned with Real-World Code Repositories

arXiv 2024

DevEval: A Manually-Annotated Code Generation Benchmark Aligned with Real-World Code Repositories

arXiv 2024

Compress to Impress: Unleashing the Potential of Compressive Memory in Real-World Long-Term Conversations

arXiv 2024

4-bit Shampoo for Memory-Efficient Network Training

arXiv 2024

EventRPG: Event Data Augmentation with Relevance Propagation Guidance

arXiv 2024

SantaCoder: don't reach for the stars!

arXiv 2023

Breaking Language Barriers in Multilingual Mathematical Reasoning: Insights and Observations

arXiv 2023

Graph Prompt Learning: A Comprehensive Survey and Beyond

arXiv 2023

Is Bigger and Deeper Always Better? Probing LLaMA Across Scales and Layers

arXiv 2023

SkCoder: A Sketch-based Approach for Automatic Code Generation

arXiv 2023

TriDet: Temporal Action Detection with Relative Boundary Modeling

CVPR 2023 1

Towards Efficient and Scale-Robust Ultra-High-Definition Image Demoireing

arXiv 2022

Large Language Models Meet Harry Potter: A Bilingual Dataset for Aligning Dialogue Agents with Characters

arXiv 2022

ReAct: Temporal Action Detection with Relational Queries

arXiv 2022

Tool contributions

1

NuminaMath

Numina

An 860k-problem competition-math dataset with detailed solutions, the open community's go-to corpus for training math-specialized LLMs.

SFT DatasetMathScientific Reasoning

Affiliations

No known affiliations.

Frequent co-authors

10

from 39 papers

Ge Li

7 shared papers

Nuo Chen

7 shared papers

Yihong Dong

5 shared papers

Zhi Jin

4 shared papers

Hao Zhu

researcher

3 shared papers

Huanyu Liu

3 shared papers

Kechi Zhang

3 shared papers

Longhui Yu

researcher

3 shared papers

Yan Wang

3 shared papers

Cheng Liu

2 shared papers