Biqing Qi
- Papers
- 18
Cite
Notes
Only stored in your browser.
Authored papers
18DARE: Diffusion Large Language Models Alignment and Reinforcement Executor
arXiv 2026
TTRL: Test-Time Reinforcement Learning
arXiv 2025
Graph Counselor: Adaptive Graph Exploration via Multi-Agent Synergy to Enhance LLM Reasoning
arXiv 2025
InternVL3.5: Advancing Open-Source Multimodal Models in Versatility, Reasoning, and Efficiency
arXiv 2025
ScaleCUA: Scaling Open-Source Computer Use Agents with Cross-Platform Data
arXiv 2025
A Survey of Reinforcement Learning for Large Reasoning Models
arXiv 2025
Sequential Diffusion Language Models
arXiv 2025
ScienceBoard: Evaluating Multimodal Autonomous Agents in Realistic Scientific Workflows
arXiv 2025
Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling
arXiv 2025
GenPRM: Scaling Test-Time Compute of Process Reward Models via Generative Reasoning
arXiv 2025
Large Language Models as Biomedical Hypothesis Generators: A Comprehensive Evaluation
arXiv 2024
Fast and Slow Generating: An Empirical Study on Large and Small Language Models Collaborative Decoding
arXiv 2024
UltraMedical: Building Specialized Generalists in Biomedicine
arXiv 2024
Many Heads Are Better Than One: Improved Scientific Idea Generation by A LLM-Based Multi-Agent System
arXiv 2024
Fourier Position Embedding: Enhancing Attention's Periodic Extension for Length Generalization
arXiv 2024
Intuitive Fine-Tuning: Towards Simplifying Alignment into a Single Process
arXiv 2024
CRaSh: Clustering, Removing, and Sharing Enhance Fine-tuning without Full Large Language Model
arXiv 2023
PaD: Program-aided Distillation Can Teach Small Models Reasoning Better than Chain-of-thought Fine-tuning
arXiv 2023
Affiliations
Frequent co-authors
10from 18 papers