Xiaokang Zhang
- Papers
- 12
Cite
Notes
Only stored in your browser.
Authored papers
12DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
preprint
OmniSQL: Synthesizing High-quality Text-to-SQL Data at Scale
arXiv 2025
Dynamic Scaling of Unit Tests for Code Reward Modeling
arXiv 2025
DeepSeekMath-V2: Towards Self-Verifiable Mathematical Reasoning
arXiv 2025
CoT-based Synthesizer: Enhancing LLM Performance through Answer Synthesis
arXiv 2025
Uncovering the Impact of Chain-of-Thought Reasoning for Direct Preference Optimization: Lessons from Text-to-SQL
arXiv 2025
DeepSeek-V3 Technical Report
arXiv 2024
CodeS: Towards Building Open-source Language Models for Text-to-SQL
arXiv 2024
SAM Decoding: Speculative Decoding via Suffix Automaton
arXiv 2024
MANet: Fine-Tuning Segment Anything Model for Multimodal Remote Sensing Semantic Segmentation
arXiv 2024
TableLLM: Enabling Tabular Data Manipulation by LLMs in Real Office Usage Scenarios
arXiv 2024
GLM-Dialog: Noise-tolerant Pre-training for Knowledge-grounded Dialogue Generation
arXiv 2023
Affiliations
Frequent co-authors
10from 12 papers