Libo Qin
- Papers
- 25
Cite
Notes
Only stored in your browser.
Authored papers
25Retrieval-Infused Reasoning Sandbox: A Benchmark for Decoupling Retrieval and Reasoning Capabilities
arXiv 2026
Stratagem: Learning Transferable Reasoning via Trajectory-Modulated Game Self-Play
arXiv 2026
SAVOIR: Learning Social Savoir-Faire via Shapley-based Reward Attribution
arXiv 2026
Towards Reasoning Era: A Survey of Long Chain-of-Thought for Reasoning Large Language Models
arXiv 2025
AutoPR: Let's Automate Your Academic Promotion!
arXiv 2025
MAC-SLU: Multi-Intent Automotive Cabin Spoken Language Understanding Benchmark
arXiv 2025
CCHall: A Novel Benchmark for Joint Cross-Lingual and Cross-Modal Hallucinations Detection in Large Language Models
arXiv 2025
Beyond Correctness: Evaluating Subjective Writing Preferences Across Cultures
arXiv 2025
COIG-Writer: A High-Quality Dataset for Chinese Creative Writing with Thought Processes
arXiv 2025
M$^3$CoT: A Novel Benchmark for Multi-Domain Multi-step Multi-modal Chain-of-Thought
arXiv 2024
Unlocking the Capabilities of Thought: A Reasoning Boundary Framework to Quantify and Optimize Chain-of-Thought
arXiv 2024
KVSharer: Efficient Inference via Layer-Wise Dissimilar KV Cache Sharing
arXiv 2024
What are the Essential Factors in Crafting Effective Long Context Multi-Hop Instruction Datasets? Insights and Best Practices
arXiv 2024
CoMT: A Novel Benchmark for Chain of Multi-modal Thought on Large Vision-Language Models
arXiv 2024
Self-Constructed Context Decompilation with Fined-grained Alignment Enhancement
arXiv 2024
GlobeSumm: A Challenging Benchmark Towards Unifying Multi-lingual, Cross-lingual and Multi-document News Summarization
arXiv 2024
Exploring Multi-Grained Concept Annotations for Multimodal Large Language Models
arXiv 2024
LasUIE: Unifying Information Extraction with Latent Adaptive Structure-aware Generative Language Model
arXiv 2023
Cross-lingual Prompting: Improving Zero-shot Chain-of-Thought Reasoning across Languages
arXiv 2023
GLUE-X: Evaluating Natural Language Understanding Models from an Out-of-distribution Generalization Perspective
arXiv 2022
Text is no more Enough! A Benchmark for Profile-based Spoken Language Understanding
arXiv 2021
FewCLUE: A Chinese Few-shot Learning Evaluation Benchmark
arXiv 2021
A Survey on Spoken Language Understanding: Recent Advances and New Frontiers
arXiv 2021
NL-Augmenter: A Framework for Task-Sensitive Natural Language Augmentation
arXiv 2021
N-LTP: An Open-source Neural Language Technology Platform for Chinese
EMNLP (ACL) 2021 11
Affiliations
Frequent co-authors
10from 25 papers