Gang Li
- Papers
- 18
Cite
Notes
Only stored in your browser.
Authored papers
18MiDashengLM: Efficient Audio Understanding with General Audio Captions
arXiv 2025
DisCO: Reinforcing Large Reasoning Models with Discriminative Constrained Optimization
arXiv 2025
GLAP: General contrastive audio-text pretraining across domains and languages
arXiv 2025
Incentivizing Reasoning for Advanced Instruction-Following of Large Language Models
arXiv 2025
Training-Free Group Relative Policy Optimization
arXiv 2025
Youtu-Agent: Scaling Agent Productivity with Automated Generation and Hybrid Policy Optimization
arXiv 2025
Reinforcement Learning Outperforms Supervised Fine-Tuning: A Case Study on Audio Question Answering
arXiv 2025
SmartSnap: Proactive Evidence Seeking for Self-Verifying Agents
arXiv 2025
Unleashing the Power of Data Tsunami: A Comprehensive Survey on Data Assessment and Selection for Instruction Tuning of Language Models
arXiv 2024
FreeStyle: Free Lunch for Text-guided Style Transfer using Diffusion Models
arXiv 2024
CARES: A Comprehensive Benchmark of Trustworthiness in Medical Vision Language Models
arXiv 2024
PDT: Uav Target Detection Dataset for Pests and Diseases Tree
arXiv 2024
RULE: Reliable Multimodal RAG for Factuality in Medical Vision Language Models
arXiv 2024
Leveraging Open Knowledge for Advancing Task Expertise in Large Language Models
arXiv 2024
IterativePFN: True Iterative Point Cloud Filtering
CVPR 2023 1
Post Quantum Secure Blockchain-based Federated Learning for Mobile Edge Computing
arXiv 2023
Screen2Words: Automatic Mobile UI Summarization with Multimodal Learning
arXiv 2021
Widget Captioning: Generating Natural Language Description for Mobile User Interface Elements
EMNLP 2020 11
Affiliations
Frequent co-authors
10from 18 papers