Peng Zhang
- Papers: 27
Authored papers (27)
SkillX: Automatically Constructing Skill Knowledge Bases for Agents
arXiv 2026
SkillNet: Create, Evaluate, and Connect AI Skills
arXiv 2026
GLM-5V-Turbo: Toward a Native Foundation Model for Multimodal Agents
arXiv 2026
Alleviating Sparse Rewards by Modeling Step-Wise and Long-Term Sampling Effects in Flow-Based GRPO
arXiv 2026
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning
preprint
Fast-Slow Thinking for Large Vision-Language Model Reasoning
arXiv 2025
GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning
arXiv 2025
MoHoBench: Assessing Honesty of Multimodal Large Language Models via Unanswerable Visual Questions
arXiv 2025
Agentar-Fin-R1: Enhancing Financial Intelligence through Domain Expertise, Training Efficiency, and Advanced Reasoning
arXiv 2025
DeepSeek-V3 Technical Report
arXiv 2024
DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model
arXiv 2024
BoolQuestions: Does Dense Retrieval Understand Boolean Logic in Language?
arXiv 2024
ChatGLM: A Family of Large Language Models from GLM-130B to GLM-4 All Tools
arXiv 2024
CogVLM2: Visual Language Models for Image and Video Understanding
arXiv 2024
PuLID: Pure and Lightning ID Customization via Contrastive Alignment
arXiv 2024
QQQ: Quality Quattuor-Bit Quantization for Large Language Models
arXiv 2024
Focus, Distinguish, and Prompt: Unleashing CLIP for Efficient and Flexible Scene Text Retrieval
arXiv 2024
Trustworthy Alignment of Retrieval-Augmented Large Language Models via Reinforcement Learning
arXiv 2024
Improving Semi-Supervised Semantic Segmentation with Dual-Level Siamese Structure Network
arXiv 2023
MuseChat: A Conversational Music Recommendation System for Videos
CVPR 2024
Tackling Data Bias in MUSIC-AVQA: Crafting a Balanced Dataset for Unbiased Question-Answering
arXiv 2023
WebGLM: Towards An Efficient Web-Enhanced Question Answering System with Human Preferences
arXiv 2023
MaTe3D: Mask-guided Text-based 3D-aware Portrait Editing
arXiv 2023
DART: Articulated Hand Model with Diverse Accessories and Rich Textures
arXiv 2022
DyStyle: Dynamic Neural Network for Multi-Attribute-Conditioned Style Editing
arXiv 2022
GLM-130B: An Open Bilingual Pre-trained Model
arXiv 2022
GraphNAS: Graph Neural Architecture Search with Reinforcement Learning
arXiv 2019
Frequent co-authors
10 from 27 papers