Peng Zhang

Papers
27

Attribution

Affiliations & profile: Semantic Scholar

Authored papers

27

SkillX: Automatically Constructing Skill Knowledge Bases for Agents

arXiv 2026

2026

SkillNet: Create, Evaluate, and Connect AI Skills

arXiv 2026

2026

GLM-5V-Turbo: Toward a Native Foundation Model for Multimodal Agents

arXiv 2026

2026

Alleviating Sparse Rewards by Modeling Step-Wise and Long-Term Sampling Effects in Flow-Based GRPO

arXiv 2026

2026

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

preprint

2025

Fast-Slow Thinking for Large Vision-Language Model Reasoning

arXiv 2025

2025

GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning

arXiv 2025

2025

MoHoBench: Assessing Honesty of Multimodal Large Language Models via Unanswerable Visual Questions

arXiv 2025

2025

Agentar-Fin-R1: Enhancing Financial Intelligence through Domain Expertise, Training Efficiency, and Advanced Reasoning

arXiv 2025

2025

DeepSeek-V3 Technical Report

arXiv 2024

2024

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

arXiv 2024

2024

BoolQuestions: Does Dense Retrieval Understand Boolean Logic in Language?

arXiv 2024

2024

ChatGLM: A Family of Large Language Models from GLM-130B to GLM-4 All Tools

arXiv 2024

2024

CogVLM2: Visual Language Models for Image and Video Understanding

arXiv 2024

2024

PuLID: Pure and Lightning ID Customization via Contrastive Alignment

arXiv 2024

2024

QQQ: Quality Quattuor-Bit Quantization for Large Language Models

arXiv 2024

2024

Focus, Distinguish, and Prompt: Unleashing CLIP for Efficient and Flexible Scene Text Retrieval

arXiv 2024

2024

Trustworthy Alignment of Retrieval-Augmented Large Language Models via Reinforcement Learning

arXiv 2024

2024

Improving Semi-Supervised Semantic Segmentation with Dual-Level Siamese Structure Network

arXiv 2023

2023

MuseChat: A Conversational Music Recommendation System for Videos

CVPR 2024 1

2023

Tackling Data Bias in MUSIC-AVQA: Crafting a Balanced Dataset for Unbiased Question-Answering

arXiv 2023

2023

WebGLM: Towards An Efficient Web-Enhanced Question Answering System with Human Preferences

arXiv 2023

2023

MaTe3D: Mask-guided Text-based 3D-aware Portrait Editing

arXiv 2023

2023

DART: Articulated Hand Model with Diverse Accessories and Rich Textures

arXiv 2022

2022

DyStyle: Dynamic Neural Network for Multi-Attribute-Conditioned Style Editing

arXiv 2022

2022

GLM-130B: An Open Bilingual Pre-trained Model

arXiv 2022

2022

GraphNAS: Graph Neural Architecture Search with Reinforcement Learning

arXiv 2019

2019

Affiliations

No known affiliations.

Frequent co-authors

10

from 27 papers