0

Min Yang

Papers
40

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile
Semantic Scholar
Attribution policy →
40papers

Authored papers

40

Visual Merit or Linguistic Crutch? A Close Look at DeepSeek-OCR

arXiv 2026

2026

FlowPIE: Test-Time Scientific Idea Evolution with Flow-Guided Literature Exploration

arXiv 2026

2026

InteractWeb-Bench: Can Multimodal Agent Escape Blind Execution in Interactive Website Generation?

arXiv 2026

2026

OmniCharacter: Towards Immersive Role-Playing Agents with Seamless Speech-Language Personality Interaction

arXiv 2025

2025

OpenOmni: Large Language Models Pivot Zero-shot Omnimodal Alignment across Language with Real-time Self-Aware Emotional Speech Synthesis

arXiv 2025

2025

R1-ShareVL: Incentivizing Reasoning Capability of Multimodal Large Language Models via Share-GRPO

arXiv 2025

2025

IPBench: Benchmarking the Knowledge of Large Language Models in Intellectual Property

arXiv 2025

2025

PEToolLLM: Towards Personalized Tool Learning in Large Language Models

arXiv 2025

2025

Implicit Actor Critic Coupling via a Supervised Learning Framework for RLVR

arXiv 2025

2025

Distillation Quantification for Large Language Models

arXiv 2025

2025

Are Large Reasoning Models Good Translation Evaluators? Analysis and Performance Boost

arXiv 2025

2025

ChARM: Character-based Act-adaptive Reward Modeling for Advanced Role-Playing Language Agents

arXiv 2025

2025

SWE-Flow: Synthesizing Software Engineering Data in a Test-Driven Manner

arXiv 2025

2025

VIPER: Process-aware Evaluation for Generative Video Reasoning

arXiv 2025

2025

CPsyCoun: A Report-based Multi-turn Dialogue Reconstruction and Evaluation Framework for Chinese Psychological Counseling

arXiv 2024

2024

Agents in Software Engineering: Survey, Landscape, and Vision

arXiv 2024

2024

AutoPatent: A Multi-Agent Framework for Automatic Patent Generation

arXiv 2024

2024

Leave No Document Behind: Benchmarking Long-Context LLMs with Extended Multi-Doc QA

arXiv 2024

2024

Ruler: A Model-Agnostic Method to Control Generated Length for Large Language Models

arXiv 2024

2024

AgentCourt: Simulating Court with Adversarial Evolvable Lawyer Agents

arXiv 2024

2024

Long Context is Not Long at All: A Prospector of Long-Dependency Data for Large Language Models

arXiv 2024

2024

DEEM: Diffusion Models Serve as the Eyes of Large Language Models for Image Perception

arXiv 2024

2024

Training on the Benchmark Is Not All You Need

arXiv 2024

2024

LIME: Less Is More for MLLM Evaluation

arXiv 2024

2024

CoEvol: Constructing Better Responses for Instruction Finetuning through Multi-Agent Cooperation

arXiv 2024

2024

CollectiveSFT: Scaling Large Language Models for Chinese Medical Benchmark with Collective Instructions in Healthcare

arXiv 2024

2024

II-Bench: An Image Implication Understanding Benchmark for Multimodal Large Language Models

arXiv 2024

2024

Can MLLMs Understand the Deep Implication Behind Chinese Images?

arXiv 2024

2024

CPsyExam: A Chinese Benchmark for Evaluating Psychology using Examinations

arXiv 2024

2024

CLHA: A Simple yet Effective Contrastive Learning Framework for Human Alignment

arXiv 2024

2024

PaCE: Unified Multi-modal Dialogue Pre-training with Progressive and Compositional Experts

arXiv 2023

2023

Valley: Video Assistant with Large Language model Enhanced abilitY

arXiv 2023

2023

Marathon: A Race Through the Realm of Long Context with Large Language Models

arXiv 2023

2023

JADE: A Linguistics-based Safety Evaluation Platform for Large Language Models

arXiv 2023

2023

One-Shot Learning as Instruction Data Prospector for Large Language Models

arXiv 2023

2023

Iterative Forward Tuning Boosts In-Context Learning in Language Models

arXiv 2023

2023

Contrastive variational information bottleneck for aspect-based sentiment analysis

arXiv 2023

2023

GALAXY: A Generative Pre-trained Model for Task-Oriented Dialog with Semi-Supervised Learning and Explicit Policy Injection

arXiv 2021

2021

DOLG: Single-Stage Image Retrieval with Deep Orthogonal Fusion of Local and Global Features

ICCV 2021 10

2021

Improving Knowledge-aware Dialogue Generation via Knowledge Base Question Answering

arXiv 2019

2019

Affiliations

No known affiliations.

Frequent co-authors

10

from 40 papers