0

Hao Cheng

Papers
31

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile
Semantic Scholar
Attribution policy →
31papers

Authored papers

31

Orchard: An Open-Source Agentic Modeling Framework

arXiv 2026

2026

GUI-Libra: Training Native GUI Agents to Reason and Act with Action-aware Supervision and Partially Verifiable RL

arXiv 2026

2026

Reinforcement Learning for Reasoning in Large Language Models with One Training Example

arXiv 2025

2025

Streaming Video Question-Answering with In-context Video KV-Cache Retrieval

arXiv 2025

2025

Token Cleaning: Fine-Grained Data Selection for LLM Supervised Fine-Tuning

arXiv 2025

2025

ThetaEvolve: Test-time Learning on Open Problems

arXiv 2025

2025

Compose Your Policies! Improving Diffusion-based or Flow-based Robot Policies via Test-time Distribution-level Composition

arXiv 2025

2025

Agent KB: Leveraging Cross-Domain Experience for Agentic Problem Solving

arXiv 2025

2025

Decoder-Hybrid-Decoder Architecture for Efficient Reasoning with Long Generation

arXiv 2025

2025

Modality-Composable Diffusion Policy via Inference-Time Distribution-level Composition

arXiv 2025

2025

VAEmo: Efficient Representation Learning for Visual-Audio Emotion with Knowledge Injection

arXiv 2025

2025

TruthPrInt: Mitigating LVLM Object Hallucination Via Latent Truthful-Guided Pre-Intervention

arXiv 2025

2025

Exploring Typographic Visual Prompts Injection Threats in Cross-Modality Generation Models

arXiv 2025

2025

Spiking Neural Network as Adaptive Event Stream Slicer

arXiv 2024

2024

ExACT: Teaching AI Agents to Explore with Reflective-MCTS and Exploratory Learning

arXiv 2024

2024

Spiking Diffusion Models

arXiv 2024

2024

RobustTSF: Towards Theory and Design of Robust Time Series Forecasting with Anomalies

arXiv 2024

2024

Model Tells Itself Where to Attend: Faithfulness Meets Automatic Attention Steering

arXiv 2024

2024

DyFADet: Dynamic Feature Aggregation for Temporal Action Detection

arXiv 2024

2024

Iterative Self-Tuning LLMs for Enhanced Jailbreaking Capabilities

arXiv 2024

2024

Reassessing Layer Pruning in LLMs: New Insights and Methods

arXiv 2024

2024

Spiking Denoising Diffusion Probabilistic Models

arXiv 2023

2023

LLaVA-Plus: Learning to Use Tools for Creating Multimodal Agents

arXiv 2023

2023

Shifting Attention to Relevance: Towards the Predictive Uncertainty Quantification of Free-Form Large Language Models

arXiv 2023

2023

Augmenting Language Models with Long-Term Memory

augmenting-language-models-with-long-term

2023

AceGPT, Localizing Large Language Models in Arabic

arXiv 2023

2023

Trustworthy LLMs: a Survey and Guideline for Evaluating Large Language Models' Alignment

arXiv 2023

2023

DOS: Diverse Outlier Sampling for Out-of-Distribution Detection

arXiv 2023

2023

Unmasking and Improving Data Credibility: A Study with Datasets for Training Harmless Language Models

arXiv 2023

2023

Language Models as Inductive Reasoners

arXiv 2022

2022

Bi-directional Attention with Agreement for Dependency Parsing

bi-directional-attention-with-agreement-for-1

2016

Affiliations

No known affiliations.

Frequent co-authors

10

from 31 papers