Ruobing Xie

Multi-Grained Patch Training for Efficient LLM-based Recommendation

arXiv 2025

LaSeR: Reinforcement Learning with Last-Token Self-Rewarding

arXiv 2025

The Climb Carves Wisdom Deeper Than the Summit: On the Noisy Rewards in Learning to Reason

arXiv 2025

PhD: A ChatGPT-Prompted Visual hallucination Evaluation Dataset

CVPR 2025

DHCP: Detecting Hallucinations by Cross-modal Attention Pattern in Large Vision-Language Models

arXiv 2024

Advancing LLM Reasoning Generalists with Preference Trees

arXiv 2024

Content-Based Collaborative Generation for Recommender Systems

arXiv 2024

Controllable Preference Optimization: Toward Controllable Multi-Objective Alignment

arXiv 2024

Beyond Natural Language: LLMs Leveraging Alternative Formats for Enhanced Reasoning and Communication

arXiv 2024

Continuous Speech Tokenizer in Text To Speech

arXiv 2024

Hunyuan-Large: An Open-Source MoE Model with 52 Billion Activated Parameters by Tencent

arXiv 2024

Internet of Agents: Weaving a Web of Heterogeneous Agents for Collaborative Intelligence

arXiv 2024

More Expressive Attention with Negative Weights

arXiv 2024

AgentVerse: Facilitating Multi-Agent Collaboration and Exploring Emergent Behaviors

arXiv 2023

ToolLLM: Facilitating Large Language Models to Master 16000+ Real-world APIs

arXiv 2023

UltraFeedback: Boosting Language Models with High-quality Feedback

ICML

MAVEN-Arg: Completing the Puzzle of All-in-One Event Understanding Dataset with Event Argument Annotation

arXiv 2023