Ang Li

ThinkJEPA: Empowering Latent World Models with Large Vision-Language Reasoning Model

arXiv 2026

Demystifying When Pruning Works via Representation Hierarchies

arXiv 2026

On the Reliability of Computer Use Agents

arXiv 2026

STEP3-VL-10B Technical Report

arXiv 2026

The Unreasonable Effectiveness of Scaling Agents for Computer Use

arXiv 2025

Arctic-Text2SQL-R1: Simple Rewards, Strong Reasoning in Text-to-SQL

arXiv 2025

Twin-2K-500: A dataset for building digital twins of over 2,000 people based on their answers to over 500 questions

arXiv 2025

Every Activation Boosted: Scaling General Reasoner to 1 Trillion Open Language Foundation

arXiv 2025

CoIn: Counting the Invisible Reasoning Tokens in Commercial Opaque LLM APIs

arXiv 2025

VeriReason: Reinforcement Learning with Testbench Feedback for Reasoning-Enhanced Verilog Generation

Understanding and Harnessing Sparsity in Unified Multimodal Models

arXiv 2025

Teaching Pretrained Language Models to Think Deeper with Retrofitted Recurrence

arXiv 2025

Zebra-CoT: A Dataset for Interleaved Vision Language Reasoning

arXiv 2025

Agent S2: A Compositional Generalist-Specialist Framework for Computer Use Agents

arXiv 2025

Making Large Language Models Efficient Dense Retrievers

arXiv 2025

AppealCase: A Dataset and Benchmark for Civil Case Appeal Scenarios

arXiv 2025

Domino: Eliminating Communication in LLM Training via Generic Tensor Slicing and Overlapping

arXiv 2024

Enhancing One-Shot Federated Learning Through Data and Ensemble Co-Boosting

arXiv 2024

Router-Tuning: A Simple and Effective Approach for Enabling Dynamic-Depth in Transformers

arXiv 2024

What Matters in Transformers? Not All Attention is Needed

arXiv 2024

On Scaling Up 3D Gaussian Splatting Training

arXiv 2024

MobileVLM: A Vision-Language Model for Better Intra- and Inter-UI Understanding

arXiv 2024

Mobile-Bench: An Evaluation Benchmark for LLM-based Mobile Agents

arXiv 2024

Model-GLUE: Democratized LLM Scaling for A Large Model Zoo in the Wild

arXiv 2024

PersonalLLM: Tailoring LLMs to Individual Preferences

arXiv 2024

FLoRA: Federated Fine-Tuning Large Language Models with Heterogeneous Low-Rank Adaptations

arXiv 2024