Ang Li
- Papers: 31
Authored papers
Step 3.5 Flash: Open Frontier-Level Intelligence with 11B Active Parameters
arXiv 2026
ThinkJEPA: Empowering Latent World Models with Large Vision-Language Reasoning Model
arXiv 2026
Demystifying When Pruning Works via Representation Hierarchies
arXiv 2026
On the Reliability of Computer Use Agents
arXiv 2026
STEP3-VL-10B Technical Report
arXiv 2026
The Unreasonable Effectiveness of Scaling Agents for Computer Use
arXiv 2025
Arctic-Text2SQL-R1: Simple Rewards, Strong Reasoning in Text-to-SQL
arXiv 2025
Twin-2K-500: A dataset for building digital twins of over 2,000 people based on their answers to over 500 questions
arXiv 2025
CoIn: Counting the Invisible Reasoning Tokens in Commercial Opaque LLM APIs
arXiv 2025
VeriReason: Reinforcement Learning with Testbench Feedback for Reasoning-Enhanced Verilog Generation
Understanding and Harnessing Sparsity in Unified Multimodal Models
arXiv 2025
Teaching Pretrained Language Models to Think Deeper with Retrofitted Recurrence
arXiv 2025
Zebra-CoT: A Dataset for Interleaved Vision Language Reasoning
arXiv 2025
Making Large Language Models Efficient Dense Retrievers
arXiv 2025
Agent S2: A Compositional Generalist-Specialist Framework for Computer Use Agents
arXiv 2025
Every Activation Boosted: Scaling General Reasoner to 1 Trillion Open Language Foundation
arXiv 2025
AppealCase: A Dataset and Benchmark for Civil Case Appeal Scenarios
arXiv 2025
Domino: Eliminating Communication in LLM Training via Generic Tensor Slicing and Overlapping
arXiv 2024
Enhancing One-Shot Federated Learning Through Data and Ensemble Co-Boosting
arXiv 2024
Router-Tuning: A Simple and Effective Approach for Enabling Dynamic-Depth in Transformers
arXiv 2024
What Matters in Transformers? Not All Attention is Needed
arXiv 2024
FLoRA: Federated Fine-Tuning Large Language Models with Heterogeneous Low-Rank Adaptations
arXiv 2024
On Scaling Up 3D Gaussian Splatting Training
arXiv 2024
MobileVLM: A Vision-Language Model for Better Intra- and Inter-UI Understanding
arXiv 2024
Mobile-Bench: An Evaluation Benchmark for LLM-based Mobile Agents
arXiv 2024
Model-GLUE: Democratized LLM Scaling for A Large Model Zoo in the Wild
arXiv 2024
PersonalLLM: Tailoring LLMs to Individual Preferences
arXiv 2024
AutoReP: Automatic ReLU Replacement for Fast Private Network Inference
ICCV 2023
Dissecting Tensor Cores via Microbenchmarks: Latency, Throughput and Numeric Behaviors
arXiv 2022
M2DGR: A Multi-sensor and Multi-scenario SLAM Dataset for Ground Robots
arXiv 2021
Improved Knowledge Distillation via Teacher Assistant
arXiv 2019
Frequent co-authors
10 from 31 papers