Ang Li

Papers
31

Attribution

Affiliations & profile
Semantic Scholar

Authored papers

31

Step 3.5 Flash: Open Frontier-Level Intelligence with 11B Active Parameters

arXiv 2026

2026

ThinkJEPA: Empowering Latent World Models with Large Vision-Language Reasoning Model

arXiv 2026

2026

Demystifying When Pruning Works via Representation Hierarchies

arXiv 2026

2026

On the Reliability of Computer Use Agents

arXiv 2026

2026

STEP3-VL-10B Technical Report

arXiv 2026

2026

The Unreasonable Effectiveness of Scaling Agents for Computer Use

arXiv 2025

2025

Arctic-Text2SQL-R1: Simple Rewards, Strong Reasoning in Text-to-SQL

arXiv 2025

2025

Twin-2K-500: A dataset for building digital twins of over 2,000 people based on their answers to over 500 questions

arXiv 2025

2025

CoIn: Counting the Invisible Reasoning Tokens in Commercial Opaque LLM APIs

arXiv 2025

2025

VeriReason: Reinforcement Learning with Testbench Feedback for Reasoning-Enhanced Verilog Generation

2025

Understanding and Harnessing Sparsity in Unified Multimodal Models

arXiv 2025

2025

Teaching Pretrained Language Models to Think Deeper with Retrofitted Recurrence

arXiv 2025

2025

Zebra-CoT: A Dataset for Interleaved Vision Language Reasoning

arXiv 2025

2025

Making Large Language Models Efficient Dense Retrievers

arXiv 2025

2025

Agent S2: A Compositional Generalist-Specialist Framework for Computer Use Agents

arXiv 2025

2025

Every Activation Boosted: Scaling General Reasoner to 1 Trillion Open Language Foundation

arXiv 2025

2025

AppealCase: A Dataset and Benchmark for Civil Case Appeal Scenarios

arXiv 2025

2025

Domino: Eliminating Communication in LLM Training via Generic Tensor Slicing and Overlapping

arXiv 2024

2024

Enhancing One-Shot Federated Learning Through Data and Ensemble Co-Boosting

arXiv 2024

2024

Router-Tuning: A Simple and Effective Approach for Enabling Dynamic-Depth in Transformers

arXiv 2024

2024

What Matters in Transformers? Not All Attention is Needed

arXiv 2024

2024

FLoRA: Federated Fine-Tuning Large Language Models with Heterogeneous Low-Rank Adaptations

arXiv 2024

2024

On Scaling Up 3D Gaussian Splatting Training

arXiv 2024

2024

MobileVLM: A Vision-Language Model for Better Intra- and Inter-UI Understanding

arXiv 2024

2024

Mobile-Bench: An Evaluation Benchmark for LLM-based Mobile Agents

arXiv 2024

2024

Model-GLUE: Democratized LLM Scaling for A Large Model Zoo in the Wild

arXiv 2024

2024

PersonalLLM: Tailoring LLMs to Individual Preferences

arXiv 2024

2024

AutoReP: Automatic ReLU Replacement for Fast Private Network Inference

ICCV 2023

2023

Dissecting Tensor Cores via Microbenchmarks: Latency, Throughput and Numeric Behaviors

arXiv 2022

2022

M2DGR: A Multi-sensor and Multi-scenario SLAM Dataset for Ground Robots

arXiv 2021

2021

Improved Knowledge Distillation via Teacher Assistant

arXiv 2019

2019

Affiliations

No known affiliations.

Frequent co-authors

10

from 31 papers