Wei Wu

Papers
32

Attribution

Affiliations & profile
Semantic Scholar

Authored papers

32

OScaR: The Occam's Razor for Extreme KV Cache Quantization in LLMs and Beyond

arXiv 2026

2026

A Pragmatic VLA Foundation Model

arXiv 2026

2026

MemForest: An Efficient Agent Memory System with Hierarchical Temporal Indexing

arXiv 2026

2026

SEAL: Synergistic Co-Evolution of Agents and Learning Environments

arXiv 2026

2026

DelTA: Discriminative Token Credit Assignment for Reinforcement Learning from Verifiable Rewards

arXiv 2026

2026

Attention Sink in Transformers: A Survey on Utilization, Interpretation, and Mitigation

arXiv 2026

2026

PresentAgent-2: Towards Generalist Multimodal Presentation Agents

arXiv 2026

2026

Reinforcing Spatial Reasoning in Vision-Language Models with Interwoven Thinking and Visual Drawing

arXiv 2025

2025

ReFusion: A Diffusion Large Language Model with Parallel Autoregressive Decoding

arXiv 2025

2025

PromptCoT 2.0: Scaling Prompt Synthesis for Large Language Model Reasoning

arXiv 2025

2025

RoboScape: Physics-informed Embodied World Model

arXiv 2025

2025

GENERator: A Long-Context Generative Genomic Foundation Model

arXiv 2025

2025

LEMMA: Learning from Errors for MatheMatical Advancement in LLMs

arXiv 2025

2025

DynaAct: Large Language Model Reasoning with Dynamic Action Spaces

arXiv 2025

2025

PromptCoT: Synthesizing Olympiad-level Problems for Mathematical Reasoning in Large Language Models

arXiv 2025

2025

Exploring the Performance Improvement of Tensor Processing Engines through Transformation in the Bit-weight Dimension of MACs

arXiv 2025

2025

Scaling Video-Language Models to 10K Frames via Hierarchical Differential Distillation

arXiv 2025

2025

Human Decision-making is Susceptible to AI-driven Manipulation

arXiv 2025

2025

TokenSelect: Efficient Long-Context Inference and Length Extrapolation for LLMs via Dynamic Token-Level KV Cache Selection

arXiv 2024

2024

From the Least to the Most: Building a Plug-and-Play Visual Reasoner via Data Synthesis

arXiv 2024

2024

Unlocking Reasoning Potential in Large Language Models by Scaling Code-form Planning

arXiv 2024

2024

DreamLIP: Language-Image Pre-training with Long Captions

arXiv 2024

2024

RoboSense: Large-scale Dataset and Benchmark for Egocentric Robot Perception and Navigation in Crowded and Unstructured Environments

CVPR 2025 1

2024

AMOR: A Recipe for Building Adaptable Modular Knowledge Agents Through Process Feedback

arXiv 2024

2024

Augmenting Transformers with Recursively Composed Multi-grained Representations

arXiv 2023

2023

TagAlign: Improving Vision-Language Alignment with Multi-Tag Classification

arXiv 2023

2023

T5-SR: A Unified Seq-to-Seq Decoding Strategy for Semantic Parsing

arXiv 2023

2023

Low-Complexity Acoustic Echo Cancellation with Neural Kalman Filtering

arXiv 2022

2022

LidarGait: Benchmarking 3D Gait Recognition with Point Clouds

CVPR 2023 1

2022

Semi-Supervised Knowledge-Grounded Pre-training for Task-Oriented Dialog Systems

arXiv 2022

2022

ConSERT: A Contrastive Framework for Self-Supervised Sentence Representation Transfer

ACL 2021 5

2021

Pay More Attention to History: A Context Modelling Strategy for Conversational Text-to-SQL

arXiv 2021

2021

Affiliations

No known affiliations.

Frequent co-authors

10

from 32 papers