Wei Wu
- Papers
- 32
Cite
Notes
Only stored in your browser.
Authored papers
32OScaR: The Occam's Razor for Extreme KV Cache Quantization in LLMs and Beyond
arXiv 2026
A Pragmatic VLA Foundation Model
arXiv 2026
MemForest: An Efficient Agent Memory System with Hierarchical Temporal Indexing
arXiv 2026
SEAL: Synergistic Co-Evolution of Agents and Learning Environments
arXiv 2026
DelTA: Discriminative Token Credit Assignment for Reinforcement Learning from Verifiable Rewards
arXiv 2026
Attention Sink in Transformers: A Survey on Utilization, Interpretation, and Mitigation
arXiv 2026
PresentAgent-2: Towards Generalist Multimodal Presentation Agents
arXiv 2026
Reinforcing Spatial Reasoning in Vision-Language Models with Interwoven Thinking and Visual Drawing
arXiv 2025
ReFusion: A Diffusion Large Language Model with Parallel Autoregressive Decoding
arXiv 2025
PromptCoT 2.0: Scaling Prompt Synthesis for Large Language Model Reasoning
arXiv 2025
RoboScape: Physics-informed Embodied World Model
arXiv 2025
GENERator: A Long-Context Generative Genomic Foundation Model
arXiv 2025
LEMMA: Learning from Errors for MatheMatical Advancement in LLMs
arXiv 2025
DynaAct: Large Language Model Reasoning with Dynamic Action Spaces
arXiv 2025
PromptCoT: Synthesizing Olympiad-level Problems for Mathematical Reasoning in Large Language Models
arXiv 2025
Exploring the Performance Improvement of Tensor Processing Engines through Transformation in the Bit-weight Dimension of MACs
arXiv 2025
Scaling Video-Language Models to 10K Frames via Hierarchical Differential Distillation
arXiv 2025
Human Decision-making is Susceptible to AI-driven Manipulation
arXiv 2025
TokenSelect: Efficient Long-Context Inference and Length Extrapolation for LLMs via Dynamic Token-Level KV Cache Selection
arXiv 2024
From the Least to the Most: Building a Plug-and-Play Visual Reasoner via Data Synthesis
arXiv 2024
Unlocking Reasoning Potential in Large Langauge Models by Scaling Code-form Planning
arXiv 2024
DreamLIP: Language-Image Pre-training with Long Captions
arXiv 2024
RoboSense: Large-scale Dataset and Benchmark for Egocentric Robot Perception and Navigation in Crowded and Unstructured Environments
CVPR 2025 1
AMOR: A Recipe for Building Adaptable Modular Knowledge Agents Through Process Feedback
arXiv 2024
Augmenting Transformers with Recursively Composed Multi-grained Representations
arXiv 2023
TagAlign: Improving Vision-Language Alignment with Multi-Tag Classification
arXiv 2023
T5-SR: A Unified Seq-to-Seq Decoding Strategy for Semantic Parsing
arXiv 2023
Low-Complexity Acoustic Echo Cancellation with Neural Kalman Filtering
arXiv 2022
LidarGait: Benchmarking 3D Gait Recognition with Point Clouds
CVPR 2023 1
Semi-Supervised Knowledge-Grounded Pre-training for Task-Oriented Dialog Systems
arXiv 2022
ConSERT: A Contrastive Framework for Self-Supervised Sentence Representation Transfer
ACL 2021 5
Pay More Attention to History: A Context Modelling Strategy for Conversational Text-to-SQL
arXiv 2021
Affiliations
Frequent co-authors
10from 32 papers