Huiqiang Jiang
- Papers: 11
Authored papers
MMInference: Accelerating Pre-filling for Long-Context VLMs via Modality-Aware Permutation Sparse Attention
arXiv 2025
RetroInfer: A Vector-Storage Approach for Scalable Long-Context LLM Inference
arXiv 2025
GUI-Actor: Coordinate-Free Visual Grounding for GUI Agents
arXiv 2025
Chain-of-Model Learning for Language Model
arXiv 2025
LLMLingua-2: Data Distillation for Efficient and Faithful Task-Agnostic Prompt Compression
arXiv 2024
MInference 1.0: Accelerating Pre-filling for Long-Context LLMs via Dynamic Sparse Attention
arXiv 2024
RetrievalAttention: Accelerating Long-Context LLM Inference via Vector Retrieval
arXiv 2024
Mitigate Position Bias in Large Language Models via Scaling a Single Dimension
arXiv 2024
CoLaDa: A Collaborative Label Denoising Framework for Cross-lingual Named Entity Recognition
arXiv 2023
Multi-Level Knowledge Distillation for Out-of-Distribution Detection in Text
arXiv 2022
Attentive Mask CLIP
ICCV 2023