Zhiqiang Shen
- Papers: 31
Authored papers (31)
Dive into Claude Code: The Design Space of Today's and Future AI Agent Systems
arXiv 2026
Sink-Aware Pruning for Diffusion Language Models
arXiv 2026
From Masks to Pixels and Meaning: A New Taxonomy, Benchmark, and Metrics for VLM Image Tampering
arXiv 2026
Open CaptchaWorld: A Comprehensive Web-based Platform for Testing and Benchmarking Multimodal LLM Agents
arXiv 2025
OD3: Optimization-free Dataset Distillation for Object Detection
arXiv 2025
KITAB-Bench: A Comprehensive Multi-Domain Benchmark for Arabic OCR and Document Understanding
arXiv 2025
Learning Efficient and Generalizable Graph Retriever for Knowledge-Graph Question Answering
arXiv 2025
A Frustratingly Simple Yet Highly Effective Attack Baseline: Over 90% Success Rate Against the Strong Black-box Models of GPT-4.5/4o/o1
arXiv 2025
DarwinLM: Evolutionary Structured Pruning of Large Language Models
arXiv 2025
A Survey on Diffusion Language Models
arXiv 2025
Time Blindness: Why Video-Language Models Can't See What Humans Can?
arXiv 2025
Dataset Distillation via Committee Voting
arXiv 2025
Mobile-MMLU: A Mobile Intelligence Language Understanding Benchmark
arXiv 2025
VideoMolmo: Spatio-Temporal Grounding Meets Pointing
arXiv 2025
Crystal: Illuminating LLM Abilities on Language and Code
arXiv 2024
FBI-LLM: Scaling Up Fully Binarized LLMs from Scratch via Autoregressive Distillation
arXiv 2024
Web2Code: A Large-scale Webpage-to-Code Dataset and Evaluation Framework for Multimodal LLMs
arXiv 2024
Initializing Models with Larger Ones
arXiv 2023
LLM360: Towards Fully Transparent Open-Source LLMs
arXiv 2023
ConvNet vs Transformer, Supervised vs CLIP: Beyond ImageNet Accuracy
arXiv 2023
Beyond Size: How Gradients Shape Pruning Decisions in Large Language Models
arXiv 2023
Generalized Large-Scale Data Condensation via Various Backbone and Statistical Matching
CVPR 2024
Masked Spatio-Temporal Structure Prediction for Self-supervised Learning on Point Cloud Videos
ICCV 2023
Towards Realistic Zero-Shot Classification via Self Structural Semantic Alignment
arXiv 2023
Principled Instructions Are All You Need for Questioning LLaMA-1/2, GPT-3.5/4
arXiv 2023
Dropout Reduces Underfitting
arXiv 2023
One-for-All: Generalized LoRA for Parameter-Efficient Fine-tuning
arXiv 2023
FerKD: Surgical Label Adaptation for Efficient Distillation
Dataset Distillation via Curriculum Data Synthesis in Large Data Era
arXiv 2023
Sliced Recursive Transformer
MEAL V2: Boosting Vanilla ResNet-50 to 80%+ Top-1 Accuracy on ImageNet without Tricks
arXiv 2020
Frequent co-authors: 10, from 31 papers