0

Zhiqiang Shen

Papers
31

Cite

Notes

Only stored in your browser.

Attribution

Affiliations & profile
Semantic Scholar
Attribution policy →
31papers

Authored papers

31

Dive into Claude Code: The Design Space of Today's and Future AI Agent Systems

arXiv 2026

2026

Sink-Aware Pruning for Diffusion Language Models

arXiv 2026

2026

From Masks to Pixels and Meaning: A New Taxonomy, Benchmark, and Metrics for VLM Image Tampering

arXiv 2026

2026

Open CaptchaWorld: A Comprehensive Web-based Platform for Testing and Benchmarking Multimodal LLM Agents

arXiv 2025

2025

OD3: Optimization-free Dataset Distillation for Object Detection

arXiv 2025

2025

KITAB-Bench: A Comprehensive Multi-Domain Benchmark for Arabic OCR and Document Understanding

arXiv 2025

2025

Learning Efficient and Generalizable Graph Retriever for Knowledge-Graph Question Answering

arXiv 2025

2025

A Frustratingly Simple Yet Highly Effective Attack Baseline: Over 90% Success Rate Against the Strong Black-box Models of GPT-4.5/4o/o1

arXiv 2025

2025

DarwinLM: Evolutionary Structured Pruning of Large Language Models

arXiv 2025

2025

A Survey on Diffusion Language Models

arXiv 2025

2025

Time Blindness: Why Video-Language Models Can't See What Humans Can?

arXiv 2025

2025

Dataset Distillation via Committee Voting

arXiv 2025

2025

Mobile-MMLU: A Mobile Intelligence Language Understanding Benchmark

arXiv 2025

2025

VideoMolmo: Spatio-Temporal Grounding Meets Pointing

arXiv 2025

2025

Crystal: Illuminating LLM Abilities on Language and Code

arXiv 2024

2024

FBI-LLM: Scaling Up Fully Binarized LLMs from Scratch via Autoregressive Distillation

arXiv 2024

2024

Web2Code: A Large-scale Webpage-to-Code Dataset and Evaluation Framework for Multimodal LLMs

arXiv 2024

2024

Initializing Models with Larger Ones

arXiv 2023

2023

LLM360: Towards Fully Transparent Open-Source LLMs

arXiv 2023

2023

ConvNet vs Transformer, Supervised vs CLIP: Beyond ImageNet Accuracy

arXiv 2023

2023

Beyond Size: How Gradients Shape Pruning Decisions in Large Language Models

arXiv 2023

2023

Generalized Large-Scale Data Condensation via Various Backbone and Statistical Matching

CVPR 2024 1

2023

Masked Spatio-Temporal Structure Prediction for Self-supervised Learning on Point Cloud Videos

ICCV 2023 1

2023

Towards Realistic Zero-Shot Classification via Self Structural Semantic Alignment

arXiv 2023

2023

Principled Instructions Are All You Need for Questioning LLaMA-1/2, GPT-3.5/4

arXiv 2023

2023

Dropout Reduces Underfitting

arXiv 2023

2023

One-for-All: Generalized LoRA for Parameter-Efficient Fine-tuning

arXiv 2023

2023

FerKD: Surgical Label Adaptation for Efficient Distillation

ferkd-surgical-label-adaptation-for-efficient

2023

Dataset Distillation via Curriculum Data Synthesis in Large Data Era

arXiv 2023

2023

Sliced Recursive Transformer

sliced-recursive-transformer

2021

MEAL V2: Boosting Vanilla ResNet-50 to 80%+ Top-1 Accuracy on ImageNet without Tricks

arXiv 2020

2020

Affiliations

No known affiliations.

Frequent co-authors

10

from 31 papers