Sheng Shen
- Papers
- 16
Cite
Notes
Only stored in your browser.
Authored papers
16MINT-1T: Scaling Open-Source Multimodal Data by 10x: A Multimodal Dataset with One Trillion Tokens
arXiv 2024
LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement
arXiv 2024
Enhancing Large Vision Language Models with Self-Training on Image Comprehension
arXiv 2024
AvalonBench: Evaluating LLMs Playing the Game of Avalon
arXiv 2023
AgentBench: Evaluating LLMs as Agents
arXiv 2023
SqueezeLLM: Dense-and-Sparse Quantization
arXiv 2023
Poisoning Language Models During Instruction Tuning
arXiv 2023
HallE-Control: Controlling Object Hallucination in Large Multimodal Models
arXiv 2023
What Language Model to Train if You Have One Million GPU Hours?
arXiv 2022
Crosslingual Generalization through Multitask Finetuning
arXiv 2022
Multitask Vision-Language Prompt Tuning
arXiv 2022
How Much Can CLIP Benefit Vision-and-Language Tasks?
arXiv 2021
Learned Token Pruning for Transformers
arXiv 2021
ADAHESSIAN: An Adaptive Second Order Optimizer for Machine Learning
arXiv 2020
PowerNorm: Rethinking Batch Normalization in Transformers
ICML 2020 1
Noisy Self-Knowledge Distillation for Text Summarization
NAACL 2021 4
Affiliations
Frequent co-authors
10from 16 papers