Sheng Shen

Papers: 16

Attribution

Affiliations & profile: Semantic Scholar

Authored papers

MINT-1T: Scaling Open-Source Multimodal Data by 10x: A Multimodal Dataset with One Trillion Tokens

arXiv 2024

LLM2LLM: Boosting LLMs with Novel Iterative Data Enhancement

arXiv 2024

Enhancing Large Vision Language Models with Self-Training on Image Comprehension

arXiv 2024

AvalonBench: Evaluating LLMs Playing the Game of Avalon

arXiv 2023

AgentBench: Evaluating LLMs as Agents

arXiv 2023

SqueezeLLM: Dense-and-Sparse Quantization

arXiv 2023

Poisoning Language Models During Instruction Tuning

arXiv 2023

HallE-Control: Controlling Object Hallucination in Large Multimodal Models

arXiv 2023

What Language Model to Train if You Have One Million GPU Hours?

arXiv 2022

Crosslingual Generalization through Multitask Finetuning

arXiv 2022

Multitask Vision-Language Prompt Tuning

arXiv 2022

Learned Token Pruning for Transformers

arXiv 2021

How Much Can CLIP Benefit Vision-and-Language Tasks?

arXiv 2021

ADAHESSIAN: An Adaptive Second Order Optimizer for Machine Learning

arXiv 2020

PowerNorm: Rethinking Batch Normalization in Transformers

ICML 2020 1

Noisy Self-Knowledge Distillation for Text Summarization

NAACL 2021 4

2020

Affiliations

No known affiliations.

Frequent co-authors

from 16 papers

Kurt Keutzer

Amir Gholami

Michael W. Mahoney

Sehoon Kim

Zhewei Yao

Bohan Zhai

Colin Raffel

Kai-Wei Chang

Lintang Sutawika

M Saiful Bari