Sheng Shen

Cited by

	All	Since 2019
Citations	6340	6318
h-index	28	28
i10-index	35	35

2700

1350

675

2025

201820192020202120222023202418 41 153 348 850 2617 2295

Public access

View all

7 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Kurt KeutzerProfessor of the Graduate School, EECS, University of California, BerkeleyVerified email at berkeley.edu
Zhewei YaoSnowflakeVerified email at snowflake.com
Michael MahoneyProfessor of Statistics, UC BerkeleyVerified email at stat.berkeley.edu
Amir GholamiResearch Scientist, University of California, BerkeleyVerified email at eecs.berkeley.edu
Trevor DarrellProfessor of Computer Science, U.C. BerkeleyVerified email at eecs.berkeley.edu
Chunyuan LiMicrosoft Research, RedmondVerified email at microsoft.com
Joseph E. GonzalezProfessor of Computer Science, UC BerkeleyVerified email at berkeley.edu
Xuanzhe LiuBoya Distinguished Professor of Computer Science, Peking University, ACM Distinguished ScientistVerified email at pku.edu.cn
Qiaozhu MeiProfessor, University of MichiganVerified email at umich.edu
Iz BeltagyAllen Institute for Artificial IntelligenceVerified email at beltagy.net
Le HouGoogleVerified email at google.com
Denny ZhouResearch Scientist, Google DeepMindVerified email at google.com
Douwe KielaContextual AI, Stanford UniversityVerified email at stanford.edu
Yaliang LiAlibaba GroupVerified email at alibaba-inc.com
Dan KleinUC Berkeley

Sheng Shen

UC Berkeley

Verified email at berkeley.edu - Homepage

Machine Learning Natural Language Processing


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Multitask prompted training enables zero-shot task generalization V Sanh, A Webson, C Raffel, SH Bach, L Sutawika, Z Alyafeai, A Chaffin, ... ICLR 2022, 2021	1403	2021
Bloom: A 176b-parameter open-access multilingual language model T Le Scao, A Fan, C Akiki, E Pavlick, S Ilić, D Hesslow, R Castagné, ...	1350	2023
Q-bert: Hessian based ultra low precision quantization of bert S Shen, Z Dong, J Ye, L Ma, Z Yao, A Gholami, MW Mahoney, K Keutzer AAAI 2020, 2019	533	2019
Crosslingual generalization through multitask finetuning N Muennighoff, T Wang, L Sutawika, A Roberts, S Biderman, TL Scao, ... ACL 2023, 2022	473	2022
How Much Can CLIP Benefit Vision-and-Language Tasks? S Shen, LH Li, H Tan, M Bansal, A Rohrbach, KW Chang, Z Yao, ... ICLR 2022, 2021	374	2021
Train Large, Then Compress: Rethinking Model Size for Efficient Training and Inference of Transformers Z Li, E Wallace, S Shen, K Lin, K Keutzer, D Klein, JE Gonzalez ICML 2020, 2020	267	2020
ADAHESSIAN: An Adaptive Second Order Optimizer for Machine Learning Z Yao, A Gholami, S Shen, K Keutzer, MW Mahoney AAAI 2021, 2020	242	2020
Agentbench: Evaluating llms as agents X Liu, H Yu, H Zhang, Y Xu, X Lei, H Lai, Y Gu, H Ding, K Men, K Yang, ... arXiv preprint arXiv:2308.03688, 2023	199*	2023
Learned token pruning for transformers S Kim, S Shen, D Thorsley, A Gholami, W Kwon, J Hassoun, K Keutzer KDD 2022, 2021	101	2021
Aligning large multimodal models with factually augmented rlhf Z Sun, S Shen, S Cao*, H Liu, C Li, Y Shen, C Gan, LY Gui, YX Wang, ... arXiv preprint arXiv:2309.14525, 2023	98	2023
An annotated dataset of literary entities D Bamman, S Popat, S Shen NAACL 2019, 2019	97	2019
Poisoning Language Models During Instruction Tuning A Wan, E Wallace, S Shen, D Klein ICML 2023, 2023	91	2023
Llava-next: Improved reasoning, ocr, and world knowledge H Liu, C Li, Y Li, B Li, Y Zhang, S Shen, YJ Lee	90	2024
What Language Model to Train if You Have One Million GPU Hours? T Le Scao, T Wang, D Hesslow, L Saulnier, S Bekman, MS Bari, ... EMNLP 2022, 2022	89	2022
Powernorm: Rethinking batch normalization in transformers S Shen, Z Yao, A Gholami, M Mahoney, K Keutzer ICML 2020, 2020	85	2020
Ermes: Emoji-Powered Representation Learning for Cross-Lingual Sentiment Classification Z Chen, S Shen, Z Hu, X Lu, Q Mei, X Liu WWW 2019, 2018	85*	2018
SqueezeLLM: Dense-and-Sparse Quantization S Kim, C Hooper, A Gholami*, Z Dong, X Li, S Shen, MW Mahoney, ... arXiv preprint arXiv:2306.07629, 2023	84	2023
K-lite: Learning transferable visual models with external knowledge S Shen, C Li, X Hu, Y Xie, J Yang, P Zhang, A Rohrbach, Z Gan, L Wang, ... NeurIPS 2022, 2022	71	2022
Pragmatically Informative Text Generation S Shen, D Fried, J Andreas, D Klein NAACL 2019, 2019	70	2019
Through a gender lens: An empirical study of emoji usage over large-scale android users Z Chen, X Lu, S Shen, W Ai, X Liu, Q Mei arXiv preprint arXiv:1705.05546 10 (3178876.3186157), 2017	70	2017

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors