Chulhee Yun
Assistant Professor, KAIST Kim Jaechul Graduate School of AI
Verified email at kaist.ac.kr
Title · Cited by · Year
Are Transformers universal approximators of sequence-to-sequence functions?
C Yun, S Bhojanapalli, AS Rawat, SJ Reddi, S Kumar
ICLR 2020 (arXiv:1912.10077), 2019
Cited by 308 · 2019
Minimum width for universal approximation
S Park, C Yun, J Lee, J Shin
ICLR 2021 (arXiv:2006.08859), 2020
Cited by 133 · 2020
Small nonlinearities in activation functions create bad local minima in neural networks
C Yun, S Sra, A Jadbabaie
ICLR 2019 (arXiv:1802.03487), 2018
Cited by 129* · 2018
Small ReLU networks are powerful memorizers: a tight analysis of memorization capacity
C Yun, S Sra, A Jadbabaie
NeurIPS 2019 (arXiv:1810.07770), 2019
Cited by 122 · 2019
Global optimality conditions for deep neural networks
C Yun, S Sra, A Jadbabaie
ICLR 2018 (arXiv:1707.02444), 2017
Cited by 116 · 2017
A Unifying View on Implicit Bias in Training Linear Neural Networks
C Yun, S Krishnan, H Mobahi
ICLR 2021 (arXiv:2010.02501), 2020
Cited by 78 · 2020
Low-Rank Bottleneck in Multi-head Attention Models
S Bhojanapalli, C Yun, AS Rawat, SJ Reddi, S Kumar
ICML 2020 (arXiv:2002.07028), 2020
Cited by 76 · 2020
O(n) Connections are Expressive Enough: Universal Approximability of Sparse Transformers
C Yun, YW Chang, S Bhojanapalli, AS Rawat, SJ Reddi, S Kumar
NeurIPS 2020 (arXiv:2006.04862), 2020
Cited by 66 · 2020
SGD with shuffling: optimal rates without component convexity and large epoch requirements
K Ahn*, C Yun*, S Sra
NeurIPS 2020 (arXiv:2006.06946), 2020
Cited by 65 · 2020
Minibatch vs local SGD with shuffling: Tight convergence bounds and beyond
C Yun, S Rajput, S Sra
ICLR 2022 (arXiv:2110.10342), 2021
Cited by 36 · 2021
Provable memorization via deep neural networks using sub-linear parameters
S Park, J Lee, C Yun, J Shin
COLT 2021 (arXiv:2010.13363), 2021
Cited by 30 · 2021
Minimax bounds on stochastic batched convex optimization
J Duchi, F Ruan, C Yun
COLT 2018, 3065-3162, 2018
Cited by 29 · 2018
Linear attention is (maybe) all you need (to understand transformer optimization)
K Ahn, X Cheng, M Song, C Yun, A Jadbabaie, S Sra
ICLR 2024 (arXiv:2310.01082), 2023
Cited by 20 · 2023
Open Problem: Can Single-Shuffle SGD be Better than Reshuffling SGD and GD?
C Yun, S Sra, A Jadbabaie
COLT 2021 (arXiv:2103.07079), 2021
Cited by 17* · 2021
Tighter Lower Bounds for Shuffling SGD: Random Permutations and Beyond
J Cha, J Lee, C Yun
ICML 2023 (arXiv:2303.07160), 2023
Cited by 14 · 2023
Are deep ResNets provably better than linear predictors?
C Yun, S Sra, A Jadbabaie
NeurIPS 2019 (arXiv:1907.03922), 2019
Cited by 14 · 2019
PLASTIC: Improving Input and Label Plasticity for Sample Efficient Reinforcement Learning
H Lee, H Cho, H Kim, D Gwak, J Kim, J Choo, SY Yun, C Yun
NeurIPS 2023 (arXiv:2306.10711), 2023
Cited by 13* · 2023
Efficiently testing local optimality and escaping saddles for ReLU networks
C Yun, S Sra, A Jadbabaie
ICLR 2019 (arXiv:1809.10858), 2018
Cited by 9 · 2018
Practical Sharpness-Aware Minimization Cannot Converge All the Way to Optima
D Si, C Yun
NeurIPS 2023 (arXiv:2306.09850), 2023
Cited by 7 · 2023
Provable Benefit of Mixup for Finding Optimal Decision Boundaries
J Oh, C Yun
ICML 2023 (arXiv:2306.00267), 2023
Cited by 6 · 2023