Xiyang Dai

Cited by

	All	Since 2019
Citations	9001	8945
h-index	28	28
i10-index	42	42

3500

1750

875

2625

201820192020202120222023202447 115 150 548 1895 3481 2744

Public access

View all

8 articles

0 articles

available

not available

Based on funding mandates

Co-authors

Larry DavisProfessor of Computer Science, University of MarylandVerified email at cs.umd.edu
Da ZhangPhD candidate, University of California, Santa BarbaraVerified email at cs.ucsb.edu
Bharat SinghCruiseVerified email at getcruise.com
Bogdan MateiTechnical Director, SRI InternationalVerified email at sri.com
Harpreet S SawhneyMicrosoftVerified email at microsoft.com
Joe Yue-Hei NgGoogle Research
Xinchao WangNational University of SingaporeVerified email at nus.edu.sg

Xiyang Dai

Microsoft

Verified email at microsoft.com - Homepage

Computer Vision Deep Learning


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Cvt: Introducing convolutions to vision transformers H Wu, B Xiao, N Codella, M Liu, X Dai, L Yuan, L Zhang Proceedings of the IEEE/CVF international conference on computer vision, 22-31, 2021	1930	2021
Dynamic convolution: Attention over convolution kernels. Y Chen, X Dai, M Liu, D Chen, L Yuan, Z Liu CVF Conference on Computer Vision and Pattern Recognition, CVPR, 13-19, 2020	935	2020
Florence: A new foundation model for computer vision L Yuan, D Chen, YL Chen, N Codella, X Dai, J Gao, H Hu, X Huang, B Li, ... arXiv preprint arXiv:2111.11432, 2021	764	2021
Focal Self-attention for Local-Global Interactions in Vision Transformers J Yang, C Li, P Zhang, X Dai, B Xiao, L Yuan, J Gao Advances in Neural Information Processing Systems, 2021, 2021	545*	2021
Dynamic head: Unifying object detection heads with attentions X Dai, Y Chen, B Xiao, D Chen, M Liu, L Yuan, L Zhang Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2021	537	2021
Mobile-former: Bridging mobilenet and transformer Y Chen, X Dai, D Chen, M Liu, X Dong, L Yuan, Z Liu Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2022	468	2022
Regionclip: Region-based language-image pretraining Y Zhong, J Yang, P Zhang, C Li, N Codella, LH Li, L Zhou, X Dai, L Yuan, ... Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2022	416	2022
Multi-scale vision longformer: A new vision transformer for high-resolution image encoding P Zhang, X Dai, J Yang, B Xiao, L Yuan, L Zhang, J Gao Proceedings of the IEEE/CVF international conference on computer vision …, 2021	335	2021
Man: Moment alignment network for natural language moment retrieval via iterative graph adjustment D Zhang, X Dai, X Wang, YF Wang, LS Davis Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2019	322	2019
Temporal context network for activity localization in videos X Dai, B Singh, G Zhang, LS Davis, Y Qiu Chen Proceedings of the IEEE International Conference on Computer Vision, 5793-5802, 2017	303	2017
Dynamic detr: End-to-end object detection with dynamic attention X Dai, Y Chen, J Yang, P Zhang, L Yuan, L Zhang Proceedings of the IEEE/CVF international conference on computer vision …, 2021	258	2021
Glipv2: Unifying localization and vision-language understanding H Zhang, P Zhang, X Hu, YC Chen, L Li, X Dai, L Wang, L Yuan, ... Advances in Neural Information Processing Systems 35, 36067-36080, 2022	224	2022
Bevt: Bert pretraining of video transformers R Wang, D Chen, Z Wu, Y Chen, X Dai, M Liu, YG Jiang, L Zhou, L Yuan Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2022	211	2022
Efficient self-supervised vision transformers for representation learning C Li, J Yang, P Zhang, M Gao, B Xiao, X Dai, L Yuan, J Gao arXiv preprint arXiv:2106.09785, 2021	211	2021
Dynamic ReLU Y Chen, X Dai, M Liu, D Chen, L Yuan, Z Liu European Conference on Computer Vision, 351-367, 2020	199	2020
Generalized decoding for pixel, image, and language X Zou, ZY Dou, J Yang, Z Gan, L Li, C Li, X Dai, H Behl, J Wang, L Yuan, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023	170	2023
Focal modulation networks J Yang, C Li, X Dai, J Gao Advances in Neural Information Processing Systems 35, 4203-4217, 2022	161	2022
Phi-3 technical report: A highly capable language model locally on your phone M Abdin, SA Jacobs, AA Awan, J Aneja, A Awadallah, H Awadalla, ... arXiv preprint arXiv:2404.14219, 2024	121	2024
Fason: First and second order information fusion network for texture recognition X Dai, J Yue-Hei Ng, LS Davis Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2017	89	2017
Reduce information loss in transformers for pluralistic image inpainting Q Liu, Z Tan, D Chen, Q Chu, X Dai, Y Chen, M Liu, L Yuan, N Yu Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2022	76	2022

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors