Xiaoyi Dong

Cited by

	All	Since 2019
Citations	3291	3287
h-index	26	26
i10-index	31	31

1600

800

400

1200

2020202120222023202421 82 464 1150 1566

Public access

View all

17 articles

1 article

available

not available

Based on funding mandates

Co-authors

Nenghai YuUniversity of Science and Technology of ChinaVerified email at ustc.edu.cn
weiming zhangUniversity of Science and Technology of ChinaVerified email at ustc.edu.cn
Dongdong ChenPrincipal Research Manager, GenAI, MicrosoftVerified email at mail.ustc.edu.cn
Lu YuanPrincipal Research Manager, Cognition, Cloud & AI, MicrosoftVerified email at microsoft.com
Jiaqi WangShanghai AI LaboratoryVerified email at pjlab.org.cn
Pan ZhangShanghai AI LaboratoryVerified email at mail.ustc.edu.cn
Jianmin BaoMicrosoft ResearchVerified email at microsoft.com
Dong ChenPrincipal Research Manager, Microsoft Research AsiaVerified email at microsoft.com
Yuhang ZangShanghai AI LaboratoryVerified email at pjlab.org.cn
Baining GuoDistinguished Scientist, Microsoft ResearchVerified email at microsoft.com
Qidong HuangUniversity of Science and Technology of ChinaVerified email at mail.ustc.edu.cn

Xiaoyi Dong

Shanghai AI Laboratory

Verified email at mail.ustc.edu.cn - Homepage

Computer Vision


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
CSWin transformer: A general vision transformer backbone with cross-shaped windows X Dong, J Bao, D Chen, W Zhang, N Yu, L Yuan, D Chen, B Guo IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2022), 2021	912	2021
Mobile-former: Bridging mobilenet and transformer Y Chen, X Dai, D Chen, M Liu, X Dong, L Yuan, Z Liu IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2022), 2021	468	2021
Peco: Perceptual codebook for bert pre-training of vision transformers X Dong, J Bao, T Zhang, D Chen, W Zhang, L Yuan, D Chen, F Wen, N Yu Thirty-Seventh AAAI Conference on Artificial Intelligence (AAAI), 2021	215	2021
Sharegpt4v: Improving large multi-modal models with better captions L Chen, J Li, X Dong, P Zhang, C He, J Wang, F Zhao, D Lin arXiv preprint arXiv:2311.12793, 2023	167	2023
Internlm: A multilingual language model with progressively enhanced capabilities ILM Team 2023-01-06)[2023-09-27]. https://github. com/InternLM/InternLM, 2023	146	2023
Protecting Celebrities from DeepFake with Identity Consistency Transformer X Dong, J Bao, D Chen, T Zhang, W Zhang, N Yu, D Chen, F Wen, B Guo IEEE Conference on Computer Vision and Pattern Recognition (CVPR 2022), 2022	114	2022
Lg-gan: Label guided adversarial network for flexible targeted attack of point cloud based deep networks H Zhou, D Chen, J Liao, K Chen, X Dong, K Liu, W Zhang, G Hua, N Yu Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2020	102	2020
InternLM-XComposer: A Vision-Language Large Model for Advanced Text-image Comprehension and Composition P Zhang, X Dong, B Wang, Y Cao, C Xu, L Ouyang, Z Zhao, S Ding, ... arXiv preprint arXiv:2309.15112, 2023	98	2023
Maskclip: Masked self-distillation advances contrastive language-image pretraining X Dong, J Bao, Y Zheng, T Zhang, D Chen, H Yang, M Zeng, W Zhang, ... Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023	92	2023
Internlm-xcomposer2: Mastering free-form text-image composition and comprehension in vision-language large model X Dong, P Zhang, Y Zang, Y Cao, B Wang, L Ouyang, X Wei, S Zhang, ... arXiv preprint arXiv:2401.16420, 2024	79	2024
Robust superpixel-guided attentional adversarial attack X Dong, J Han, D Chen, J Liu, H Bian, Z Ma, H Li, X Wang, W Zhang, N Yu Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2020	67	2020
GreedyFool: Distortion-Aware Sparse Adversarial Attack X Dong, D Chen, J Bao, C Qin, L Yuan, W Zhang, N Yu, D Chen Advances in Neural Information Processing Systems 33 (NeurIPS 2020), 2020	63	2020
Bootstrapped Masked Autoencoders for Vision BERT Pretraining X Dong, J Bao, T Zhang, D Chen, W Zhang, L Yuan, D Chen, F Wen, N Yu ECCV 2022, 2022	62	2022
Internlm2 technical report Z Cai, M Cao, H Chen, K Chen, K Chen, X Chen, X Chen, Z Chen, Z Chen, ... arXiv preprint arXiv:2403.17297, 2024	58	2024
Shape-invariant 3D adversarial point clouds Q Huang, X Dong, D Chen, H Zhou, W Zhang, N Yu Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2022	57	2022
Self-robust 3d point recognition via gather-vector guidance X Dong, D Chen, H Zhou, G Hua, W Zhang, N Yu 2020 IEEE/CVF conference on computer vision and pattern recognition (cvpr …, 2020	53	2020
How far are we to gpt-4v? closing the gap to commercial multimodal models with open-source suites Z Chen, W Wang, H Tian, S Ye, Z Gao, E Cui, W Tong, K Hu, J Luo, Z Ma, ... arXiv preprint arXiv:2404.16821, 2024	51	2024
Local geometric distortions resilient watermarking scheme based on symmetry Z Ma, W Zhang, H Fang, X Dong, L Geng, N Yu IEEE Transactions on Circuits and Systems for Video Technology 31 (12), 4826 …, 2021	45	2021
Diversity-aware meta visual prompting Q Huang, X Dong, D Chen, W Zhang, F Wang, G Hua, N Yu Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023	41	2023
Once a man: Towards multi-target attack via learning multi-target adversarial network once J Han, X Dong, R Zhang, D Chen, W Zhang, N Yu, P Luo, X Wang Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2019	39	2019

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors