Xiyang Dai
Verified email at microsoft.com
Title
Cited by
Year
CvT: Introducing convolutions to vision transformers
H Wu, B Xiao, N Codella, M Liu, X Dai, L Yuan, L Zhang
Proceedings of the IEEE/CVF international conference on computer vision, 22-31, 2021
Cited by 2261 · 2021
Dynamic convolution: Attention over convolution kernels
Y Chen, X Dai, M Liu, D Chen, L Yuan, Z Liu
CVF Conference on Computer Vision and Pattern Recognition, CVPR, 13-19, 2020
Cited by 1097 · 2020
Florence: A new foundation model for computer vision
L Yuan, D Chen, YL Chen, N Codella, X Dai, J Gao, H Hu, X Huang, B Li, ...
arXiv preprint arXiv:2111.11432, 2021
Cited by 891 · 2021
Dynamic head: Unifying object detection heads with attentions
X Dai, Y Chen, B Xiao, D Chen, M Liu, L Yuan, L Zhang
Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2021
Cited by 676 · 2021
Focal Self-attention for Local-Global Interactions in Vision Transformers
J Yang, C Li, P Zhang, X Dai, B Xiao, L Yuan, J Gao
Advances in Neural Information Processing Systems, 2021
Cited by 630* · 2021
Mobile-Former: Bridging MobileNet and transformer
Y Chen, X Dai, D Chen, M Liu, X Dong, L Yuan, Z Liu
Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2022
Cited by 580 · 2022
RegionCLIP: Region-based language-image pretraining
Y Zhong, J Yang, P Zhang, C Li, N Codella, LH Li, L Zhou, X Dai, L Yuan, ...
Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2022
Cited by 542 · 2022
Phi-3 technical report: A highly capable language model locally on your phone
M Abdin, SA Jacobs, AA Awan, J Aneja, A Awadallah, H Awadalla, ...
arXiv preprint arXiv:2404.14219, 2024
Cited by 539 · 2024
Multi-scale vision longformer: A new vision transformer for high-resolution image encoding
P Zhang, X Dai, J Yang, B Xiao, L Yuan, L Zhang, J Gao
Proceedings of the IEEE/CVF international conference on computer vision …, 2021
Cited by 378 · 2021
MAN: Moment alignment network for natural language moment retrieval via iterative graph adjustment
D Zhang, X Dai, X Wang, YF Wang, LS Davis
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2019
Cited by 346 · 2019
Dynamic DETR: End-to-end object detection with dynamic attention
X Dai, Y Chen, J Yang, P Zhang, L Yuan, L Zhang
Proceedings of the IEEE/CVF international conference on computer vision …, 2021
Cited by 325 · 2021
Temporal context network for activity localization in videos
X Dai, B Singh, G Zhang, LS Davis, Y Qiu Chen
Proceedings of the IEEE International Conference on Computer Vision, 5793-5802, 2017
Cited by 307 · 2017
GLIPv2: Unifying localization and vision-language understanding
H Zhang, P Zhang, X Hu, YC Chen, L Li, X Dai, L Wang, L Yuan, ...
Advances in Neural Information Processing Systems 35, 36067-36080, 2022
Cited by 291 · 2022
BEVT: BERT pretraining of video transformers
R Wang, D Chen, Z Wu, Y Chen, X Dai, M Liu, YG Jiang, L Zhou, L Yuan
Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2022
Cited by 246 · 2022
Focal modulation networks
J Yang, C Li, X Dai, J Gao
Advances in Neural Information Processing Systems 35, 4203-4217, 2022
Cited by 243 · 2022
Efficient self-supervised vision transformers for representation learning
C Li, J Yang, P Zhang, M Gao, B Xiao, X Dai, L Yuan, J Gao
arXiv preprint arXiv:2106.09785, 2021
Cited by 234 · 2021
Generalized decoding for pixel, image, and language
X Zou, ZY Dou, J Yang, Z Gan, L Li, C Li, X Dai, H Behl, J Wang, L Yuan, ...
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023
Cited by 230 · 2023
Dynamic ReLU
Y Chen, X Dai, M Liu, D Chen, L Yuan, Z Liu
European Conference on Computer Vision, 351-367, 2020
Cited by 228 · 2020
Reduce information loss in transformers for pluralistic image inpainting
Q Liu, Z Tan, D Chen, Q Chu, X Dai, Y Chen, M Liu, L Yuan, N Yu
Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2022
Cited by 94 · 2022
Masked video distillation: Rethinking masked feature modeling for self-supervised video representation learning
R Wang, D Chen, Z Wu, Y Chen, X Dai, M Liu, L Yuan, YG Jiang
Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2023
Cited by 92 · 2023