Xuguang Duan

Cited by

	All	Since 2019
Citations	325	325
h-index	8	8
i10-index	7	7

140

105

20192020202120222023202417 27 41 55 126 59

Public access

View all

10 articles

2 articles

available

not available

Based on funding mandates

Co-authors

Wenwu ZhuProfessor, Computer Science, Tsinghua UniverisityVerified email at tsinghua.edu.cn
Xin WangDepartment of Computer Science and Technology, Tsinghua UniversityVerified email at tsinghua.edu.cn
Chuang GanUMass Amherst | MIT-IBM Watson AI LabVerified email at csail.mit.edu
Wenbing HuangAssociate Professor, Renmin University of ChinaVerified email at ruc.edu.cn
Guangyao ShenTsinghua UniversityVerified email at mails.tsinghua.edu.cn
Yihong ChenUCL NLPVerified email at ucl.ac.uk
Bei ChenMicrosoft Research AsiaVerified email at microsoft.com
Yong CAOAlibaba Inc.Verified email at alibaba-inc.com
Jian-Guang LOUMicrosoft Research AsiaVerified email at microsoft.com
Yaoyuan LiangTsinghua UniversityVerified email at mails.tsinghua.edu.cn
Peilin Zhao (赵沛霖)TencentVerified email at tencent.com
Qi WuAssociate Professor, University of Adelaide, Adelaide, AustraliaVerified email at adelaide.edu.au
Yiwei ZhangUniversity of Wisconsin–MadisonVerified email at wisc.edu
Ziwei ZhangAssociate Professor, School of Computer Science and Engineering, Beihang University, ChinaVerified email at buaa.edu.cn

Xuguang Duan

Tsinghua

Verified email at mails.tsinghua.edu.cn - Homepage

Vision and Language Neural-Symbolic Learning


Title Sort by citations Sort by year Sort by title	Cited by Cited by	Year
Weakly supervised dense event captioning in videos X Duan, W Huang, C Gan, J Wang, W Zhu, J Huang Advances in Neural Information Processing Systems 31, 2018	153	2018
Avqa: A dataset for audio-visual question answering on videos P Yang, X Wang, X Duan, H Chen, R Hou, C Jin, W Zhu Proceedings of the 30th ACM international conference on multimedia, 3480-3491, 2022	30	2022
Memor: A dataset for multimodal emotion reasoning in videos G Shen, X Wang, X Duan, H Li, W Zhu Proceedings of the 28th ACM international conference on multimedia, 493-502, 2020	30	2020
Disenbooth: Disentangled parameter-efficient tuning for subject-driven text-to-image generation H Chen, Y Zhang, X Wang, X Duan, Y Zhou, W Zhu arXiv preprint arXiv:2305.03374, 2023	22	2023
Learning-to-ask: Knowledge acquisition via 20 questions Y Chen, B Chen, X Duan, JG Lou, Y Wang, W Zhu, Y Cao Proceedings of the 24th ACM SIGKDD International Conference on Knowledge …, 2018	17	2018
STDMANet: Spatio-temporal differential multiscale attention network for small moving infrared target detection P Yan, R Hou, X Duan, C Yue, X Wang, X Cao IEEE transactions on geoscience and remote sensing 61, 1-16, 2023	16	2023
Disenbooth: Identity-preserving disentangled tuning for subject-driven text-to-image generation H Chen, Y Zhang, S Wu, X Wang, X Duan, Y Zhou, W Zhu The Twelfth International Conference on Learning Representations, 2023	11	2023
Curriculum-nas: Curriculum weight-sharing neural architecture search Y Zhou, X Wang, H Chen, X Duan, C Guan, W Zhu Proceedings of the 30th ACM International Conference on Multimedia, 6792-6801, 2022	8	2022
Dynamic spatio-temporal modular network for video question answering Z Qian, X Wang, X Duan, H Chen, W Zhu Proceedings of the 30th ACM International Conference on Multimedia, 4466-4477, 2022	8	2022
Multi-modal contextual graph neural network for text visual question answering Y Liang, X Wang, X Duan, W Zhu 2020 25th International Conference on Pattern Recognition (ICPR), 3491-3498, 2021	8	2021
Deeplogic: joint learning of neural perception and logical reasoning X Duan, X Wang, P Zhao, G Shen, W Zhu IEEE Transactions on Pattern Analysis and Machine Intelligence 45 (4), 4321-4334, 2022	7	2022
Watch, reason and code: Learning to represent videos using program X Duan, Q Wu, C Gan, Y Zhang, W Huang, A Van Den Hengel, W Zhu Proceedings of the 27th ACM International Conference on Multimedia, 1543-1551, 2019	6	2019
Decouple before interact: Multi-modal prompt learning for continual visual question answering Z Qian, X Wang, X Duan, P Qin, Y Li, W Zhu Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023	5	2023
Curriculum-listener: Consistency-and complementarity-aware audio-enhanced temporal sentence grounding H Chen, X Wang, X Lan, H Chen, X Duan, J Jia, W Zhu Proceedings of the 31st ACM International Conference on Multimedia, 3117-3128, 2023	2	2023
Multimedia Cognition and Evaluation in Open Environments W Feng, H Li, X Wang, X Duan, Z Qian, W Liu, W Zhu Proceedings of the 1st International Workshop on Multimedia Content …, 2023	1	2023
Parametric visual program induction with function modularization X Duan, X Wang, Z Zhang, W Zhu International Conference on Machine Learning, 5643-5658, 2022	1	2022
Unsupervised Image Sequence Registration and Enhancement for Infrared Small Target Detection R Hou, P Yan, X Duan, X Wang IEEE Transactions on Geoscience and Remote Sensing, 2024		2024
DisenDreamer: Subject-Driven Text-to-Image Generation with Sample-aware Disentangled Tuning H Chen, Y Zhang, X Wang, X Duan, Y Zhou, W Zhu IEEE Transactions on Circuits and Systems for Video Technology, 2024		2024
Modularized parametric visual program induction algorithm, device, medium and product W Zhu, X Wang, D Xuguang US Patent App. 18/197,746, 2024		2024
Intra-and Inter-Modal Curriculum for Multimodal Learning Y Zhou, X Wang, H Chen, X Duan, W Zhu Proceedings of the 31st ACM International Conference on Multimedia, 3724-3735, 2023		2023

The system can't perform the operation now. Try again later.

Articles 1–20

Citations per year

Duplicate citations

Merged citations

Add co-authorsCo-authors

Follow

Cited by

Co-authors