Follow
Xuguang Duan
Xuguang Duan
Verified email at mails.tsinghua.edu.cn - Homepage
Title
Cited by
Cited by
Year
Weakly supervised dense event captioning in videos
X Duan, W Huang, C Gan, J Wang, W Zhu, J Huang
Advances in Neural Information Processing Systems 31, 2018
1532018
Avqa: A dataset for audio-visual question answering on videos
P Yang, X Wang, X Duan, H Chen, R Hou, C Jin, W Zhu
Proceedings of the 30th ACM international conference on multimedia, 3480-3491, 2022
302022
Memor: A dataset for multimodal emotion reasoning in videos
G Shen, X Wang, X Duan, H Li, W Zhu
Proceedings of the 28th ACM international conference on multimedia, 493-502, 2020
302020
Disenbooth: Disentangled parameter-efficient tuning for subject-driven text-to-image generation
H Chen, Y Zhang, X Wang, X Duan, Y Zhou, W Zhu
arXiv preprint arXiv:2305.03374, 2023
222023
Learning-to-ask: Knowledge acquisition via 20 questions
Y Chen, B Chen, X Duan, JG Lou, Y Wang, W Zhu, Y Cao
Proceedings of the 24th ACM SIGKDD International Conference on Knowledge …, 2018
172018
STDMANet: Spatio-temporal differential multiscale attention network for small moving infrared target detection
P Yan, R Hou, X Duan, C Yue, X Wang, X Cao
IEEE transactions on geoscience and remote sensing 61, 1-16, 2023
162023
Disenbooth: Identity-preserving disentangled tuning for subject-driven text-to-image generation
H Chen, Y Zhang, S Wu, X Wang, X Duan, Y Zhou, W Zhu
The Twelfth International Conference on Learning Representations, 2023
112023
Curriculum-nas: Curriculum weight-sharing neural architecture search
Y Zhou, X Wang, H Chen, X Duan, C Guan, W Zhu
Proceedings of the 30th ACM International Conference on Multimedia, 6792-6801, 2022
82022
Dynamic spatio-temporal modular network for video question answering
Z Qian, X Wang, X Duan, H Chen, W Zhu
Proceedings of the 30th ACM International Conference on Multimedia, 4466-4477, 2022
82022
Multi-modal contextual graph neural network for text visual question answering
Y Liang, X Wang, X Duan, W Zhu
2020 25th International Conference on Pattern Recognition (ICPR), 3491-3498, 2021
82021
Deeplogic: joint learning of neural perception and logical reasoning
X Duan, X Wang, P Zhao, G Shen, W Zhu
IEEE Transactions on Pattern Analysis and Machine Intelligence 45 (4), 4321-4334, 2022
72022
Watch, reason and code: Learning to represent videos using program
X Duan, Q Wu, C Gan, Y Zhang, W Huang, A Van Den Hengel, W Zhu
Proceedings of the 27th ACM International Conference on Multimedia, 1543-1551, 2019
62019
Decouple before interact: Multi-modal prompt learning for continual visual question answering
Z Qian, X Wang, X Duan, P Qin, Y Li, W Zhu
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023
52023
Curriculum-listener: Consistency-and complementarity-aware audio-enhanced temporal sentence grounding
H Chen, X Wang, X Lan, H Chen, X Duan, J Jia, W Zhu
Proceedings of the 31st ACM International Conference on Multimedia, 3117-3128, 2023
22023
Multimedia Cognition and Evaluation in Open Environments
W Feng, H Li, X Wang, X Duan, Z Qian, W Liu, W Zhu
Proceedings of the 1st International Workshop on Multimedia Content …, 2023
12023
Parametric visual program induction with function modularization
X Duan, X Wang, Z Zhang, W Zhu
International Conference on Machine Learning, 5643-5658, 2022
12022
Unsupervised Image Sequence Registration and Enhancement for Infrared Small Target Detection
R Hou, P Yan, X Duan, X Wang
IEEE Transactions on Geoscience and Remote Sensing, 2024
2024
DisenDreamer: Subject-Driven Text-to-Image Generation with Sample-aware Disentangled Tuning
H Chen, Y Zhang, X Wang, X Duan, Y Zhou, W Zhu
IEEE Transactions on Circuits and Systems for Video Technology, 2024
2024
Modularized parametric visual program induction algorithm, device, medium and product
W Zhu, X Wang, D Xuguang
US Patent App. 18/197,746, 2024
2024
Intra-and Inter-Modal Curriculum for Multimodal Learning
Y Zhou, X Wang, H Chen, X Duan, W Zhu
Proceedings of the 31st ACM International Conference on Multimedia, 3724-3735, 2023
2023
The system can't perform the operation now. Try again later.
Articles 1–20