Xuguang Duan
Verified email at mails.tsinghua.edu.cn
Title
Cited by
Year
Weakly supervised dense event captioning in videos
X Duan, W Huang, C Gan, J Wang, W Zhu, J Huang
Advances in Neural Information Processing Systems 31, 2018
Cited by: 167; Year: 2018
Avqa: A dataset for audio-visual question answering on videos
P Yang, X Wang, X Duan, H Chen, R Hou, C Jin, W Zhu
Proceedings of the 30th ACM international conference on multimedia, 3480-3491, 2022
Cited by: 50; Year: 2022
Memor: A dataset for multimodal emotion reasoning in videos
G Shen, X Wang, X Duan, H Li, W Zhu
Proceedings of the 28th ACM international conference on multimedia, 493-502, 2020
Cited by: 39; Year: 2020
Disenbooth: Disentangled parameter-efficient tuning for subject-driven text-to-image generation
H Chen, Y Zhang, X Wang, X Duan, Y Zhou, W Zhu
arXiv preprint arXiv:2305.03374 3, 2023
Cited by: 36; Year: 2023
Disenbooth: Identity-preserving disentangled tuning for subject-driven text-to-image generation
H Chen, Y Zhang, S Wu, X Wang, X Duan, Y Zhou, W Zhu
arXiv preprint arXiv:2305.03374, 2023
Cited by: 33; Year: 2023
STDMANet: Spatio-temporal differential multiscale attention network for small moving infrared target detection
P Yan, R Hou, X Duan, C Yue, X Wang, X Cao
IEEE transactions on geoscience and remote sensing 61, 1-16, 2023
Cited by: 28; Year: 2023
Learning-to-ask: Knowledge acquisition via 20 questions
Y Chen, B Chen, X Duan, JG Lou, Y Wang, W Zhu, Y Cao
Proceedings of the 24th ACM SIGKDD International Conference on Knowledge …, 2018
Cited by: 19; Year: 2018
Curriculum-nas: Curriculum weight-sharing neural architecture search
Y Zhou, X Wang, H Chen, X Duan, C Guan, W Zhu
Proceedings of the 30th ACM International Conference on Multimedia, 6792-6801, 2022
Cited by: 14; Year: 2022
Decouple before interact: Multi-modal prompt learning for continual visual question answering
Z Qian, X Wang, X Duan, P Qin, Y Li, W Zhu
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023
Cited by: 12; Year: 2023
Dynamic spatio-temporal modular network for video question answering
Z Qian, X Wang, X Duan, H Chen, W Zhu
Proceedings of the 30th ACM International Conference on Multimedia, 4466-4477, 2022
Cited by: 11; Year: 2022
Deeplogic: Joint learning of neural perception and logical reasoning
X Duan, X Wang, P Zhao, G Shen, W Zhu
IEEE Transactions on Pattern Analysis and Machine Intelligence 45 (4), 4321-4334, 2022
Cited by: 11; Year: 2022
Intra- and Inter-Modal Curriculum for Multimodal Learning
Y Zhou, X Wang, H Chen, X Duan, W Zhu
Proceedings of the 31st ACM International Conference on Multimedia, 3724-3735, 2023
Cited by: 10; Year: 2023
Curriculum-listener: Consistency- and complementarity-aware audio-enhanced temporal sentence grounding
H Chen, X Wang, X Lan, H Chen, X Duan, J Jia, W Zhu
Proceedings of the 31st ACM International Conference on Multimedia, 3117-3128, 2023
Cited by: 9; Year: 2023
Multi-modal contextual graph neural network for text visual question answering
Y Liang, X Wang, X Duan, W Zhu
2020 25th International Conference on Pattern Recognition (ICPR), 3491-3498, 2021
Cited by: 8; Year: 2021
Watch, reason and code: Learning to represent videos using program
X Duan, Q Wu, C Gan, Y Zhang, W Huang, A Van Den Hengel, W Zhu
Proceedings of the 27th ACM International Conference on Multimedia, 1543-1551, 2019
Cited by: 8; Year: 2019
DisenDreamer: Subject-Driven Text-to-Image Generation with Sample-aware Disentangled Tuning
H Chen, Y Zhang, X Wang, X Duan, Y Zhou, W Zhu
IEEE Transactions on Circuits and Systems for Video Technology, 2024
Cited by: 7; Year: 2024
Unsupervised Image Sequence Registration and Enhancement for Infrared Small Target Detection
R Hou, P Yan, X Duan, X Wang
IEEE Transactions on Geoscience and Remote Sensing, 2024
Cited by: 2; Year: 2024
Parametric visual program induction with function modularization
X Duan, X Wang, Z Zhang, W Zhu
International Conference on Machine Learning, 5643-5658, 2022
Cited by: 2; Year: 2022
Multimedia Cognition and Evaluation in Open Environments
W Feng, H Li, X Wang, X Duan, Z Qian, W Liu, W Zhu
Proceedings of the 1st International Workshop on Multimedia Content …, 2023
Cited by: 1; Year: 2023
H2V4Sports: Real-Time Horizontal-to-Vertical Video Converter for Sports Lives via Fast Object Detection and Tracking
Y Han, K Li, Z Song, W Feng, X Cao, S Guo, X Wang, X Duan, W Zhu
Proceedings of the 31st ACM International Conference on Multimedia, 9376-9378, 2023
Cited by: 1; Year: 2023