DQ-DETR: Dual query detection transformer for phrase extraction and grounding S Liu, S Huang, F Li, H Zhang, Y Liang, H Su, J Zhu, L Zhang Proceedings of the AAAI Conference on Artificial Intelligence 37 (2), 1728-1736, 2023 | 22 | 2023 |
Multi-modal contextual graph neural network for text visual question answering Y Liang, X Wang, X Duan, W Zhu 2020 25th International Conference on Pattern Recognition (ICPR), 3491-3498, 2021 | 10 | 2021 |
LUNA: Language as Continuing Anchors for Referring Expression Comprehension Y Liang, Z Yang, Y Tang, J Fan, Z Li, J Wang, PHS Torr, SL Huang Proceedings of the 31st ACM International Conference on Multimedia, 5174-5184, 2023 | 5 | 2023 |
Rca-noc: Relative contrastive alignment for novel object captioning J Fan, Y Liang, L Liu, S Huang, L Zhang Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2023 | 4 | 2023 |
Exploring iterative refinement with diffusion models for video grounding X Liang, T Shi, Y Liang, T Tao, SL Huang 2024 IEEE International Conference on Multimedia and Expo (ICME), 1-6, 2024 | 2 | 2024 |
SSLCL: An Efficient Model-Agnostic Supervised Contrastive Learning Framework for Emotion Recognition in Conversations T Shi, X Liang, Y Liang, X Tong, SL Huang arXiv preprint arXiv:2310.16676, 2023 | 2 | 2023 |
CoSTA: End-to-End Comprehensive Space-Time Entanglement for Spatio-Temporal Video Grounding Y Liang, X Liang, Y Tang, Z Yang, Z Li, J Wang, W Ding, SL Huang Proceedings of the AAAI Conference on Artificial Intelligence 38 (4), 3324-3332, 2024 | 1 | 2024 |
A Theoretical Framework for Data Efficient Multi-Source Transfer Learning Based on Cram\'er-Rao Bound Q Zhang, H Fu, G Huang, Y Liang, C Chu, T Peng, Y Wu, Q Li, Y Li, ... arXiv preprint arXiv:2502.04242, 2025 | | 2025 |
A Non-asymptotic Framework for Characterizing Dependency Structures in Multimodal Learning W Wang, T Shi, Y Liang, X Tong, SL Huang 2024 IEEE Information Theory Workshop (ITW), 543-548, 2024 | | 2024 |
Unleashing Region Understanding in Intermediate Layers for MLLM-based Referring Expression Generation Y Liang, Z Cai, J Xu, G Huang, Y Wang, X Liang, J Liu, Z Li, J Wang, ... The Thirty-eighth Annual Conference on Neural Information Processing Systems, 0 | | |
A Mathematical Framework for Characterizing Dependency Structures of Multimodal Learning W Wang, T Shi, Y Liang, X Xu, F Ma, SL Huang, L Zheng | | |