VTimeLLM: Empower LLM to grasp video moments B Huang, X Wang, H Chen, Z Song, W Zhu CVPR 2024, 2024 | 36 | 2024 |
Global-Local GraphFormer: Towards Better Understanding of User Intentions in Sequential Recommendation H Chen, B Huang, X Wang, Y Zhou, W Zhu Proceedings of the 5th ACM International Conference on Multimedia in Asia, 1-7, 2023 | 2 | 2023 |
Commonsense Learning: An Indispensable Path towards Human-centric Multimedia B Huang, S Tang, G Shen, G Li, X Wang, W Zhu Proceedings of the 1st International Workshop on Human-centric Multimedia …, 2020 | 2 | 2020 |
Multi-Modal Generative AI: Multi-modal LLM, Diffusion and Beyond H Chen, X Wang, Y Zhou, B Huang, Y Zhang, W Feng, H Chen, Z Zhang, ... arXiv preprint arXiv:2409.14993, 2024 | | 2024 |
Neighbor Does Matter: Curriculum Global Positive-Negative Sampling for Vision-Language Pre-training B Huang, F He, Q Wang, H Chen, G Li, Z Feng, X Wang, W Zhu ACM Multimedia 2024, 2024 | | 2024 |