Semantic proposal for activity localization in videos via sentence query S Chen, YG Jiang Proceedings of the AAAI Conference on Artificial Intelligence 33 (01), 8199-8206, 2019 | 204 | 2019 |
Black-box adversarial attacks on video recognition models L Jiang, X Ma, S Chen, J Bailey, YG Jiang Proceedings of the 27th ACM International Conference on Multimedia, 864-872, 2019 | 155 | 2019 |
Motion guided spatial attention for video captioning S Chen, YG Jiang Proceedings of the AAAI Conference on Artificial Intelligence 33 (01), 8191-8198, 2019 | 151 | 2019 |
Learning modality interaction for temporal sentence localization and event captioning in videos S Chen, W Jiang, W Liu, YG Jiang Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23 …, 2020 | 109 | 2020 |
MSMDFusion: Fusing LiDAR and Camera at Multiple Scales with Multi-Depth Seeds for 3D Object Detection Y Jiao, Z Jie, S Chen, J Chen, X Wei, L Ma, YG Jiang Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 89 | 2023 |
Towards Bridging Event Captioner and Sentence Localizer for Weakly Supervised Dense Event Captioning S Chen, YG Jiang Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021 | 74 | 2021 |
Motion Guided Region Message Passing for Video Captioning S Chen, YG Jiang Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2021 | 68 | 2021 |
Hierarchical Visual-Textual Graph for Temporal Activity Localization via Language S Chen, YG Jiang Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23 …, 2020 | 59 | 2020 |
Deep Learning for Video Captioning: A Review S Chen, T Yao, YG Jiang Proceedings of the 28th International Joint Conference on Artificial …, 2019 | 59 | 2019 |
Non-local NetVLAD encoding for video classification Y Tang, X Zhang, L Ma, J Wang, S Chen, YG Jiang The 2nd Workshop on YouTube-8M Large-Scale Video Understanding (ECCV'18), 2018 | 45 | 2018 |
MORE: Multi-order relation mining for dense captioning in 3D scenes Y Jiao, S Chen, Z Jie, J Chen, L Ma, YG Jiang Computer Vision–ECCV 2022: 17th European Conference, Tel Aviv, Israel …, 2022 | 40 | 2022 |
Scene graph refinement network for visual question answering T Qian, J Chen, S Chen, B Wu, YG Jiang IEEE Transactions on Multimedia 25, 3950-3961, 2022 | 37 | 2022 |
LLaVA-MoLE: Sparse mixture of LoRA experts for mitigating data conflicts in instruction finetuning MLLMs S Chen, Z Jie, L Ma arXiv preprint arXiv:2401.16160, 2024 | 35 | 2024 |
Aggregating frame-level features for large-scale video classification S Chen, X Wang, Y Tang, X Chen, Z Wu, YG Jiang CVPR'17 Workshop on YouTube-8M Large-Scale Video Understanding, 2017 | 29 | 2017 |
FT-TDR: Frequency-guided Transformer and Top-Down Refinement Network for Blind Face Inpainting J Wang, S Chen, Z Wu, YG Jiang IEEE Transactions on Multimedia, 2022 | 28 | 2022 |
Self-supervised learning for semi-supervised temporal language grounding F Luo, S Chen, J Chen, Z Wu, YG Jiang IEEE Transactions on Multimedia, 2022 | 15 | 2022 |
Towards Bridging Video and Language by Caption Generation and Sentence Localization S Chen Proceedings of the 29th ACM International Conference on Multimedia, 2964-2968, 2021 | 7 | 2021 |
Lumen: Unleashing Versatile Vision-Centric Capabilities of Large Multimodal Models Y Jiao, S Chen, Z Jie, J Chen, L Ma, YG Jiang Advances in Neural Information Processing Systems 37 (NeurIPS 2024), 2024 | 6 | 2024 |
Instance-aware Multi-Camera 3D Object Detection with Structural Priors Mining and Self-Boosting Learning Y Jiao, Z Jie, S Chen, L Cheng, J Chen, L Ma, YG Jiang AAAI 2024, 2023 | 6 | 2023 |
System and method for video captioning Y Jiang, S Chen US Patent 10,699,129, 2020 | 6 | 2020 |