Follow
Shaoxiang Chen
Shaoxiang Chen
Meituan
Verified email at fudan.edu.cn
Title
Cited by
Cited by
Year
Semantic proposal for activity localization in videos via sentence query
S Chen, YG Jiang
Proceedings of the AAAI Conference on Artificial Intelligence 33 (01), 8199-8206, 2019
2042019
Black-box adversarial attacks on video recognition models
L Jiang, X Ma, S Chen, J Bailey, YG Jiang
Proceedings of the 27th ACM International Conference on Multimedia, 864-872, 2019
1552019
Motion guided spatial attention for video captioning
S Chen, YG Jiang
Proceedings of the AAAI Conference on Artificial Intelligence 33 (01), 8191-8198, 2019
1512019
Learning modality interaction for temporal sentence localization and event captioning in videos
S Chen, W Jiang, W Liu, YG Jiang
Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23 …, 2020
1092020
MSMDFusion: Fusing LiDAR and Camera at Multiple Scales with Multi-Depth Seeds for 3D Object Detection
Y Jiao, Z Jie, S Chen, J Chen, X Wei, L Ma, YG Jiang
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023
892023
Towards Bridging Event Captioner and Sentence Localizer for Weakly Supervised Dense Event Captioning
S Chen, YG Jiang
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2021
742021
Motion Guided Region Message Passing for Video Captioning
S Chen, YG Jiang
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2021
682021
Hierarchical Visual-Textual Graph for Temporal Activity Localization via Language
S Chen, YG Jiang
Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23 …, 2020
592020
Deep Learning for Video Captioning: A Review
S Chen, T Yao, YG Jiang
Proceedings of the 28th International Joint Conference on Artificial …, 2019
592019
Non-local netvlad encoding for video classification
Y Tang, X Zhang, L Ma, J Wang, S Chen, YG Jiang
The 2nd Workshop on YouTube-8M Large-Scale Video Understanding (ECCV'18), 2018
452018
More: Multi-order relation mining for dense captioning in 3d scenes
Y Jiao, S Chen, Z Jie, J Chen, L Ma, YG Jiang
Computer Vision–ECCV 2022: 17th European Conference, Tel Aviv, Israel …, 2022
402022
Scene graph refinement network for visual question answering
T Qian, J Chen, S Chen, B Wu, YG Jiang
IEEE Transactions on Multimedia 25, 3950-3961, 2022
372022
Llava-mole: Sparse mixture of lora experts for mitigating data conflicts in instruction finetuning mllms
S Chen, Z Jie, L Ma
arXiv preprint arXiv:2401.16160, 2024
352024
Aggregating frame-level features for large-scale video classification
S Chen, X Wang, Y Tang, X Chen, Z Wu, YG Jiang
CVPR'17 Workshop on YouTube-8M Large-Scale Video Understanding, 2017
292017
FT-TDR: Frequency-guided Transformer and Top-Down Refinement Network for Blind Face Inpainting
J Wang, S Chen, Z Wu, YG Jiang
IEEE Transactions on Multimedia, 2022
282022
Self-supervised learning for semi-supervised temporal language grounding
F Luo, S Chen, J Chen, Z Wu, YG Jiang
IEEE Transactions on Multimedia, 2022
152022
Towards Bridging Video and Language by Caption Generation and Sentence Localization
S Chen
Proceedings of the 29th ACM International Conference on Multimedia, 2964-2968, 2021
72021
Lumen: Unleashing Versatile Vision-Centric Capabilities of Large Multimodal Models
Y Jiao, S Chen, Z Jie, J Chen, L Ma, YG Jiang
Advances in Neural Information Processing Systems 37 (NeurIPS 2024), 2024
62024
Instance-aware Multi-Camera 3D Object Detection with Structural Priors Mining and Self-Boosting Learning
Y Jiao, Z Jie, S Chen, L Cheng, J Chen, L Ma, YG Jiang
AAAI 2024, 2023
62023
System and method for video captioning
Y Jiang, S Chen
US Patent 10,699,129, 2020
62020
The system can't perform the operation now. Try again later.
Articles 1–20