On classification of distorted images with deep convolutional neural networks Y Zhou, S Song, NM Cheung Acoustics, Speech and Signal Processing (ICASSP), 2017 IEEE International …, 2017 | 158 | 2017 |
Multimodal multi-stream deep learning for egocentric activity recognition S Song, V Chandrasekhar, B Mandal, L Li, JH Lim, G Sateesh Babu, ... Proceedings of the IEEE conference on computer vision and pattern …, 2016 | 97 | 2016 |
Egocentric activity recognition with multimodal fisher vector S Song, NM Cheung, V Chandrasekhar, B Mandal, J Liri 2016 IEEE International conference on acoustics, speech and signal …, 2016 | 47 | 2016 |
Activity recognition in egocentric life-logging videos S Song, V Chandrasekhar, NM Cheung, S Narayan, L Li, JH Lim Computer Vision-ACCV 2014 Workshops: Singapore, Singapore, November 1-2 …, 2015 | 45 | 2015 |
Defense against adversarial attacks with saak transform S Song, Y Chen, NM Cheung, CCJ Kuo arXiv preprint arXiv:1808.01785, 2018 | 30 | 2018 |
Vision-language pre-training for boosting scene text detectors S Song, J Wan, Z Yang, J Tang, W Cheng, X Bai, C Yao Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022 | 27 | 2022 |
Truly multi-modal youtube-8m video classification with video, audio, and text Z Wang, K Kuan, M Ravaut, G Manek, S Song, Y Fang, S Kim, N Chen, ... arXiv preprint arXiv:1706.05461, 2017 | 26 | 2017 |
Saak transform-based machine learning for light-sheet imaging of cardiac trabeculation Y Ding, V Gudapati, R Lin, Y Fei, RRS Packard, S Song, CC Chang, ... IEEE Transactions on Biomedical Engineering 68 (1), 225-235, 2020 | 23 | 2020 |
Deep Adaptive Temporal Pooling for Activity Recognition S Song, NM Cheung, V Chandrasekhar, B Mandal 2018 ACM Multimedia Conference on Multimedia Conference, 1829--1837, 2018 | 11 | 2018 |
Modeling entities as semantic points for visual information extraction in the wild Z Yang, R Long, P Wang, S Song, H Zhong, W Cheng, X Bai, C Yao Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2023 | 6 | 2023 |
OmniParser: A Unified Framework for Text Spotting Key Information Extraction and Table Recognition J Wan, S Song, W Yu, Y Liu, W Cheng, F Huang, X Bai, C Yao, Z Yang Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024 | 5 | 2024 |
ICDAR 2023 Competition on Born Digital Video Text Question Answering Z Yang, X Song, S Song, T Lu, X Bai, CL Liu, F Huang, C Yao International Conference on Document Analysis and Recognition, 508-521, 2023 | | 2023 |
Towards Multimodal and Secure Deep Learning for Human Activity Recognition from Multiple Views S Song Singapore University of Technology and Design, 2018 | | 2018 |