Follow
Kaitao Song
Kaitao Song
Verified email at microsoft.com
Title
Cited by
Cited by
Year
Pyramid vision transformer: A versatile backbone for dense prediction without convolutions
W Wang, E Xie, X Li, DP Fan, K Song, D Liang, T Lu, P Luo, L Shao
ICCV 2021, 2021
12472021
Mass: Masked sequence to sequence pre-training for language generation
K Song, X Tan, T Qin, J Lu, TY Liu
ICML 2019, 2019
8022019
Pvt v2: Improved baselines with pyramid vision transformer
W Wang, E Xie, X Li, DP Fan, K Song, D Liang, T Lu, P Luo, L Shao
Computational Visual Media 8 (3), 415-424, 2022
2742022
Mpnet: Masked and permuted pre-training for language understanding
K Song, X Tan, T Qin, J Lu, TY Liu
NeurIPS 2020, 2020
2642020
SongMASS: Automatic Song Writing with Pre-training and Alignment Constraint
Z Sheng, K Song, X Tan, Y Ren, W Ye, S Zhang, T Qin
AAAI 2021, 2020
262020
NAS-BERT: Task-Agnostic and Adaptive-Size BERT Compression with Neural Architecture Search
J Xu, X Tan, R Luo, K Song, J Li, T Qin, TY Liu
KDD 2021, 2021
252021
Bi-modal progressive mask attention for fine-grained recognition
K Song, XS Wei, X Shu, RJ Song, J Lu
IEEE Transactions on Image Processing 29, 7006-7018, 2020
252020
Generating adversarial examples with conditional generative adversarial net
P Yu, K Song, J Lu
2018 24th international conference on pattern recognition (ICPR), 676-681, 2018
212018
Double path networks for sequence to sequence learning
K Song, X Tan, D He, J Lu, T Qin, TY Liu
COLING 2018, 2018
152018
Hybrid self-attention network for machine translation
K Song, X Tan, F Peng, J Lu
arXiv preprint arXiv:1811.00253, 2018
122018
Analyzing and Mitigating Interference in Neural Architecture Search
J Xu, X Tan, K Song, R Luo, Y Leng, T Qin, TY Liu, J Li
ICML 2022, 2021
102021
LightPAFF: A two-stage distillation framework for pre-training and fine-tuning
K Song, H Sun, X Tan, T Qin, J Lu, H Liu, TY Liu
arXiv preprint arXiv:2004.12817, 2020
102020
DeepRapper: Neural Rap Generation with Rhyme and Rhythm Modeling
L Xue, K Song, D Wu, X Tan, NL Zhang, T Qin, WQ Zhang, TY Liu
ACL 2021, 2021
92021
Coarse-to-fine: A dual-view attention network for click-through rate prediction
K Song, Q Huang, F Zhang, J Lu
Knowledge-Based Systems 216, 106767, 2021
92021
Neural Machine Translation with Error Correction
K Song, X Tan, J Lu
IJCAI 2020, 2020
72020
Mixed-phoneme bert: Improving bert with mixed phoneme and sup-phoneme representations for text to speech
G Zhang, K Song, X Tan, D Tan, Y Yan, Y Liu, G Wang, W Zhou, T Qin, ...
INTERSPEECH 2022, 2022
52022
A study on the efficacy of model pre-training in developing neural text-to-speech system
G Zhang, Y Leng, D Tan, Y Qin, K Song, X Tan, S Zhao, T Lee
ICASSP 2022, 2021
22021
Task-agnostic and adaptive-size bert compression
J Xu, X Tan, R Luo, K Song, L Jian, T Qin, TY Liu
12021
Learning Domain Invariant Prompt for Vision-Language Models
C Zhao, Y Wang, X Jiang, Y Shen, K Song, D Li, D Miao
arXiv preprint arXiv:2212.04196, 2022
2022
SoftCorrect: Error Correction with Soft Detection for Automatic Speech Recognition
Y Leng, X Tan, W Liu, K Song, R Wang, XY Li, T Qin, E Lin, TY Liu
AAAI 2023, 2022
2022
The system can't perform the operation now. Try again later.
Articles 1–20