Lijuan Wang
Microsoft GenAI
Verified email at microsoft.com
Title
Cited by
Year
Oscar: Object-Semantics Aligned Pre-training for Vision-Language Tasks
JG Xiujun Li, Xi Yin, Chunyuan Li, Pengchuan Zhang, Xiaowei Hu, Lei Zhang ...
European Conference on Computer Vision (ECCV), 2020
Cited by 1673*, 2020
Large Scale Incremental Learning
YF Yue Wu, Yinpeng Chen, Lijuan Wang, Yuancheng Ye, Zicheng Liu, Yandong Guo
The IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2019
Cited by 1097*, 2019
VinVL: Making Visual Representations Matter in Vision-Language Models
P Zhang, X Li, X Hu, J Yang, L Zhang, L Wang, Y Choi, J Gao
CVPR 2021, 2021
Cited by 921*, 2021
Florence: A new foundation model for computer vision
L Yuan, D Chen, YL Chen, N Codella, X Dai, J Gao, H Hu, X Huang, B Li, ...
arXiv preprint arXiv:2111.11432, 2021
Cited by 605, 2021
Grounded language-image pre-training
LH Li, P Zhang, H Zhang, J Yang, C Li, Y Zhong, L Wang, L Yuan, ...
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022
Cited by 565, 2022
Rethinking Classification and Localization for Object Detection
YF Yue Wu, Yinpeng Chen, Lu Yuan, Zicheng Liu, Lijuan Wang, Hongzhi Li
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2020
Cited by 559, 2020
End-to-End Human Pose and Mesh Reconstruction with Transformers
K Lin, L Wang, Z Liu
CVPR 2021, 2020
Cited by 529, 2020
End-to-end semi-supervised object detection with soft teacher
M Xu, Z Zhang, H Hu, J Wang, L Wang, F Wei, X Bai, Z Liu
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2021
Cited by 378, 2021
Real-time Animation for an Expressive Avatar
N Xu, L Wang, FKP Soong, X Liang, Q Luo, YQ Xu, X Zou
US Patent App. 12/950,801, 2012
Cited by 342, 2012
Refining of segmental boundaries in speech waveforms using contextual-dependent models
Y Zhao, M Chu, JL Zhou, L Wang
US Patent 7,496,512, 2009
Cited by 339, 2009
GIT: A generative image-to-text transformer for vision and language
J Wang, Z Yang, X Hu, L Li, K Lin, Z Gan, Z Liu, C Liu, L Wang
arXiv preprint arXiv:2205.14100, 2022
Cited by 298, 2022
Handwriting-based user interface for correction of speech recognition errors
L Wang, FKP Soong
US Patent App. 12/042,344, 2009
Cited by 280, 2009
Unnatural prosody detection in speech synthesis
Y Zhao, FKP Soong, M Chu, L Wang
US Patent 8,583,438, 2013
Cited by 263, 2013
An empirical study of training end-to-end vision-and-language transformers
ZY Dou, Y Xu, Z Gan, J Wang, S Wang, L Wang, C Zhu, P Zhang, L Yuan, ...
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022
Cited by 262, 2022
An empirical study of GPT-3 for few-shot knowledge-based VQA
Z Yang, Z Gan, J Wang, X Hu, Y Lu, Z Liu, L Wang
Proceedings of the AAAI Conference on Artificial Intelligence 36 (3), 3081-3089, 2022
Cited by 251, 2022
Mesh Graphormer
K Lin, L Wang, Z Liu
Proceedings of the IEEE/CVF international conference on computer vision …, 2021
Cited by 235, 2021
Scaling up vision-language pre-training for image captioning
X Hu, Z Gan, J Wang, Z Yang, Z Liu, Y Lu, L Wang
Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2022
Cited by 196, 2022
Speech and text driven HMM-based body animation synthesis
L Wang, L Ma, FKP Soong
US Patent 8,224,652, 2012
Cited by 183, 2012
MM-REACT: Prompting ChatGPT for multimodal reasoning and action
Z Yang, L Li, J Wang, K Lin, E Azarnasab, F Ahmed, Z Liu, C Liu, M Zeng, ...
arXiv preprint arXiv:2303.11381, 2023
Cited by 166, 2023
VIOLET: End-to-end video-language transformers with masked visual-token modeling
TJ Fu, L Li, Z Gan, K Lin, WY Wang, L Wang, Z Liu
arXiv preprint arXiv:2111.12681, 2021
Cited by 162, 2021