Alexander Richard
Research Scientist, Facebook Reality Labs
Verified email at fb.com
Title
Cited by
Year
Temporal action detection using a statistical language model
A Richard, J Gall
Proceedings of the IEEE conference on computer vision and pattern …, 2016
Cited by: 256, Year: 2016
Weakly supervised action learning with rnn based fine-to-coarse modeling
A Richard, H Kuehne, J Gall
Proceedings of the IEEE conference on Computer Vision and Pattern …, 2017
Cited by: 232, Year: 2017
When will you do what?-anticipating temporal occurrences of activities
Y Abu Farha, A Richard, J Gall
Proceedings of the IEEE conference on computer vision and pattern …, 2018
Cited by: 190, Year: 2018
Neuralnetwork-viterbi: A framework for weakly supervised video learning
A Richard, H Kuehne, A Iqbal, J Gall
Proceedings of the IEEE conference on Computer Vision and Pattern …, 2018
Cited by: 147, Year: 2018
Weakly supervised learning of actions from transcripts
H Kuehne, A Richard, J Gall
Computer Vision and Image Understanding 163, 78-89, 2017
Cited by: 129, Year: 2017
Meshtalk: 3d face animation from speech using cross-modality disentanglement
A Richard, M Zollhöfer, Y Wen, F De la Torre, Y Sheikh
Proceedings of the IEEE/CVF International Conference on Computer Vision …, 2021
Cited by: 127, Year: 2021
Conditional diffusion probabilistic model for speech enhancement
YJ Lu, ZQ Wang, S Watanabe, A Richard, C Yu, Y Tsao
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
Cited by: 102, Year: 2022
Action sets: Weakly supervised action segmentation without ordering constraints
A Richard, H Kuehne, J Gall
Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2018
Cited by: 98, Year: 2018
A hybrid rnn-hmm approach for weakly supervised temporal action segmentation
H Kuehne, A Richard, J Gall
IEEE transactions on pattern analysis and machine intelligence 42 (4), 765-779, 2018
Cited by: 86, Year: 2018
Mean-normalized stochastic gradient for large-scale deep learning
S Wiesler, A Richard, R Schlüter, H Ney
2014 IEEE International Conference on Acoustics, Speech and Signal …, 2014
Cited by: 83, Year: 2014
Audio- and gaze-driven facial animation of codec avatars
A Richard, C Lea, S Ma, J Gall, F De la Torre, Y Sheikh
Proceedings of the IEEE/CVF winter conference on applications of computer …, 2021
Cited by: 68, Year: 2021
RASR/NN: The RWTH neural network toolkit for speech recognition
S Wiesler, A Richard, P Golik, R Schlüter, H Ney
2014 IEEE International Conference on Acoustics, Speech and Signal …, 2014
Cited by: 61, Year: 2014
A bag-of-words equivalent recurrent neural network for action recognition
A Richard, J Gall
Computer Vision and Image Understanding 156, 79-91, 2017
Cited by: 57, Year: 2017
Neural Synthesis of Binaural Speech From Mono Audio
A Richard, D Marković, ID Gebru, S Krenn, GA Butler, F De la Torre, Y Sheikh
International Conference on Learning Representations, 2021
Cited by: 45, Year: 2021
Multiface: A dataset for neural face rendering
C Wuu, N Zheng, S Ardisson, R Bali, D Belko, E Brockmeyer, L Evans, ...
arXiv preprint arXiv:2207.11243, 2022
Cited by: 39, Year: 2022
Deep impulse responses: Estimating and parameterizing filters with deep networks
A Richard, P Dodds, VK Ithapu
ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and …, 2022
Cited by: 26, Year: 2022
Audio-visual speech codecs: Rethinking audio-visual speech enhancement by re-synthesis
K Yang, D Marković, S Krenn, V Agrawal, A Richard
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022
Cited by: 26, Year: 2022
Implicit hrtf modeling using temporal convolutional networks
ID Gebru, D Marković, A Richard, S Krenn, GA Butler, F De la Torre, ...
ICASSP 2021-2021 IEEE International Conference on Acoustics, Speech and …, 2021
Cited by: 23, Year: 2021
Audiodec: An open-source streaming high-fidelity neural audio codec
YC Wu, ID Gebru, D Marković, A Richard
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
Cited by: 19, Year: 2023
Mining youtube-a dataset for learning fine-grained action concepts from webly supervised video data
H Kuehne, A Iqbal, A Richard, J Gall
arXiv preprint arXiv:1906.01012, 2019
Cited by: 18, Year: 2019