Jade Copet
Title
Cited by
Year
Code llama: Open foundation models for code
B Roziere, J Gehring, F Gloeckle, S Sootla, I Gat, XE Tan, Y Adi, J Liu, ...
arXiv preprint arXiv:2308.12950, 2023
Cited by 289 · Year 2023
On generative spoken language modeling from raw audio
K Lakhotia, E Kharitonov, WN Hsu, Y Adi, A Polyak, B Bolte, TA Nguyen, ...
Transactions of the Association for Computational Linguistics 9, 1336-1354, 2021
Cited by 218 · Year 2021
Speech resynthesis from discrete disentangled self-supervised representations
A Polyak, Y Adi, J Copet, E Kharitonov, K Lakhotia, WN Hsu, A Mohamed, ...
arXiv preprint arXiv:2104.00355, 2021
Cited by 200 · Year 2021
High fidelity neural audio compression
A Défossez, J Copet, G Synnaeve, Y Adi
arXiv preprint arXiv:2210.13438, 2022
Cited by 189 · Year 2022
Audiogen: Textually guided audio generation
F Kreuk, G Synnaeve, A Polyak, U Singer, A Défossez, J Copet, D Parikh, ...
arXiv preprint arXiv:2209.15352, 2022
Cited by 136 · Year 2022
Simple and controllable music generation
J Copet, F Kreuk, I Gat, T Remez, D Kant, G Synnaeve, Y Adi, A Défossez
Advances in Neural Information Processing Systems 36, 2024
Cited by 79 · Year 2024
Text-free prosody-aware generative spoken language modeling
E Kharitonov, A Lee, A Polyak, Y Adi, J Copet, K Lakhotia, TA Nguyen, ...
arXiv preprint arXiv:2109.03264, 2021
Cited by 73 · Year 2021
Generative spoken dialogue language modeling
TA Nguyen, E Kharitonov, J Copet, Y Adi, WN Hsu, A Elkahky, ...
Transactions of the Association for Computational Linguistics 11, 250-266, 2023
Cited by 47 · Year 2023
Textless Speech Emotion Conversion using Discrete and Decomposed Representations
F Kreuk, A Polyak, J Copet, E Kharitonov, TA Nguyen, M Rivière, WN Hsu, ...
arXiv preprint arXiv:2111.07402, 2021
Cited by 47 · Year 2021
Stop: A dataset for spoken task oriented semantic parsing
P Tomasello, A Shrivastava, D Lazar, PC Hsu, D Le, A Sagar, A Elkahky, ...
2022 IEEE Spoken Language Technology Workshop (SLT), 991-998, 2023
Cited by 23 · Year 2023
Textually pretrained speech language models
M Hassid, T Remez, TA Nguyen, I Gat, A Conneau, F Kreuk, J Copet, ...
Advances in Neural Information Processing Systems 36, 2024
Cited by 13 · Year 2024
ASR4REAL: An extended benchmark for speech models
M Riviere, J Copet, G Synnaeve
arXiv preprint arXiv:2110.08583, 2021
Cited by 13 · Year 2021
Brouhaha: multi-task training for voice activity detection, speech-to-noise ratio, and C50 room acoustics estimation
M Lavechin, M Métais, H Titeux, A Boissonnet, J Copet, M Rivière, ...
2023 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU), 1-7, 2023
Cited by 10 · Year 2023
Expresso: A benchmark and analysis of discrete expressive speech resynthesis
TA Nguyen, WN Hsu, A d'Avirro, B Shi, I Gat, M Fazel-Zarani, T Remez, ...
arXiv preprint arXiv:2308.05725, 2023
Cited by 8 · Year 2023
On the robustness of self-supervised representations for spoken language modeling
I Gat, F Kreuk, A Lee, J Copet, G Synnaeve, E Dupoux, Y Adi
arXiv preprint arXiv:2209.15483, 2022
Cited by 7 · Year 2022
textless-lib: A library for textless spoken language processing
E Kharitonov, J Copet, K Lakhotia, TA Nguyen, P Tomasello, A Lee, ...
arXiv preprint arXiv:2202.07359, 2022
Cited by 7 · Year 2022
Augmentation invariant discrete representation for generative spoken language modeling
I Gat, F Kreuk, TA Nguyen, A Lee, J Copet, G Synnaeve, E Dupoux, Y Adi
Proceedings of the 20th International Conference on Spoken Language …, 2023
Cited by 5 · Year 2023
Audio language modeling using perceptually-guided discrete representations
F Kreuk, Y Taigman, A Polyak, J Copet, G Synnaeve, A Défossez, Y Adi
arXiv preprint arXiv:2211.01223, 2022
Cited by 4 · Year 2022
Do Coarser Units Benefit Cluster Prediction-Based Speech Pre-Training?
A Elkahky, WN Hsu, P Tomasello, TA Nguyen, R Algayres, Y Adi, J Copet, ...
ICASSP 2023-2023 IEEE International Conference on Acoustics, Speech and …, 2023
Cited by 1 · Year 2023
Masked Audio Generation using a Single Non-Autoregressive Transformer
A Ziv, I Gat, GL Lan, T Remez, F Kreuk, A Défossez, J Copet, G Synnaeve, ...
arXiv preprint arXiv:2401.04577, 2024
Year 2024