Bloom: A 176b-parameter open-access multilingual language model T Le Scao, A Fan, C Akiki, E Pavlick, S Ilić, D Hesslow, R Castagné, ... | 1169 | 2023 |
Gender coreference and bias evaluation at wmt 2020 T Kocmi, T Limisiewicz, G Stanovsky Proceedings of the Fifth Conference on Machine Translation, 357-364, 2020 | 33 | 2020 |
Universal dependencies according to BERT: both more specific and more general T Limisiewicz, R Rosa, D Mareček arXiv preprint arXiv:2004.14620, 2020 | 16 | 2020 |
A balanced data approach for evaluating cross-lingual transfer: Mapping the linguistic blood bank D Malkin, T Limisiewicz, G Stanovsky arXiv preprint arXiv:2205.04086, 2022 | 15 | 2022 |
Introducing orthogonal constraint in structural probes T Limisiewicz, D Mareček arXiv preprint arXiv:2012.15228, 2020 | 10 | 2020 |
Syntax Representation in Word Embeddings and Neural Networks--A Survey T Limisiewicz, D Mareček arXiv preprint arXiv:2010.01063, 2020 | 9 | 2020 |
Don't Forget About Pronouns: Removing Gender Bias in Language Models Without Losing Factual Gender Information T Limisiewicz, D Mareček arXiv preprint arXiv:2206.10744, 2022 | 8 | 2022 |
Debiasing algorithm through model adaptation T Limisiewicz, D Mareček, T Musil arXiv preprint arXiv:2310.18913, 2023 | 5 | 2023 |
Tokenization impacts multilingual language modeling: Assessing vocabulary allocation and overlap across languages T Limisiewicz, J Balhar, D Mareček arXiv preprint arXiv:2305.17179, 2023 | 3 | 2023 |
You can have your data and balance it too: towards balanced and efficient multilingual models T Limisiewicz, D Malkin, G Stanovsky arXiv preprint arXiv:2210.07135, 2022 | 3 | 2022 |
Hidden in the Layers: Interpretation of Neural Networks for Natural Language Processing D Mareček, J Libovický, T Musil, R Rosa, T Limisiewicz Ústav formální a aplikované lingvistiky, 2020 | 2 | 2020 |
Breaking the Curse of Multilinguality with Cross-lingual Expert Language Models T Blevins, T Limisiewicz, S Gururangan, M Li, H Gonen, NA Smith, ... arXiv preprint arXiv:2401.10440, 2024 | 1 | 2024 |
Exploring the Impact of Training Data Distribution and Subword Tokenization on Gender Bias in Machine Translation B Iluz, T Limisiewicz, G Stanovsky, D Mareček arXiv preprint arXiv:2309.12491, 2023 | 1 | 2023 |
Ufal submission for sigtyp supervised cognate detection task T Limisiewicz Proceedings of the 5th Workshop on Research in Computational Linguistic …, 2023 | 1 | 2023 |
Examining Cross-lingual Contextual Embeddings with Orthogonal Structural Probes T Limisiewicz, D Mareček arXiv preprint arXiv:2109.04921, 2021 | 1 | 2021 |
MYTE: Morphology-Driven Byte Encoding for Better and Fairer Multilingual Language Modeling T Limisiewicz, T Blevins, H Gonen, O Ahia, L Zettlemoyer arXiv preprint arXiv:2403.10691, 2024 | | 2024 |
Hidden in the Layers D Mareček, J Libovický, R Rosa, T Musil, T Limisiewicz | | 2020 |
Interpreting and Controlling Linguistic Features in Neural Networks’ Representations T Limisiewicz | | |