A gold standard methodology for evaluating accuracy in data-to-text systems C Thomson, E Reiter arXiv preprint arXiv:2011.03992, 2020 | 27 | 2020 |
Underreporting of errors in NLG output, and what to do about it E Van Miltenburg, MA Clinciu, O Dušek, D Gkatzia, S Inglis, L Leppänen, ... arXiv preprint arXiv:2108.01182, 2021 | 17 | 2021 |
SportSett:Basketball - a robust and maintainable data-set for natural language generation C Thomson, E Reiter, S Sripada Proceedings of the Workshop on Intelligent Information Processing and …, 2020 | 14 | 2020 |
Generation challenges: Results of the accuracy evaluation shared task C Thomson, E Reiter arXiv preprint arXiv:2108.05644, 2021 | 10 | 2021 |
Shared task on evaluating accuracy E Reiter, CA Thomson | 9 | 2020 |
Studying the impact of filling information gaps on the output quality of neural data-to-text CA Thomson, Z Zhao, SG Sripada | 5 | 2020 |
Comprehension driven document planning in natural language generation systems C Thomson, E Reiter, S Sripada Proceedings of The 11th International Natural Language Generation Conference, 2018 | 3 | 2018 |
GEMv2: Multilingual NLG benchmarking in a single line of code S Gehrmann, A Bhattacharjee, A Mahendiran, A Wang, A Papangelis, ... arXiv preprint arXiv:2206.11249, 2022 | 2 | 2022 |
The accuracy evaluation shared task as a retrospective reproduction study C Thomson, E Reiter Proceedings of the 15th International Conference on Natural Language …, 2022 | 1 | 2022 |
Barriers and enabling factors for error analysis in NLG research E Van Miltenburg, M Clinciu, O Dušek, D Gkatzia, S Inglis, L Leppänen, ... Northern European Journal of Language Technology 9 (1), 2023 | | 2023 |
Evaluating factual accuracy in complex data-to-text C Thomson, E Reiter, B Sundararajan Computer Speech & Language, 101482, 2023 | | 2023 |
Shared Task on Evaluating Accuracy in Natural Language Generation E Reiter, C Thomson arXiv preprint arXiv:2006.12234, 2020 | | 2020 |