Follow
Luca Soldaini
Luca Soldaini
Allen Institute for AI
Verified email at allenai.org - Homepage
Title
Cited by
Cited by
Year
QuickUMLS: a fast, unsupervised approach for medical concept extraction
L Soldaini, N Goharian
Medical Information Retrieval (MedIR) Workshop at SIGIR 2016, 2016
2682016
SMHD: a large-scale resource for exploring online language usage for multiple mental health conditions
A Cohan, B Desmet, A Yates, L Soldaini, S MacAvaney, N Goharian
arXiv preprint arXiv:1806.05258, 2018
1852018
Olmo: Accelerating the science of language models
D Groeneveld, I Beltagy, P Walsh, A Bhagia, R Kinney, O Tafjord, AH Jha, ...
🏆 Best Paper Award 🏆 ACL 2024, 2024
176*2024
Dolma: An open corpus of three trillion tokens for language model pretraining research
L Soldaini, R Kinney, A Bhagia, D Schwenk, D Atkinson, R Authur, ...
🏆 Best Paper Award 🏆 ACL 2024., 2024
136*2024
Don’t parse, generate! a sequence to sequence architecture for task-oriented semantic parsing
S Rongali, L Soldaini, E Monti, W Hamza
Proceedings of the web conference 2020, 2962-2968, 2020
1192020
The semantic scholar open data platform
R Kinney, C Anastasiades, R Authur, I Beltagy, J Bragg, A Buraczynski, ...
arXiv preprint arXiv:2301.10140, 2023
1032023
What's In My Big Data?
Y Elazar, A Bhagia, I Magnusson, A Ravichander, D Schwenk, A Suhr, ...
arXiv preprint arXiv:2310.20707, 2023
672023
Enhancing web search in the medical domain via query clarification
L Soldaini, A Yates, E Yom-Tov, O Frieder, N Goharian
Information Retrieval Journal 19, 149-173, 2016
592016
The cascade transformer: an application for efficient answer sentence selection
L Soldaini, A Moschitti
arXiv preprint arXiv:2005.02534, 2020
572020
Rsdd-time: Temporal annotation of self-reported mental health diagnoses
S MacAvaney, B Desmet, A Cohan, L Soldaini, A Yates, A Zirikly, ...
arXiv preprint arXiv:1806.07916, 2018
512018
Datacomp-lm: In search of the next generation of training sets for language models
J Li, A Fang, G Smyrnis, M Ivgi, M Jordan, S Gadre, H Bansal, E Guha, ...
arXiv preprint arXiv:2406.11794, 2024
48*2024
Retrieving medical literature for clinical decision support
L Soldaini, A Cohan, A Yates, N Goharian, O Frieder
Advances in Information Retrieval: 37th European Conference on IR Research …, 2015
472015
One-shot labeling for automatic relevance estimation
S MacAvaney, L Soldaini
Proceedings of the 46th International ACM SIGIR Conference on Research and …, 2023
452023
Teaching a new dog old tricks: Resurrecting multilingual retrieval using zero-shot learning
S MacAvaney, L Soldaini, N Goharian
Advances in Information Retrieval: 42nd European Conference on IR Research …, 2020
352020
Scim: Intelligent skimming support for scientific papers
R Fok, H Kambhamettu, L Soldaini, J Bragg, K Lo, M Hearst, A Head, ...
Proceedings of the 28th International Conference on Intelligent User …, 2023
342023
Learning to rank for consumer health search: a semantic approach
L Soldaini, N Goharian
Advances in Information Retrieval: 39th European Conference on IR Research …, 2017
332017
Answer generation for retrieval-based question answering systems
CC Hsu, E Lind, L Soldaini, A Moschitti
arXiv preprint arXiv:2106.00955, 2021
282021
Matching Citation Text and Cited Spans in Biomedical Literature: a Search-Oriented Approach
A Cohan, L Soldaini, N Goharian
North American Chapter of the Association for Computational Linguistics …, 2015
272015
peS2o (Pretraining Efficiently on S2ORC) Dataset
L Soldaini, K Lo
Allen Institute for AI, Tech. Rep, 2023
262023
Overview of the TREC 2023 NeuCLIR Track
D Lawrie, S MacAvaney, J Mayfield, P McNamee, DW Oard, L Soldaini, ...
arXiv preprint arXiv:2404.08071, 2024
252024
The system can't perform the operation now. Try again later.
Articles 1–20