Roadrunner: Towards automatic data extraction from large web sites V Crescenzi, G Mecca, P Merialdo VLDB 1, 109-118, 2001 | 1574 | 2001 |
Automatic information extraction from large websites V Crescenzi, G Mecca Journal of the ACM (JACM) 51 (5), 731-779, 2004 | 249 | 2004 |
Grammars have exceptions V Crescenzi, G Mecca Information Systems 23 (8), 539-565, 1998 | 222 | 1998 |
Automatic annotation of data extracted from large Web sites. L Arlotta, V Crescenzi, G Mecca, P Merialdo WebDB, 7-12, 2003 | 167 | 2003 |
Clustering web pages based on their structure V Crescenzi, P Merialdo, P Missier Data & Knowledge Engineering 54 (3), 279-299, 2005 | 120 | 2005 |
Roadrunner: automatic data extraction from data-intensive web sites V Crescenzi, G Mecca, P Merialdo Proceedings of the 2002 ACM SIGMOD international conference on Management of …, 2002 | 102 | 2002 |
Probabilistic Models to Reconcile Complex Data from Inaccurate Data Sources. L Blanco, V Crescenzi, P Merialdo, P Papotti CAiSE 6051, 83-97, 2010 | 82 | 2010 |
Extraction and integration of partially overlapping web sources M Bronzi, V Crescenzi, P Merialdo, P Papotti Proceedings of the VLDB Endowment 6 (10), 805-816, 2013 | 77 | 2013 |
The (Short) Araneus Guide to Web-Site Development. G Mecca, P Merialdo, P Atzeni, V Crescenzi WebDB (Informal Proceedings), 13-18, 1999 | 55 | 1999 |
The ARANEUS Guide to Web-Site Development. G Mecca, P Merialdo, P Atzeni, V Crescenzi SEBD 1999, 167-177, 1999 | 47 | 1999 |
Automatic Web Information Extraction in the RoadRunner System V Crescenzi, G Mecca, P Merialdo Conceptual Modeling for New Information Systems Technologies: ER 2001 …, 2002 | 43 | 2002 |
Wrapping-oriented classification of web pages V Crescenzi, G Mecca, P Merialdo Proceedings of the 2002 ACM symposium on Applied computing, 1108-1112, 2002 | 41 | 2002 |
Wrapper inference for ambiguous web pages V Crescenzi, P Merialdo Applied Artificial Intelligence 22 (1-2), 21-52, 2008 | 35 | 2008 |
A framework for learning web wrappers from the crowd V Crescenzi, P Merialdo, D Qiu Proceedings of the 22nd international conference on World Wide Web, 261-272, 2013 | 34 | 2013 |
Web content extraction: a metaanalysis of its past and thoughts on its future T Weninger, R Palacios, V Crescenzi, T Gottron, P Merialdo ACM SIGKDD Explorations Newsletter 17 (2), 17-23, 2016 | 28 | 2016 |
Crawling programs for wrapper-based applications C Bertoli, V Crescenzi, P Merialdo 2008 IEEE International Conference on Information Reuse and Integration, 160-165, 2008 | 28 | 2008 |
Crowdsourcing for data management V Crescenzi, AAA Fernandes, P Merialdo, NW Paton Knowledge and Information Systems 53, 1-41, 2017 | 25 | 2017 |
Supporting the automatic construction of entity aware search engines L Blanco, V Crescenzi, P Merialdo, P Papotti Proceedings of the 10th ACM workshop on Web information and data management …, 2008 | 25 | 2008 |
Crowdsourcing large scale wrapper inference V Crescenzi, P Merialdo, D Qiu Distributed and Parallel Databases 33, 95-122, 2015 | 24 | 2015 |
Efficiently Locating Collections of Web Pages to Wrap. L Blanco, V Crescenzi, P Merialdo WEBIST, 247-254, 2005 | 21 | 2005 |