Abstract
In this paper we study the application of entity resolution (ER) techniques on
a real-world multi-source genealogical dataset. Our goal is to identify all persons
involved in various notary acts and link them to their birth, marriage and death
certificates. In order to evaluate the performance of a baseline approach based on existing techniques, an interactive interface is developed for getting feedback from human experts in the field of genealogy. We perform an empirical evaluation in terms of precision, recall and F-score. We show that the baseline approach is not sufficient for our purposes and discuss future improvements.
Original language | English |
---|---|
Title of host publication | Workshop on 'Population Reconstruction', 19 February 2014, Amsterdam, The Netherlands |
Place of Publication | Amsterdam |
Publisher | International Institute for Social History (IISH) |
Publication status | Published - 2014 |