TY - GEN
T1 - Investigation of a baseline method for genealogical entity resolution
AU - Efremova, I.
AU - Ranjbar-Sahraei, B.
AU - Oliehoek, F.A.
AU - Calders, T.G.K.
AU - Tuyls, K.P.
PY - 2014
Y1 - 2014
N2 - In this paper we study the application of entity resolution (ER) techniques on a real-world multi-source genealogical dataset. Our goal is to identify all persons involved in various notary acts and link them to their birth, marriage and death certificates. In order to evaluate the performance of a baseline approach based on existing techniques, an interactive interface is developed for getting feedback from human experts in the field of genealogy. We perform an empirical evaluation in terms of precision, recall and F-score. We show that the baseline approach is not sufficient for our purposes and discuss future improvements.
AB - In this paper we study the application of entity resolution (ER) techniques on a real-world multi-source genealogical dataset. Our goal is to identify all persons involved in various notary acts and link them to their birth, marriage and death certificates. In order to evaluate the performance of a baseline approach based on existing techniques, an interactive interface is developed for getting feedback from human experts in the field of genealogy. We perform an empirical evaluation in terms of precision, recall and F-score. We show that the baseline approach is not sufficient for our purposes and discuss future improvements.
UR - https://socialhistory.org/sites/default/files/docs/efremova_et_al_-_baseline_method_entity_resolution.pdf
M3 - Conference contribution
BT - Workshop on 'Population Reconstruction', 19 February 2014, Amsterdam, The Netherlands
PB - International Institute for Social History (IISH)
CY - Amsterdam
ER -