Who are my ancestors? : retrieving family relationships from historical texts

I. Efremova, A. Montes Garcia, A.J. Bolt Iriondo, T.G.K. Calders

Research output: Chapter in Book/Report/Conference proceedingConference contributionAcademicpeer-review

1 Citation (Scopus)
1 Downloads (Pure)

Abstract

This paper presents an approach for automatically retrieving family relationships from a real-world collection of Dutch historical notary acts. We aim to retrieve relationships like husband - wife, parent - child, widow of, etc. Our approach includes person names extraction, reference disambiguation, candidate generation and family relationship prediction. Since we have a limited amount of training data, we evaluate different feature configurations based on the n-gram analysis. The best results were obtained by using a combination of bi-grams and trigrams of words together with the distance in words between two names. We evaluate our results for each type of the relationships in terms of precision, recall and f - score.
Original languageEnglish
Title of host publicationInformation Retrieval
Subtitle of host publication9th Russian Summer School, RuSSIR 2015, Saint Petersburg, Russia, August 24-28, 2015, Revised Selected Papers
EditorsP. Braslavski , I. Markov, P. Pardalos, Y. Volkovich, D.I. Ignatov , S. Koltsov, O. Koltsova
Place of PublicationBerlin
PublisherSpringer
Pages121-129
ISBN (Electronic)978-3-319-41718-9
ISBN (Print)978-3-319-41717-2
DOIs
Publication statusPublished - 2015

Publication series

NameCommunications in Computer and Information Science
Volume573
ISSN (Print)1865-0929

Cite this