Who are my ancestors? : retrieving family relationships from historical texts

I. Efremova, A. Montes Garcia, A.J. Bolt Iriondo, T.G.K. Calders

    Research output: Chapter in Book/Report/Conference proceeding; Conference contribution; Academic; peer-review


    This paper presents an approach for automatically retrieving family relationships from a real-world collection of Dutch historical notary acts. We aim to retrieve relationships such as husband-wife, parent-child, and widow of. Our approach consists of person name extraction, reference disambiguation, candidate generation, and family relationship prediction. Because only a limited amount of training data is available, we evaluate different feature configurations based on n-gram analysis. The best results were obtained by combining word bigrams and trigrams with the distance in words between two names. We evaluate the results for each relationship type in terms of precision, recall, and F-score.
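
    The feature configuration named in the abstract (word bigrams and trigrams plus the distance in words between two names) can be illustrated with a minimal sketch. It is not the authors' implementation. The function names `ngrams` and `pair_features`, the choice to draw n-grams only from the words between the two names, the lowercasing, and the made-up English example are all assumptions for illustration; the paper may use a different context window and works on Dutch text.

```python
from typing import Dict, List


def ngrams(tokens: List[str], n: int) -> List[str]:
    """Return all contiguous word n-grams of length n, joined by spaces."""
    return [" ".join(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def pair_features(tokens: List[str], idx_a: int, idx_b: int) -> Dict[str, int]:
    """Features for a candidate pair of names at token positions idx_a and idx_b.

    Assumption: each name is a single token, and only the words strictly
    between the two names are used as context.
    """
    left, right = sorted((idx_a, idx_b))
    between = [t.lower() for t in tokens[left + 1:right]]

    features: Dict[str, int] = {}
    # Count word bigrams (n=2) and trigrams (n=3) in the context.
    for n in (2, 3):
        for gram in ngrams(between, n):
            key = f"{n}gram={gram}"
            features[key] = features.get(key, 0) + 1

    # Distance in words between the two names.
    features["distance"] = right - left - 1
    return features


# Hypothetical, made-up sentence (not taken from the notary acts).
example = ["Jan", "widower", "of", "the", "late", "Maria"]
example_features = pair_features(example, 0, 5)
```

    In this sketch, the resulting dictionary could be passed to any classifier that accepts sparse feature maps; which classifier the authors used is not stated in the abstract.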
    Original language: English
    Title of host publication: Information Retrieval
    Subtitle of host publication: 9th Russian Summer School, RuSSIR 2015, Saint Petersburg, Russia, August 24-28, 2015, Revised Selected Papers
    Editors: P. Braslavski, I. Markov, P. Pardalos, Y. Volkovich, D.I. Ignatov, S. Koltsov, O. Koltsova
    Place of Publication: Berlin
    ISBN (Electronic): 978-3-319-41718-9
    ISBN (Print): 978-3-319-41717-2
    Publication status: Published - 2015

    Publication series

    Name: Communications in Computer and Information Science
    ISSN (Print): 1865-0929
