The AMTEx approach in the medical document indexing and retrieval application

  • Angelos Hliaoutakis
  • , Kaliope Zervanou
  • , Euripides G.M. Petrakis

Research output: Contribution to journalArticleAcademicpeer-review

Abstract

AMTEx is a medical document indexing method, specifically designed for the automatic indexing of documents in large medical collections, such as MEDLINE, the premier bibliographic database of the US National Library of Medicine (NLM). AMTEx combines MeSH, the terminological thesaurus resource of NLM, with a well-established method for extraction of terminology, the C/NC-value method. The performance evaluation of two AMTEx configurations is measured against the current state-of-the-art, the MetaMap Transfer (MMTx) method in four experiments, using two types of corpora: a subset of MEDLINE (PMC) full document corpus and a subset of MEDLINE (OHSUMED) abstracts, for each of the indexing and retrieval tasks, respectively. The experimental results demonstrate that AMTEx performs better in indexing in 20-50% of the processing time compared to MMTx, while for the retrieval task, AMTEx performs better in the full text (PMC) corpus.

Original languageEnglish
Pages (from-to)380-392
Number of pages13
JournalData & Knowledge Engineering
Volume68
Issue number3
DOIs
Publication statusPublished - 1 Mar 2009
Externally publishedYes

Funding

This work was supported by project TOWL (FP6-STREP, Project No. 026896) of the European Union (EU). We would like to thank Dimitris Makreas, MD of Greek National Health Care System, for his valuable contribution in this work. Dr. Makreas proposed a methodology for the intellectual evaluation of the PMC answer sets. He has furthermore carried out the relevance judgments for our experiments, a work of crucial importance for any well-founded evaluation of retrieval techniques.

Keywords

  • AMTEx
  • Document indexing
  • Medical document retrieval
  • MMTx
  • Term extraction

Fingerprint

Dive into the research topics of 'The AMTEx approach in the medical document indexing and retrieval application'. Together they form a unique fingerprint.

Cite this