Ontology-based information extraction from the world wide web

Jan Korst, Gijs Geleijnse, Nick De Jong, Michael Verschoor

Research output: Chapter in Book/Report/Conference proceedingChapterAcademicpeer-review

1 Citation (Scopus)

Abstract

We study possibilities to automatically extract information from the Internet, by structuring and combining data from web pages. The web pages are found with the use of a search engine and the information is structured by using ontologies. The ontologies are populated with the use of statistical and linguistic techniques. We present the results of a case study that is aimed at finding the names of famous persons. The results indicate that, even if we only use the summaries that Google provides of web pages, the approach results in a high precision and recall for the specific application.

Original languageEnglish
Title of host publicationIntelligent Algorithms in Ambient and Biomedical Computing
EditorsW. Verhaegh, E. Aarts, J. Korst
Place of PublicationBerlin
PublisherSpringer
Chapter10
Pages149-167
Number of pages19
ISBN (Print)9781402049538
DOIs
Publication statusPublished - 1 Dec 2006

Keywords

  • famous persons
  • Google
  • Information extraction
  • ontology
  • World Wide Web

Fingerprint Dive into the research topics of 'Ontology-based information extraction from the world wide web'. Together they form a unique fingerprint.

Cite this