A method for automating the extraction of specialized information from the web

L. Lin, A. Liotta, A. Hippisley

Research output: Chapter in Book/Report/Conference proceedingConference contributionAcademicpeer-review

3 Citations (Scopus)


The World Wide Web can be viewed as a gigantic distributed database including millions of interconnected hosts some of which publish information via web servers or peer-to-peer systems. We present here a novel method for the extraction of semantically rich information from the web in a fully automated fashion. We illustrate our approach via a proof-of-concept application which scrutinizes millions of web pages looking for clues as to the trend of the Chinese stock market. We present the outcomes of a 210-day long study which indicates a strong correlation between the information retrieved by our prototype and the actual market behavior.
Original languageEnglish
Title of host publicationComputational intelligence and security : international conference, CIS 2005, Xi'an, China, December 15-19, 2005 : proceedings, part I
EditorsY. Hao, J. Liu, Y. Wang, Y-M. Cheung, H. Yin, L. Jiao, j. Ma, Y-C. Jiao
Place of PublicationBerlin
ISBN (Print)3-540-30818-0
Publication statusPublished - 2005

Publication series

NameLecture Notes in Computer Science
ISSN (Print)0302-9743


Dive into the research topics of 'A method for automating the extraction of specialized information from the web'. Together they form a unique fingerprint.

Cite this