TY - GEN
T1 - On the usage of global document occurrences in Peer-to-Peer information systems
AU - Papapetrou, Odysseas
AU - Michel, Sebastian
AU - Bender, Matthias
AU - Weikum, Gerhard
PY - 2005/12/1
Y1 - 2005/12/1
N2 - There exist a number of approaches for query processing in Peer-to-Peer information systems that efficiently retrieve relevant information from distributed peers. However, very few of them take into consideration the overlap between peers: as the most popular resources (e.g., documents or files) are often present at most of the peers, a large fraction of the documents eventually received by the query initiator are duplicates. We develop a technique based on the notion of global document occurrences (GDO) that, when processing a query, penalizes frequent documents increasingly as more and more peers contribute their local results. We argue that the additional effort to create and maintain the GDO information is reasonably low, as the necessary information can be piggybacked onto the existing communication. Early experiments indicate that our approach significantly decreases the number of peers that have to be involved in a query to reach a certain level of recall and, thus, decreases user-perceived latency and the wastage of network resources.
AB - There exist a number of approaches for query processing in Peer-to-Peer information systems that efficiently retrieve relevant information from distributed peers. However, very few of them take into consideration the overlap between peers: as the most popular resources (e.g., documents or files) are often present at most of the peers, a large fraction of the documents eventually received by the query initiator are duplicates. We develop a technique based on the notion of global document occurrences (GDO) that, when processing a query, penalizes frequent documents increasingly as more and more peers contribute their local results. We argue that the additional effort to create and maintain the GDO information is reasonably low, as the necessary information can be piggybacked onto the existing communication. Early experiments indicate that our approach significantly decreases the number of peers that have to be involved in a query to reach a certain level of recall and, thus, decreases user-perceived latency and the wastage of network resources.
UR - http://www.scopus.com/inward/record.url?scp=33646703798&partnerID=8YFLogxK
U2 - 10.1007/11575771_21
DO - 10.1007/11575771_21
M3 - Conference contribution
AN - SCOPUS:33646703798
SN - 3540297367
SN - 9783540297369
T3 - Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
SP - 310
EP - 328
BT - On the Move to Meaningful Internet Systems 2005
T2 - Confederated International Conference On the Move to Meaningful Internet Systems, OTM 2005 held in conjunction with Conferences on CoopIS, CandTC and ODBASE 2005
Y2 - 31 October 2005 through 4 November 2005
ER -