Automatic reclustering of objects in very large databases for high energy physics

K.J.G. Holtman, I.M. Willers, P.D.V. Stok, van der

Research output: Book/ReportReportAcademic

120 Downloads (Pure)

Abstract

In the very large object database systems planned for some future particle physics experiments, typical physics analysis jobs will traverse millions of read-only objects, many more objects than fit in the database cache. Thus, a good clustering of objects on disk is highly critical to database performance. We present the implementation and performance measurements of a prototype reclustering mechanism which was developed to optimise I/O performance under the changing access patterns in a high energy physics database. Reclustering is done automatically and on-line. The methods used by our prototype differ greatly from those commonly found in proposed general-purpose reclustering systems. By exploiting some special characteristics of the access patterns of physics analysis jobs, the prototype manages to keep database I/O throughput close to the optimum throughput of raw sequential disk access.
Original languageEnglish
Place of PublicationGeneva, Switzerland
PublisherCERN
Publication statusPublished - 1998

Publication series

NameCMS Conference Report
Volume1998/008

Fingerprint

Dive into the research topics of 'Automatic reclustering of objects in very large databases for high energy physics'. Together they form a unique fingerprint.

Cite this