TY - GEN
T1 - The characteristics and performance of groups of jobs in grids
AU - Iosup, A.
AU - Jan, M.
AU - Sonmez, O.O.
AU - Epema, D.H.J.
PY - 2007
Y1 - 2007
N2 - Even though with few exceptions, grid workloads are dominated by single-node jobs, not all of these jobs are necessarily independent or unrelated. For instance, sets of jobs may be grouped because they are submitted by users in batches, e.g., to perform parameter sweeps. However, there is no reported data to confirm the presence and structure of these groupings, despite the large potential impact of such information. To address this lack of information, in this work we present a first investigation into the characteristics of groups of jobs present in grid workloads. First, we define three types of job groupings: batch, continued, and bursty submissions. Then, we analyze the characteristics of these groupings for three long-term traces from currently deployed grid environments. Notably, our results show that the various groupings are responsible for up to 96% of the total CPU time consumption. Finally, we present insights into the performance of real grids in dealing with grouped jobs.
AB - Even though with few exceptions, grid workloads are dominated by single-node jobs, not all of these jobs are necessarily independent or unrelated. For instance, sets of jobs may be grouped because they are submitted by users in batches, e.g., to perform parameter sweeps. However, there is no reported data to confirm the presence and structure of these groupings, despite the large potential impact of such information. To address this lack of information, in this work we present a first investigation into the characteristics of groups of jobs present in grid workloads. First, we define three types of job groupings: batch, continued, and bursty submissions. Then, we analyze the characteristics of these groupings for three long-term traces from currently deployed grid environments. Notably, our results show that the various groupings are responsible for up to 96% of the total CPU time consumption. Finally, we present insights into the performance of real grids in dealing with grouped jobs.
U2 - 10.1007/978-3-540-74466-5_42
DO - 10.1007/978-3-540-74466-5_42
M3 - Conference contribution
SN - 3-540-74465-7
T3 - Lecture Notes in Computer Science
SP - 382
EP - 393
BT - Euro-Par 2007 - Parallel Processing (13th International Euro-Par Conference, Rennes, France, August 28-31, 2007. Proceedings)
A2 - Kermarrec, A.M.
A2 - Bougé, L.
A2 - Priol, T.
PB - Springer
CY - Berlin
ER -