URL study guide
https://tue.osiris-student.nl/onderwijscatalogus/extern/cursus?cursuscode=2AMD15&collegejaar=2025&taal=en
Description
We study models of contemporary data-intensive systems and their practical use. These models include parallel and distributed data processing (MapReduce, etc.), data stream management systems, and approximate query processing. We discuss why these models were introduced, their relative advantages and disadvantages, how to use them in practice, and, at a high level, how they are implemented. Unlike the course Engineering Data-Intensive Systems (2IMD10), which focuses primarily on system internals and their efficient implementation, the primary goal of this course is to develop the practical ability to engineer non-trivial data-intensive and big data applications based on a solid understanding of the underlying engineering principles. Towards this goal, hands-on practical assignment(s) using contemporary frameworks and technologies are a central component of the course. Students should be able to program in Python and/or Java.
Objectives
Students will:
(1) get to know the main characteristics and relevant research results for models of contemporary data-intensive and big data systems;
(2) understand the practical relevance of these models for engineering data-intensive and big data applications;
(3) understand the relative advantages and disadvantages of these models and acquire the ability to decide, based on a problem description, which model is best suited to solve this problem; and,
(4) be able to quickly master and make practical use of contemporary frameworks and technologies implementing these models.