Official course description:

Full info last published 15/05-22
Course info
Language:
English
ECTS points:
15.0
Course code:
KSADDAS1KU
Participants max:
20
Offered to guest students:
yes
Offered to exchange students:
yes
Offered as a single subject:
yes
Price for EU/EEA citizens (Single Subject):
21250 DKK
Programme
Level:
MSc. Master
Programme:
MSc in Computer Science
Staff
Course manager
Associate Professor
Teacher
PhD student
Teacher
PhD student
Course semester
Semester
Efterår 2022
Start
29 August 2022
End
31 January 2023
Exam
Exam type
ordinær
Internal/External
ekstern censur
Grade Scale
7-trinsskala
Exam Language
GB
Abstract

In this course, you will both learn state-of-the-art techniques that power state-of-the-art data-intensive applications and systems running on modern hardware and get to apply these techniques on a modern data-intensive system.

Description

To transform the sheer amount of complex data into timely discoveries that influence the society, data-intensive systems (including database systems, big data processing systems and machine learning frameworks) must utilize the full processing power offered by modern processor and storage technologies.

In this course, you will learn the state-of-the-art techniques for data management and processing on modern hardware (multicores, hardware accelerators, microsecond-scale storage, and 100 GBE).

In parallel, you will apply some of these techniques on widely-used open-source data-intensive systems.

As a result, you will get hands-on experience with how to design, implement, and evaluate new components of an open-source data-intensive system.


Formal prerequisites

If you are taking this course as part of Data Systems specialization of ITU Computer Science MSc, then Computer Systems Performance class is a pre-requisite.

On the other hand, while the knowledge gained from Computer Systems Performance is very useful in this class, the course is also open to students who hasn't taken Computer Systems Performance. Therefore, if you wish to learn about or interested in research in advanced data systems topics, don't hesitate to register.

Intended learning outcomes

After the course, the student should be able to:

  • Analyze the functional and performance requirements of a data-intensive system (e.g., database system or machine learning platform)
  • Navigate the codebase of complex production-grade open-source software
  • Design and implement components in the context of a production-grade data system
  • * Evaluate the performance characteristics of a software system
  • Reflect upon the evolution of the hardware (processors, storage, networks) and its impact on the landscape of data-intensive systems and applications
  • Reflect upon research papers published by others and present research work to a broad technical audience
Learning activities

The course is composed of lectures and exercise sessions.

  • The lectures will have presentations and discussions of recent research papers (done by both the lecturers, external invited speakers, and students) that focus on data systems on modern hardware.
  • Each week, we will have a specific topic to focus on in terms of papers.
  • The exercise sessions will focus on project assignments. Therefore, in the distribution of time below for different class activities, exercises and project time add up to the total time spent on projects.


Course literature

The course literature is published on the course page on LearnIT.

There will be research papers to read each week on that week's topic.


Student Activity Budget
Estimated distribution of learning activities for the typical student
  • Preparation for lectures and exercises: 20%
  • Lectures: 25%
  • Exercises: 25%
  • Project work, supervision included: 25%
  • Exam with preparation: 5%
Ordinary exam
Exam type:
D: Submission of written work with following oral, External (7-point scale)
Exam variation:
D22: Submission with following oral exam supplemented by the submission.
Exam submission description:
Students submit a report based on the project assignments. This report is submitted for the exam.
Exam duration per student for the oral exam:
20 minutes

Time and date