Data Intelligence (Spring 2021)

Spring 2021
Spring 2020 Spring 2021 Spring 2022 Spring 2023 Spring 2024 Spring 2025

Official course description:

Full info last published 15/11-20

Course info

Language:

English

ECTS points:

7.5

Course code:

BBDAINT1KU

Participants max:

Offered to guest students:

yes

Offered to exchange students:

yes

Offered as a single subject:

yes

Price for EU/EEA citizens (Single Subject):

10625 DKK

Programme

Level:

Bachelor

Programme:

BSc in Global Business Informatics

Staff

Course manager

Jens Gwen Stein

Part-time Lecturer

Course Academic Responsible

Steffen Dalsgaard

Associate Professor, Head of study programme

Course semester

Semester

Forår 2021

Start

1 February 2021

End

14 May 2021

Abstract

The course aims to train the students in conducting a thorough and valid analysis of online data sources with the use of basic programming, statistics and business intelligence tools.

Description

This course is based on the assumption that we live in a world where the amount of data grows rapidly and where the need to be able to understand and analyze it becomes more and more pronounced. To keep up with this development it is therefore necessary to learn tools and techniques for interacting with these data sources and understanding the cultural contexts of the data. This course will teach you those tools and techniques using the programming language Python, basic statistics, and business intelligence.

The course aims to provide the students with tools to pose questions and get meaningful answers from data by minimising the amount of time it takes to arrive at the answer and maximising the relevance of the answer.

The course therefore focuses on two perspectives:

Data understanding: this will give the students a basic, theoretical understanding of data. We'll learn about different types of data, how you know which questions to pose, and which answers to expect. We'll also learn how to verify if the answers are meaningful and relevant, and how to measure their quality.
Data analysis: this will give the students a basic understanding of the concrete tools you can use for analysing data. We'll learn how to use the programming language Python for data analysis, how to translate your data questions in practice, how to make the procedure reproducible, and how to verify the answers you get from your code.

In order to work with the data sources of today, we'll also take some time for learning how to fetch and preprocess large amounts of data from modern and large data sources, both structured sources such as those exposed by and API and more loosely structured which must be fetched using web scraping techniques.

Formal prerequisites

Knowledge about fundamental Python programming
Knowledge about basic scientific theory

Intended learning outcomes

After the course, the student should be able to:

Write a Python program that extracts information from common data formats
Write a Python program that visually presents structured data
Discuss how to present information and findings using Python
Explain techniques for processing data in Python, given the size and format of the data
Write a Python program that interacts with HTTP APIs using simple authentication methods
Account for basic statistical measures and regression models
Explain the difference between statistical metrics such as precision, recall and accuracy
Discuss how sample populations relate to real-world populations
Reason about and describe a falsifiable question that can be addressed with a specific data source
Provide data-driven answers to falsifiable questions using statistical measures and regression models
Discuss the validity of an analytical conclusion based on the method and data

Learning activities

The course will mainly consist of lectures, group work, and project work with a focus on active students’ participation and practical application of data handling techniques.

Course literature

McKinney, Wes: Python for Data Analysis, O'Reilly Media, 2017
- https://wesmckinney.com/pages/book.html
Provost, Foster & Tom Fawcett: Data Science for Business, O'Reilly Media, 2013
- https://www.oreilly.com/library/view/data-science-for/9781449374273/
Ceder, Naomi: The Quick Python Book, Manning, 2018
- https://www.manning.com/books/the-quick-python-book-third-edition

Student Activity Budget

Estimated distribution of learning activities for the typical student

Preparation for lectures and exercises: 20%
Lectures: 25%
Exercises: 25%
Assignments: 20%
Exam with preparation: 10%

Ordinary exam

Exam type:
C: Submission of written work
Exam variation:
C22: Submission of written work – Take home

Exam submission description:
Submission of up to five pages of Python code analyzing data provided by ITU.

All aids allowed (open book exam).

Take home duration:
1 day