Official course description:

Full info last published 6/05-20
Course info
Language:
English
ECTS points:
7.5
Course code:
BBDAINT1KU
Participants max:
50
Offered to guest students:
yes
Offered to exchange students:
-
Offered as a single subject:
yes
Price for EU/EEA citizens (Single Subject):
10625 DKK
Programme
Level:
Bachelor
Programme:
BSc in Global Business Informatics
Staff
Course manager
Part-time Lecturer
Course Academic Responsible
Associate Professor, Head of study programme
Course semester
Semester
Forår 2020
Start
27 January 2020
End
31 August 2020
Exam
Exam type
ordinær
Internal/External
intern censur
Grade Scale
7-trinsskala
Exam Language
GB
Abstract

The course aims to train the students in conducting a thorough and valid analysis of online data sources with the use of basic programming, statistics and business intelligence tools. 

Description

The course is based on the assumption that we live in a world where the amount of data grows rapidly, but where the data also exposes more information. To keep up with this development, it is therefore necessary to equip ourselves with tools and techniques that dramatically speed up our interaction with increasingly complex data sources.

The course aims to provide the students with tools to quickly answer meaningful questions about data by 1) minimising the amount of time it takes to arrive at the answer and 2) maximising the relevance of the answer.

The course is structured in four blocks:

  1. Automating data collection tasks with Python
  2. Retrieving large amounts of data from online sources
  3. Analysing data with basic statistics and business intelligence
  4. Consolidating analysis validity for real-world problems

The first two blocks establishes programming tools that allow fetching and preprocessing large amounts of data from modern and large data sources, for instance Twitter’s API, WTO data sources and Danmarks Statistik. The second block focuses on the extraction of knowledge from the data, while ensuring that the knowledge is meaningful and relevant (valid).

Formal prerequisites

  • Knowledge about fundamental Python programming
  • Knowledge about database design and interaction
  • Knowledge about basic scientific theory

Intended learning outcomes

After the course, the student should be able to:

  • Write a Python program that extracts information from common data formats
  • Write a Python program that visually presents structured data
  • Discuss how to present information and findings using Python
  • Explain techniques for processing data in Python, given the size and format of the data
  • Explain the difference between databases in memory, on disk and distributed
  • Write a Python program that interacts with HTTP APIs using simple authentication methods
  • Account for basic statistical measures and regression models
  • Explain the difference between precision, recall and accuracy
  • Discuss how sample populations relate to real-world populations
  • Reason about and describe a falsifiable question that can be addressed with a specific data source
  • Provide data-driven answers to falsifiable questions using statistical measures and regression models
  • Discuss the validity of analytical conclusion based on the method and data
Learning activities

April 24th, 2020: Exam changed due to the Covid-19 situation and the change to online exams.

The course will mainly consist of lectures, group work, and project work with a focus on active students’ participation and practical application of data handling techniques.

Course literature

Suggested literature:


Ordinary exam
Exam type:
C: Submission of written work
Exam variation:
C22: Submission of written work – Take home
Exam submission description:
Submission of up to five pages of Python code analyzing data provided by ITU.

All aids allowed (open book exam).
Take home duration:
1 day

Time and date