Reflections on Data Science (Spring 2021)

Spring 2021
Spring 2020 Spring 2021 Spring 2022 Spring 2023 Spring 2024 Spring 2025

Official course description:

Full info last published 12/03-21

Course info

Language:

English

ECTS points:

7.5

Course code:

BSREDAS1KU

Participants max:

Offered to guest students:

Offered to exchange students:

Offered as a single subject:

Programme

Level:

Bachelor

Programme:

BSc in Data Science

Staff

Course manager

Roberta Sinatra

Assistant Professor

Teacher

Tiago Oliveira Cunha

Postdoc

Course semester

Semester

Forår 2021

Start

1 February 2021

End

14 May 2021

Abstract

In this course you will learn to reflect on the use and societal implications of data, models and algorithms.

Description

Whether to provide predictions, information, assessments, or evidence of some sort, the use of data comes with important responsibilities, ethical concerns, social impact, and sometimes generates unintended consequences. How can we check that a claim based on data is plausible? How can we ensure that our data analysis is sound and reproducible? What are the technical and societal consequences of using biased data to train our algorithms? In this course we will explore these and other similar questions using real-world cases studies, and we will provide a set of concepts, approaches and tools

to think critically about the data, models and algorithms that constitute evidence in the social and natural sciences and that provide predictions of any sort, and
to reflect on the consequences and ethical concerns when using data.

This course will cover the following main topics:

Calling BS with data
Reproducibility
Reviewing causality and lying with statistics
Algorithmic bias
Traps in the use of big data
Diffusion processes important for society (fake news, epidemics, performance and success)

For each topic, the course will focus on the societal impact of the studied concepts, and will emphasize how we, data scientists, can ensure and promote data use that is correct, ethical and unbiased.

Formal prerequisites

This course is designed for 6^th semester Bachelor in Data science students, and as such builds on the knowledge acquired in the courses in the previous 5^th semesters.

Intended learning outcomes

After the course, the student should be able to:

Describe cases of misuse of data, and identify wrong or inaccurate claims using various appropriate tools
Apply tools to ensure the reproducibility of data results
Provide and discuss causal explanations based on data
Describe issues that can arise with the use of big data
Identify biased analyses and algorithms, and discuss possible solutions to correct for the biases
Apply theoretical concepts and approaches to think critically about the data and models that constitute evidence in the social and natural sciences,
Reflect on the benefits and drawbacks of using digital data in research, business, and in our everyday life

Learning activities

The courses consists of lectures and exercises. Beyond lectures and exercise sessions, we will have various online activities to be done before and after classes.

These online activities will include: readings and videos, discussion on forum, writing documents, statistical analyses. During the class, we will have group discussions, class discussions, quizzes, writing sessions, and various hands-on exercises based on the preparation activities done at home.

During the course the teachers will offer the opportunity to submit optional assignments (with specific format and deadlines) and receive feedback.

Mandatory activities

There are no mandatory activities. The students are however strongly encouraged to hand in assignments during the course to receive feedback.

The student will receive the grade NA (not approved) at the ordinary exam, if the mandatory activities are not approved and the student will use an exam attempt.

Course literature

Study materials will be provided during the course from multiple sources (book excerpts , research papers, videos)

Student Activity Budget

Estimated distribution of learning activities for the typical student

Preparation for lectures and exercises: 15%
Lectures: 25%
Exercises: 20%
Assignments: 10%
Project work, supervision included: 15%
Exam with preparation: 10%
Other: 5%

Ordinary exam

Exam type:
C: Submission of written work, Internal (7-point scale)
Exam variation:
C1G: Submission of written work for groups

Exam submission description:
Students will submit a (1) journal to document cases of data misuse and algorithmic bias, (2) a paper about a course exercise.

Detailed info about the exam submission will be given during the course.

Group submission:
Group

Group size 3-4.
Early in the course we will ask you to form pairs. The teachers will then match pairs as well as any non-paired student to form groups.