Data Visualisation and Data-driven Decision Making
AbstractThe course is focused on forming the basis of a data communicator. The students should recognize the various features of the human perception system, and operate around those limitations. We expect a student to be able to create effective visual encodings (charts), data-driven stories, dashboards and storyboards. The course will empower students with the underlying assumptions, guidelines and trappings of visualizing quantitative information. In doing so, the curriculum will put the field of data visualization in context; touching on its history from the first historic examples, right up until the propensity today of misleading people.
The clear communication of analytic results is the primary end goal of a scientific endeavor. The origin of a data science project is the question “What do we want to communicate and how?” The “how” heavily depends on data visualization, a core activity needed from the very early exploratory phases to the eventual explanatory phases. Moreover, creating visualizations is not simply an activity to wrap up a project at the end: it is rather an objective that informs all phases of data analysis, including the early stages of interrogating datasets to better understand them.
One cannot rely on the unprincipled application of graphical capabilities (and defaults of standard software) such as bar/line/pie charts, because each visual choice has repercussions in perception, interpretation, and internalization of information. In addition, the accumulation of smaller design decisions can result in a wide range of effectiveness. In light of these challenges, this course aims to substantially equip Data Science students with methods, tools and tactics for critical reasoning along their future paths of working with quantitative data.The following subjects will be covered in the course:
- Introduction– Why We Visualize and a History of Information Graphics
- Visual Vocabulary: A Tour through the Dataviz Zoo
- Human Perception and the Visual System
- The Details of Information Design
- Human Computer Interaction (HCI) & Decision Support
- Design Science and Design Thinking
- Geospatial Visualization
- Network Visualization
- Storytelling & Data-Driven Journalism
- Uncertainty: How to Visualize it
- Visual Literacy (Graphicacy) and Xenographics
- Ethics & Miscommunication – The Truth Continuum and How to Spot Lies
- Visual analytics / dashboarding
The course mandatory for BSc in Data Science fourth semester.
The course requires basic programming skills and basic statistical knowledge. The specific prerequisites are covered in the curricula of the courses "Introduction to Data Science and Programming" and "Applied Statistics" offered in the first year program of Data Science.
Intended learning outcomes
After the course, the student should be able to:
- Define data visualization (versus information graphics, or information visualization) and explain the differences in visualization purposes, from exhibitroy to exploratory and explanatory.
- Identify the most appropriate visualization strategy given the result of an data analysis process and an intended communication objective.
- Follow a checklist of established design guidelines when building a visualization to account for human perception, communicating uncertainty, and reducing potential miscommunication.
- Conversely, students should be able to readily identify instances where general guidelines should be broken, and provide practical and theoretically informed arguments supporting their design decisions.
- Produce alternative visualizations for the same message, compare their relative strengths and weaknesses, and make a motivated choice for the preferred one given the intended communication outcome.
- Identify deceptive usage of visual communication, or unintentionally misleading graphical representations of data (evidence). By learning to spot the most common techniques for lying with data, students also become more in-tune with ethical considerations and how not to visualize data.
- Generalize standard visualization techniques and customize them to better fit the visual literacy and/or intended communication outcome for a particular audience with domain-specific problems.
- Likewise, participants will explore methods for relentlessly expanding the truthfulness of their models when abstracting data from the real world.
Lectures, feedback, peer-to-peer feedback, project work, active students’ participation, reflection, practical exercises & case studies.
Cairo, A. (2016) The truthful art: Data, charts, and maps for communication (available at Academic Books på Søndre Campus (KU)). Additional readings will be provided through LearnIT.
Student Activity BudgetEstimated distribution of learning activities for the typical student
- Preparation for lectures and exercises: 14%
- Lectures: 33%
- Exercises: 33%
- Project work, supervision included: 14%
- Exam with preparation: 6%
Ordinary examExam type:
D: Submission of written work with following oral, internal (7-trinsskala)
D22: Submission of written work with following oral exam supplemented by the work submitted.
The students will develop an individual visualization project throughout the course. They will choose a dataset and an intended communication objective and apply the theory of data visualization to their objective. The final project can take any form the student finds appropriate for their communication objective including -- but not limiting to --: static visualization gallery (e.g. a paper), interactive visualization to explore data (e.g. a website), a poster.
The work will be submitted in learnIT before the examination, which will consist in an oral exam, where the student will present their communication objective and motivate their choices. Questions will be asked to connect the choices to the theoretical concepts presented during the course.