IT-Universitetet i København
 
  Tilbage Kursusoversigt
Kursusbeskrivelse
Kursusnavn (dansk):Introduction to Image Analysis and Machine Learning, BSc 
Kursusnavn (engelsk):Introduction to Image Analysis and Machine Learning, BSc 
Semester:Forår 2017 
Udbydes under:Bachelor i softwareudvikling (bswu) 
Omfang i ECTS:7,50 
Kursussprog:Engelsk 
Kursushjemmeside:https://learnit.itu.dk 
Min. antal deltagere:15 
Forventet antal deltagere:
Maks. antal deltagere:30 
Formelle forudsætninger:The course assumes the student has the ability to program in some imperative programming language (Java, Python, C/C++/C#. This is normally obtained by following the first semester undergraduate course Grundlæggende Programmering (GPP) however some additional programming experience is beneficial.

The course uses concepts from mathematics. Fulfilling the general admittance requirements for BSWU ensures the mathematical prerequisites. 
Læringsmål:After the course, the student should be able to:

- Define, describe and relate concepts and mechanisms underpinning computer vision (CV) and machine learning (ML) methods and how they are related.

- Analyze and explain key aspects of building medium-sized computer vision applications.

- Explain, design and implement medium-sized interactive computer vision applications using concepts from ML.

- Evaluate, select and adapt appropriate computer vision and machine learning techniques by applying the theoretical concepts and practical techniques from the course.

- Clearly explain and employ basic linear algebra for computer vision and machine learning.

- Apply the theory and implement rudimentary research papers within CV and ML as expected on a bachelor level 
Fagligt indhold:The course is an introductory course to the basics of computer vision and machine learning. The objectives of this course are to provide students with the fundamental knowledge and skills required to design, build, and evolve smaller computer vision (CV) programs.

Computer Vision is the study of enabling machines to see and interpret the visual world through images and videos. Computer vision/image analysis and machine learning have in recent years played decisive roles in the development of new innovative applications based on images (e.g. various services provided by Google, Facebook, Microsoft, Snapchat etc). Having knowledge within computer vision and machine learning (e.g. deep learning) is an important skill needed in many modern innovative businesses and is likely to become even more important in the near future.

To this end the course introduces specific techniques from 2D and 3D image analysis and recent techniques from machine learning to solve computer vision problems. The course provides an introduction to fundamentals of image formation, camera imaging geometry, point processing, color spaces, feature detection and matching, multiview geometry, motion estimation /tracking, and object detection/ recognition.
Many successful and robust computer vision techniques (such as object recognition, tracking) heavily rely on concepts from signal analysis and machine learning. The course will introduce specific machine learning techniques and apply them to relevant computer vision problems such as recognition, matching and search.

Through the course the student should be able to use the technique in their own applications and within more advanced topics on computer vision, data science and pervasive computing.

Contents:
The course gives an introduction to computer vision, image analysis, linear algebra and programming. In the course we will present the fundamental models used for CV and machine learning as well as techniques to implement them. You will in the exercises and mandatory assignments be getting hands-on experience with the techniques described during the lectures. In the exercises we will use images from digital cameras and web cameras to illustrate the theory. Cameras will be available during the course.

In particular the course covers:

• Pixel-based and local processing of images (smoothing, edges, conversion between color spaces) and color image processing.
• Segmentation
• Object recognition.
• Geometric transformations (2D and 3D)
• Cameras, Stereo, structured light (Kinnect).
• Machine learning techniques applied to computer vision problems such as regression, classification techniques and will touch upon deep learning.


After completing this course, the students should be able to:

- Define, describe and relate concepts and mechanisms underpinning computer vision (CV) and machine learning (ML) methods and how they are related.

- Analyze and explain key aspects of building medium-sized computer vision applications.

- Explain, design and implement medium-sized interactive computer vision applications using concepts from ML.

- Evaluate, select and adapt appropriate computer vision and machine learning techniques by applying the theoretical concepts and practical techniques from the course.

- Clearly explain and employ basic linear algebra for computer vision and machine learning.

- Apply the theory and implement rudimentary research papers within CV and ML as expected on a bachelor level 
Læringsaktiviteter:

14 lectures + 14 exercise sessions.
Activities are divided between lectures and exercises however assignment work beyond the exercise hours must be expected. An average of 12 hours per week. Exercises are on a bachelor level.

You are expected to work systematically as expected on bachelor level. The course gives plenty of opportunities in the guided exercise session to gain hands-on experience with solving problems and with implementing algorithms in Python (or C# as the student desires).

We will be using sections from various references for the teaching material. 

Obligatoriske aktivititer:This course has 3 mandatory assignments that need to be approved to take the examination. Students will be given the opportunity to resubmit one improved version of any failed assignment during the semester. The deadline is one week after the first submission of this particular assignment.
The student will receive the grade NA (not approved) at the ordinary exam, if the mandatory activities are not approved and the student will use an exam attempt. 
Eksamensform og -beskrivelse:D22: Aflevering med mundtlig eksamen suppleret af aflevering., (7-scale, external exam)

The exam project consists of a final report contaning the report, code and result video and result images as expected on a bachelor level.

Duration of oral exam: 30 minutes.  

Litteratur udover forskningsartikler:tba