IT-Universitetet i København
 
  Tilbage Kursusoversigt
Kursusbeskrivelse
Kursusnavn (dansk):Talekodning og -genkendelse 
Kursusnavn (engelsk):Speech Coding and Recognition 
Semester:Efterår 2005 
Udbydes under:cand.it., medieteknologi og spil (mtg) 
Omfang i ECTS:7,50 
Kursussprog:Engelsk 
Kursushjemmeside:http://www.itu.dk/courses/TKG/E2005/index.html 
Min. antal deltagere:
Forventet antal deltagere:25 
Maks. antal deltagere:50 
Formelle forudsætninger:Unfortunately, the course is cancelled. Interested students are most welcome to sign up for the project cluster "Speech Coding and Recognition".

Highest level of math from highschool or equivalent. Course: Signal Processing or equivalent. 
Læringsmål:The human sense of hearing and its ability to talk are very important means of communication, which are gaining importance for IT-systems. Having completed the course the student is able to explain and apply models for production, perception and recognition of speech, necessary for the understanding, construction and performance evaluation of IT-systems, which use speech as one of the input/output media. 
Fagligt indhold:Course Background:

Natural and synthetic speech is becoming increasingly important in IT-systems, where it, among others, is applied in automatic information delivery systems; in reservation and information retrieval systems; in animated movies and cartoon movies; and in teleconferencing systems. Furthermore it is expected to be important in future systems for immersive telepresence connecting geographically distant sites, and networked systems for E-commerce, maintenance, and monitoring.



Course Contents:

Models for Speech Production: The human vocal tract. Linear prediction used for parameter estimation. Parameters for the male/female, and child voice.



Models for Speech Perception:

The human ear. Frequency analysis and pitch perception. Intensity discrimination. Time/frequency masking. Sound localization and auditory perception. The interaction between visual and auditory information.



Speech Coding, Recognition:

Speech coding using the CELP (Code Excited Linear Prediction) algorithms.

Principles of MP3 audio coding.

Speech recognition using the HMM (Hidden Markov Model) algorithms.

Noise reduction of speech.



Performance Evaluation:

Estimation of the subjective quality of a speech based system.

Future applications in Quality of Service (QoS) measures.



Demonstrations of human ear psychoacoustic properties important for coding of audio and speech.



Hands-on exercises on:

Spectral Analysis of Speech.

Speech Coding and Synthesis.

Speech Recognition.
 
Læringsaktiviteter:

The course is carried out through lectures from 17:00 to 19:00 and exercises from 19:00 to 21:00. The exercises are carried out in Matlab.




NB! In the introductory week, meaning from 29 August to 2 September 2005 the exercites are cancelled. Lectures from 16:00 to 18:30.




Depending on the number of students, the course manager is allowed to separate the students in two groups for exercises. One group will then do exercises in the evening (from 19:00 - 21:00 as planned) and one group can do exercises before the lectures. 

Eksamensform og -beskrivelse:X. experimental examination form (7-scale; external exam), 13-skala, Ekstern censur

 

Litteratur udover forskningsartikler:T.F. Quatieri

Discrete-Time Speech Signal Processing

Prentice Hall, 2001.



Ted Painter, Andreas Spanias

Perceptual Coding of Digital Audio

Proceedings of IEEE, Vol. 88, No. 4, April 2000.



Sadaoki Furui

Speech Recognition Technology in the Ubiquitous/Wearable Computing Environment.

Proc. 2000 IEEE Int. Conf. on Acoustics, Speech, and Signal Processing, Vol. IV, pp. 3735 - 3738.



Ram R. Rao, Tsuhan Chen, Russell M. Mersereau

Audio-to-Visual Conversion for Multimedia Communication

IEEE Transactions on Industrial Electronics, Vol. 45, No. 1, February 1998.
 
 
Afholdelse (tid og sted)
Kurset afholdes på følgende tid og sted:
UgedagTidspunktForelæsning/ØvelserStedLokale
Torsdag 13.30-16.00 Forelæsning ITU
Torsdag 16.00-18.30 Øvelser ITU

Eksamen afholdes på følgende tid og sted:
EksamensdatoTidspunktEksamenstypeStedLokale
2006-01-05 see time slot on course home page Mundtlig eksamen ITU see Examination Plan in the Study Guide on the ITU intranet
2006-01-06 do Mundtlig eksamen ITU do