Postgraduate Course: Speech Processing (LASC11065)
Course Outline
School | School of Philosophy, Psychology and Language Sciences |
College | College of Humanities and Social Science |
Credit level (Normal year taken) | SCQF Level 11 (Postgraduate) |
Availability | Available to all students |
SCQF Credits | 10 |
ECTS Credits | 5 |
Summary | Syllabus: Fundamentals of speech processing (familiarity with waveforms, spectra, spectrograms, resonance, formants, human speech production and perception, perceptually-motivated frequency scales, time vs. frequency representations; conversion between the two, the Fourier transform, source-filter model of speech, hands on experience via xwaves), speech recognition (components of a typical recogniser, parameterisation of the speech signal, dynamic time warping, distance measures, the Hidden Markov Model, the generative model paradigm, simple probability theory, conditional and joint probabilities, Bayes' theorem, Gaussian probability density function, continuous density HMMs, monophone models with Gaussian observation densities, Viterbi algorithm for recognition, training from fully labelled data, Viterbi training, bigram language models), speech synthesis (components of a typical text-to-speech synthesiser, text analysis, phonology, finite-state automata, POS tagging, lexicon, phrasing, accents, F0, learning from data, CART models, waveform generation, concatenative methods - TD-PSOLA and linear prediction, F0 and duration modification).
Formative Feedback Events:
After Assignment 1 - lab session will include general feedback on the assignments. All students will have the opportunity for a 15 minute session with the lecturer or tutor
After Assignment 2 - all students will have the opportunity for a 15 minute session with the lecturer or tutor |
Course description |
Not entered
|
Entry Requirements (not applicable to Visiting Students)
Pre-requisites |
|
Co-requisites | |
Prohibited Combinations | |
Other requirements | None |
Additional Costs | None |
Information for Visiting Students
Pre-requisites | None |
Course Delivery Information
|
Academic year 2015/16, Available to all students (SV1)
|
Quota: None |
Course Start |
Semester 1 |
Timetable |
Timetable |
Learning and Teaching activities (Further Info) |
Total Hours:
100
(
Lecture Hours 27,
Feedback/Feedforward Hours 1,
Programme Level Learning and Teaching Hours 2,
Directed Learning and Independent Learning Hours
70 )
|
Assessment (Further Info) |
Written Exam
60 %,
Coursework
40 %,
Practical Exam
0 %
|
Feedback |
Not entered |
Exam Information |
Exam Diet |
Paper Name |
Hours & Minutes |
|
Main Exam Diet S1 (December) | Speech Processing | 2:00 | |
Learning Outcomes
After taking this module, students should be able to:
- give an overview of the components of state-of-the art speech recognition and speech synthesis systems
- understand the main concepts and what each component does
- describe a simple version of each component
- see what the difficult problems are in recognition and synthesis. They will also: use tools for visualising and manipulating speech waveforms
- experiment with two state-of-the-art speech technology systems
- put experimental methodology into practice
- see how knowledge and skills from different areas come together in an interdisciplinary field
|
Additional Information
Graduate Attributes and Skills |
Not entered |
Additional Class Delivery Information |
**Please note that students only need to attend ONE of the TWO lab sessions. You will be assigned a lab session by the lecturer. ALL students should come to the first class. |
Keywords | Not entered |
Contacts
Course organiser | Prof Simon King
Tel: (0131 6)51 1725
Email: |
Course secretary | Miss Toni Noble
Tel: (0131 6)51 3188
Email: |
|
© Copyright 2015 The University of Edinburgh - 27 July 2015 11:27 am
|