THE UNIVERSITY of EDINBURGH

DEGREE REGULATIONS & PROGRAMMES OF STUDY 2017/2018

University Homepage

DRPS : Course Catalogue : School of Informatics : Informatics

Postgraduate Course: Accelerated Natural Language Processing (INFR11125)

Course Outline
School	School of Informatics	College	College of Science and Engineering
Credit level (Normal year taken)	SCQF Level 11 (Postgraduate)	Availability	Available to all students
SCQF Credits	20	ECTS Credits	10
Summary	The course will synthesize ideas from linguistics and computer science to provide students with a fast-paced introduction to the field of natural language processing. The course will cover the most widely-used theoretical and computational models of language, including both statistical and nonstatistical approaches. The course will familiarize students with a wide range of linguistic phenomena with the aim of appreciating the complexity, but also the systematic behaviour of natural languages like English, the pervasiveness of ambiguity, and how this presents challenges in natural language processing. In addition, the course introduce the most important algorithms and data structures that are commonly used to solve many NLP problems. The course will cover formal models for representing and analyzing the syntax and semantics of words, sentences, and discourse. Students will learn how to analyse sentences algorithmically, using hand-crafted and automatically induced treebank grammars, and how to build interpretable semantic representations. The course will also cover a number of standard models and algorithms that are used throughout language processing. Examples include n-gram and Hidden Markov Models, the EM algorithm, and dynamic programming algorithms such as chart parsing. This course replaces INFR11059 Advanced Natural Language Processing
Course description	Part I: Words * Inflectional and derivational morphology * Finite state methods and Regular expressions * Word Classes and Parts of speech * Sequence Models (n-gram and Hidden Markov models, smoothing) * The Viterbi algorithm, Forward Backward, EM Part II: Syntax * Syntactic Concepts (e.g., constituency, subcategorisation, bounded and unbounded dependencies, feature representations) * Analysis in CFG - Greedy algorithms---Shift-reduce parsing * Divide-and-conquer algorithms---CKY * Chart parsing * Lexicalised grammar formalisms (e.g., TAG, CCG, dependency grammar) * Statistical parsing (PCFGs, dependency parsing) Part III: Semantics, Discourse, Dialogue and Applications * logical semantics and compositionality * Semantic derivations in grammar * Lexical Semantics (e.g., word senses, semantic roles) * Discourse and dialogue (e.g., anaphora, speech acts) * Text classification and sentiment analysis * Other applications (e.g., machine translation, question answering) Methodological topics, interspersed throughout: * Issues in annotation and evaluation * Machine learning approaches (e.g., Maximum Entropy models, neural networks) Relevant QAA Computing Curriculum Sections: Not yet available

Entry Requirements (not applicable to Visiting Students)
Pre-requisites		Co-requisites
Prohibited Combinations	Students MUST NOT also be taking Foundations of Natural Language Processing (INFR09028) OR Advanced Natural Language Processing (INFR11059)	Other requirements	Students with little or no previous programming experience must also register for Computer Programming for Speech and Language Processing. Labs and assignments require programming in Python at a level designed for students who are learning Python simultaneously with this course.

Information for Visiting Students
Pre-requisites	None
High Demand Course?	Yes

Course Delivery Information

Academic year 2017/18, Available to all students (SV1)		Quota: None
Course Start	Semester 1
Timetable	Timetable
Learning and Teaching activities (Further Info)	Total Hours: 200 ( Lecture Hours 30, Seminar/Tutorial Hours 5, Supervised Practical/Workshop/Studio Hours 8, Summative Assessment Hours 2, Programme Level Learning and Teaching Hours 4, Directed Learning and Independent Learning Hours 151 )
Assessment (Further Info)	Written Exam 70 %, Coursework 30 %, Practical Exam 0 %
Feedback	Not entered
Exam Information
Exam Diet	Paper Name		Hours & Minutes
Main Exam Diet S1 (December)			2:00

Learning Outcomes
On completion of this course, the student will be able to: Students should be able to construct examples of ambiguous Natural Language sentences and provide a written explanation of how ambiguity arises in natural language and why this is a problem for computational analysis. Given a grammar, semantics and sentence, students should be able to construct a syntatic and semantic analysis of the sentence. Given an appropriate NLP problem, students should be able to apply sequence models, parsing and search algorithms and provide a summary of their operation in this context. Given an appropriate NLP problem, students should be able to analyse the problem and decide which data structures and algorithms to apply. ? Review and classify search algorithms and ways of manipulating dynamic data structures. Given two NLP algorithms, students should be able to describe how they are related and illustrate differences and limitations by providing illustrative examples.

Reading List
Jurafsky and Martin, Speech and Language Processing, 2nd edition, 2008.

Additional Information
Course URL	http://course.inf.ed.ac.uk/anlp/
Graduate Attributes and Skills	Not entered
Keywords	Not entered

Contacts
Course organiser	Dr Henry Thompson Tel: (0131 6)50 4440 Email:	Course secretary	Ms Alexandra Welsh Tel: (0131 6)50 2701 Email:

Navigation

Help & Information

Search DPTs and Courses

Regulations

Degree Programmes

Courses

Humanities and Social Science

Science and Engineering

Medicine and Veterinary Medicine

Other Information

Combined Course Timetable

Important Information

© Copyright 2017 The University of Edinburgh - 6 February 2017 8:10 pm