THE UNIVERSITY of EDINBURGH

DEGREE REGULATIONS & PROGRAMMES OF STUDY 2017/2018

University Homepage
DRPS Homepage
DRPS Search
DRPS Contact
DRPS : Course Catalogue : School of Informatics : Informatics

Postgraduate Course: Applied Databases (INFR11015)

Course Outline
SchoolSchool of Informatics CollegeCollege of Science and Engineering
Credit level (Normal year taken)SCQF Level 11 (Postgraduate) AvailabilityAvailable to all students
SCQF Credits10 ECTS Credits5
SummaryThe course gives an introduction to relational database systems and SQL, as well as to modern "NoSQL" database systems. The student learns how to search over heterogeneous data, both in exact and in approximate ways. This is accompanied by hands-on programming assignments. We study how to carry out large scale data analytics using stream processors and statistical programming languages. The course content is dynamic and will constantly be updated to state-of-the-art database systems.
Course description * Introduction to RDBMS: data models and ER diagrams. How to design a database schema. How to populate a database table. How to express queries using SQL. Speedup of Query Evaluation through indexes.

* Introduction to Storing and Searching Heterogeneous Data: Discuss heterogeneous data models, such as text, hierarchical, and graph shaped data. How can they be mapped into a RDBMS? How are they supported by new systems such as NoSQL databases. What kind of consistency guarantees are provided by the new NoSQL systems?

* Similarity Search: How can we capture that two items are "similar". How can we efficiently search for "similar items"? How can these insights be used to build, e.g., recommendation systems as the ones used by Amazon? What are the challenges when looking for similar complex items, such as images or videos?

* Data Analytics: What is the precise difference to conventional database queries? What kind of analytics can be supported highly efficiently, such as through stream processing where data is not stored locally. Where are the limits? What kind of analytics can be carried out efficiently using statistical programming languages over large data sets?

Relevant QAA Computing Curriculum Sections: Databases, Middleware

The course gives a practical introduction to databases. The first focus is on conventional relational databases systems, and on SQL as query language. Then, new modern systems such as NoSQL systems are introduced. Two particular important topics are covered: how to search for "similar items", and, how to carry out large scale data analytics.
Entry Requirements (not applicable to Visiting Students)
Pre-requisites Students MUST have passed: Database Systems (INFR10055) OR Database Systems (INFR10070)
Co-requisites
Prohibited Combinations Other requirements The course assumes good programming skills in Java.

For Informatics PG and final year MInf students only, or by special permission of the School.
Information for Visiting Students
Pre-requisitesNone
High Demand Course? Yes
Course Delivery Information
Academic year 2017/18, Available to all students (SV1) Quota:  None
Course Start Semester 2
Timetable Timetable
Learning and Teaching activities (Further Info) Total Hours: 100 ( Lecture Hours 20, Seminar/Tutorial Hours 6, Summative Assessment Hours 2, Programme Level Learning and Teaching Hours 2, Directed Learning and Independent Learning Hours 70 )
Assessment (Further Info) Written Exam 60 %, Coursework 40 %, Practical Exam 0 %
Additional Information (Assessment) Assessment Information:

For proper evaluation, students must be presented with real problems, rather than "toy" ones which can be solved in a very limited time. The focus on this course will be on largescale problem solving and critical thinking. To that end, students will pick a miniproject which comes in two linked assignments.

Assessment Weightings:

Exam: worth 70%
The students will deliver their work in two instalments:
* a refinement of the project description and rough prototype (worth 15%);
* a final presentation and a demonstration of their work at the end of the semester (worth 15%).

If delivered in semester 1, this course will have an option for semester 1 only visiting undergraduate students, providing assessment prior to the end of the calendar year.
Feedback Not entered
Exam Information
Exam Diet Paper Name Hours & Minutes
Main Exam Diet S2 (April/May)2:00
Learning Outcomes
On completion of this course, the student will be able to:
  1. Describe how database systems work and their application in Informatics
  2. Analyse data and describe using common description methods such as ER diagrams
  3. Populate a relational database and run queries over it. Load heterogeneous data into a NoSQL database and run queries over it.
  4. Design and implement similarity search in a database
  5. Run basic statistical queries over large data sets
Reading List
"Web Data Management" - Abiteboul, Manolescu, Rigaux, Rousset, Senellart, published by Cambridge University Press 2011

"Mining of Massive Datasets" - Anand Rajaraman, Jeffrey David Ullman, published by Cambridge University Press 2011.

Please note, these books are available online and free of charge:
http://webdam.inria.fr/Jorge/files/wdm.pdf
http://infolab.stanford.edu/~ullman/mmds/book.pdf


Additional Information
Course URL http://www.inf.ed.ac.uk/teaching/courses/ad
Graduate Attributes and Skills Not entered
KeywordsDatabase systems,Data management,Similarity Search,Data Analytics
Contacts
Course organiserDr Sebastian Maneth
Tel: (0131 6)51 5642
Email:
Course secretaryMs Alexandra Welsh
Tel: (0131 6)50 2701
Email:
Navigation
Help & Information
Home
Introduction
Glossary
Search DPTs and Courses
Regulations
Regulations
Degree Programmes
Introduction
Browse DPTs
Courses
Introduction
Humanities and Social Science
Science and Engineering
Medicine and Veterinary Medicine
Other Information
Combined Course Timetable
Prospectuses
Important Information
 
© Copyright 2017 The University of Edinburgh - 6 February 2017 8:08 pm