KF7032 - Big Data and Cloud Computing

APPLY NOW Book an Open Day Add to My Courses Register your interest / Course PDF

What will I learn on this module?

In this module you will develop knowledge and skills that will enable you to tackle a realistic big data problem, using some of the principal machine learning techniques and statistical approaches used in big data analysis. Furthermore, you will learn how to implement your solution using an industry leading Cloud computing provider together with appropriate distributed processing environments.

You will learn how to host multi-terabyte sized big datasets using a cloud service provider. This will includes provisioning a commercial cloud provider, and then mastering appropriate distributed operating systems, such as Hadoop. You will then learn approaches to processing and analysing big data, based on advanced statistical processing, supervised and unsupervised machine learning algorithms and other state of the art big data analytic methods. Such techniques include clustering algorithms, pattern based information extraction, linear and non-linear regression, and feature based models. Inevitably, much work on big data analysis is statistical, so you will therefore develop some relevant statistical understanding. As data visualization is frequently critical in helping to develop hypotheses about the data, you will also cover and apply problem relevant 2D and 3D visualization methods where appropriate to the particular datasets.

How will I learn on this module?

You will learn through a combination of methods to support learning, including lectures, practical sessions in workshops and guided learning. Topics will normally be introduced in lectures and explored through practical exercises (helping you develop the practical skills needed) and guided learning activities. You will be encouraged to develop independent learning skills and the development of critical analytic approaches to the big data and cloud computing area.

More specifically, you will work in teams using a leading cloud services provider and big data analysis techniques as the basis of your practical work, giving you immediate saleable skills. Staff will support your learning through verbal feedback on your practical achievements.

All module material will be available on the eLearning Portal (ELP) so that you can access information when you need to. The university library offers support for all students through its catalogue and an Ask4Help Online service.

How will I be supported academically on this module?

Staff will support you in the practical sessions, providing advice and feedback on your progress and engaging in discussion with you, to examine your ideas and those of others as your tutors value your input and opinions. You will be strongly encouraged to engage in further study by yourself or with other students outside of class time to become an independent learner. This is an essential capability in every area of Computing, whose utility will long outlive the detail of current technical approaches.

This module will use and promote an eLP (Blackboard) based discussion forum. This will be configured to encourage you, other students and academic staff to participate in discussion about the subject matter of the module.

What will I be expected to read on this module?

You will read books, scientific refereed articles and conference papers. You will be expected to go beyond blogs, way beyond web pages and to develop independent critical research capabilities. This capacity to research and critically analyse formal literature will stand you in good stead when confronted with the swathes of uncritical marketing white papers with which the modern IT professional has to contend.

All modules at Northumbria include a range of reading materials with which students are expected to engage. The reading list for this module can be found at: http://readinglists.northumbria.ac.uk

What will I be expected to achieve?

Knowledge & Understanding:
1. Apply big data analytic algorithms, including those for visualization and cloud computing techniques to multi-terabyte datasets.
2. Critically assess data analytic and machine learning algorithms to identify those that satisfy given big data problem requirements

Intellectual / Professional skills & abilities:
3. Critically evaluate and select appropriate big data analytic algorithms to solve a given problem, considering the processing time available and other aspects of the problem.
4. Design and develop advanced big data applications that integrate with third party cloud computing services

Personal Values Attributes (Global / Cultural awareness, Ethics, Curiosity) (PVA):

5. Critically assess and interpret primary research to identify its applicability to a given big data problem scenario.

How will I be assessed?

Summative assessment
There will be two summative assessments.

1. Assignment 1 (70%): you will design and construct, an appropriate solution for a given big data problem scenario by provisioning and configuring appropriate Cloud Computing resources. You will need to select problem appropriate algorithms and methods of visualizing the results to best satisfy a realistic task. This assignment will assess MLOs 1, 3 and 4.

2. Assignment 2 (30%): you will write a 2000 word research report that will critically assess a big data approach. This assignment will assess MLOs 2 and 5.

Formative assessment and feedback
Formative assessment will take the form of practical tasks in workshop exercises. Feedback and guidance will be provided on these.

Verbal feedback will be provided on the formative assessment during the workshop sessions. Written feedback will be provided on the summative assessment.

Pre-requisite(s)

N/A

Co-requisite(s)

N/A

Module abstract

Big Data is the colloquial term used to describe the acquisition of knowledge, insights and understanding gained through identification of patterns in huge, multi-terabyte datasets. In this module you will develop knowledge and skills that will enable you to tackle a realistic big data problem. Furthermore, you will learn how to implement your solution using an industry leading Cloud computing provider together with appropriate distributed processing environments such as Hadoop. Frequently a first step in Big Data analysis insight is gained through visualizing the data. This may give insights into appropriate analytic approaches. You will also learn some of the principal machine learning techniques and statistical approaches used in big data analysis.

Course info

Credits 20

Level of Study Postgraduate

Mode of Study 2 year full-time with Advanced Practice

Department Computer and Information Sciences

Location Ellison Building, Newcastle City Campus

City Newcastle

Start September 2019

Current, Relevant and Inspiring

We continuously review and improve course content in consultation with our students and employers. To make sure we can inform you of any changes to your course register for updates on the course page.

Your Learning Experience find out about our distinctive approach at 
www.northumbria.ac.uk/exp

Admissions Terms and Conditions - northumbria.ac.uk/terms
Fees and Funding - northumbria.ac.uk/fees
Admissions Policy - northumbria.ac.uk/adpolicy
Admissions Complaints Policy - northumbria.ac.uk/complaints