Download as pdf or txt
Download as pdf or txt
You are on page 1of 2

Large Scale Computing for Data

Analytics
Course information sheet 2022-23

Full course, 10 weeks

This course aims to introduce learners to efficient implementation of computationally expensive


data-analytic methods and data-analytic methods for big data including deep learning and convo-
lutional neural networks, both in terms of applications and implementation in frameworks such
as Tensorflow or Keras. This course discusses enterprise-level technology relevant to big data
analytics such as Spark, Hadoop and NoSQL databases.

Prerequisite Knowledge

Learners should be familiar with the programming lan- eralised linear models and classification.
guage Python. In addition, Learners should be knowl-
edgeable in Bayesian statistics, statistical inference, gen-

Intended Learning Outcomes


By the end of this course learners will be able to:

assess and compare the complexity of an algorithm such as Tensorflow or Keras and assess the result;
and implementation both in terms of computational discuss important methodological aspects underpin-
time and memory, as well as suggest strategies for ning deep learning;
reducing those; explain the differences between SQL and NoSQL
describe key concepts of TensorFlow; databases and assess their suitability in different
perform basic computations with TensorFlow. real-life settings;
distinguish between different types of deep and/or explain the basic concepts underpinning big data
convolutional neural networks and choose an appro- systems such as Spark or Hadoop and discuss their
priate network for a given problem; suitability and use in different scenarios.
fit a neural network using specialised frameworks
The University of Glasgow is a registered Scottish charity: Registration Number SC004401.
Syllabus

Week 1 (sample material) Week 6 “This course has opened my eyes to


Large-scale distributed comput- Introduction to convolutional some of the work I’m likely to be
ing neural networks; doing in my workplace in the near
Assessing computational cost Applications of convolutional future. It also helped to explain
and complexity neural networks; some topics to me which I’d
Data parallelism Simple examples of convolutions; previously heard of but had not
The MapReduce paradigm Convolutions with TensorFlow. managed to obtain a full
Week 2 Week 7 understanding of.”
Introduction to TensorFlow Analysing sequential data with
Basic computations recurrent neutral networks;
Overview of key concepts Training recurrent neutral net- Online Learning
Simple linear regression with works; • Weekly live sessions with tu-
TensorFlow Implementing recurrent neutral tors
Week 3 networks in TensorFlow. • Weekly learning material
Classification with TensorFlow Week 8 (reading material, videos,
Creating a classifier to recognise Statistical computation and prob- exercises with model answers)
handwritten digits abilistic modelling with Tensor- • Bookable one-to-one sessions
Visualisation deep learning Flow Probability; with tutor(s)
Debugging TensorFlow Probabilistic programming;
Week 4 Understanding key features of
Understanding the underlying TensorFlow Probability;
mechanics of TensorFlow Statistical inference with Tensor- Textbooks
Understanding key concepts Flow Probability;
Aurelien, G (2019) Hands-On Ma-
needed to build Tensorflow mod- Bayesian statistics with Tensor-
chine Learning with Scikit-Learn
els including optimisers, layers Flow Probability;
and TensorFlow, O’Reilly Media,
and activation functions Fitting generalised linear models
Inc.
Week 5 with TensorFlow Probability.
Deep learning for image classifi- Week 9
cation Brief history of big data;
Introduction to OpenCV Management, modelling and Assessment
Deep learning in Python using computational issues with big (for credit only)
Keras data; This will typically be made up of 4
Data storage of big data; pieces of assessment, including on-
Mid-term week break Introduction to Hadoop; line quizzes, an individual project and
Introduction to Spark. an assignment.
Week 10
Data Analytics using Spark.

School of Mathematics and


Statistics
Software
To take our courses please use an up-to-date version of a standard browser (such as University of Glasgow
Google Chrome, Firefox, Safari, Internet Explorer or Microsoft Edge) and a PDF reader http://gla.ac.uk/
(such as Acrobat Reader). Learning material will be distributed through Moodle. Learn- mdatagov
ers need to have access to Python and the machine learning framework TensorFlow. It http://gla.ai
is recommended that you use Jupyter Google colaboratory notebook for this course,
Email:
however other options are available. Learners need to install Zoom for participating in
video conferencing sessions. We recommend the use of a head set for video conferenc- maths-stats-analyticscpd@
ing sessions. glasgow.ac.uk

You might also like