MA3831 - Big Data
This subject will provide students with cutting-edge tools and techniques for high-performance
and large-scale computing, with a focus on computer models and software designed to
handle Big Data sets in a distributed and/or parallel fashion. Particular emphasis will
be given to distributed and parallel computing using MapReduce/Hadoop and similar
models for processing Big Data sets.
- list the different systems and approaches for high-performance and large-scale computing,
as well as explain their differences;
- conceptually describe and apply models for distributed and parallel computing of Big
Data sets, such as MapReduce and Spark;
- choose and apply different techniques and software for distributed and cloud computing
of Big Data, such as Hadoop.
Minor variations might occur due to the continuous Subject quality improvement process,
and in case of minor variation(s) in assessment details, the Subject Outline represents the latest