High Performance & Scalable Analytics, NO-SQL Big Data Platforms

Credits: 
2
Hours: 
20
Area: 
Big Data Technology
Academic Year: 
2016-2017
Description: 

The aim of this course is to introduce the student with the high performance Big Data management tools. The student will gain expertise in the use od NO-SQL platforms for the analysis and mining of large data volumes, thus performing tasks that would not be feasible with traditional data bases.

Notions: 

The course illustrates the techniques,methodologies and programming tool for conducting data analysis and knowledge extraction from Big Data also exploiting large computational infrastructures.

Technics and tools: 

Python, Hadoop, Pig, Hive, MongoDB, Spark

Case studies and datasets: 

Some new datasets (Twitter, Movie Rating, Mobility) will be provided. Datasets used in other courses will also be analyzed.

Competences: 

The student will gain expertise in handling high performance computing tool for parallel and distributed platforms, and he will experiment several applications and use cases on real-world datasets

Partners