First semester

NoSQL Technologies and Cloud

Objectifs

This course aims to give students a basic understanding of distributed architectures specialized in Big Data storage and processing. Assumptions, concepts and algorithms will be detailed for each of them.

The aim of the course is twofold. It will enable students to gain an understanding of the cloud-based systems currently available, to point to future directions for these architectures, and to find their way through the jungle of NoSql technologies to find the solution that best meets their needs. These databases differ from classic relational approaches. These approaches abandon the matrix representation of information and the SQL language in favor of greater simplicity, performance and scalability.

Plan

– Distributed file system with HDFS

– Distributed processing with Hadoop MapReduce

– Virtualization and containerization

Prérequis

Relational database