KI:STE E-Learning

Hadoop & MapReduce

Hadoop and MapReduce are two examples of modern, sophisticated designs for the asynchronous, parallel processing of massive amounts of data. This section of the lecture gives an overview of the Hadoop architecture and explains the MapReduce algorithm with an example.
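To make the idea concrete before the lecture material, here is a minimal single-process sketch of the three MapReduce stages (map, shuffle, reduce) using the classic word-count task. It is an illustration only, not the example used in the lecture and not Hadoop's API: the function names `map_phase`, `shuffle_phase`, `reduce_phase`, and `word_count`, as well as the sample documents, are assumptions made for this sketch. In a real Hadoop cluster the map and reduce calls would run in parallel on different nodes, and the framework would perform the shuffle.

```python
from collections import defaultdict
from itertools import chain


def map_phase(document):
    """Map: emit a (word, 1) pair for every word in one document."""
    return [(word.lower(), 1) for word in document.split()]


def shuffle_phase(pairs):
    """Shuffle: group all emitted values by their key."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(key, values):
    """Reduce: combine all values of one key into a single result."""
    return key, sum(values)


def word_count(documents):
    """Run map, shuffle, and reduce sequentially over all documents."""
    # Each document is mapped independently; this is the step that a
    # cluster could distribute across many worker nodes.
    mapped = chain.from_iterable(map_phase(doc) for doc in documents)
    grouped = shuffle_phase(mapped)
    # Each key is reduced independently, so this step parallelizes as well.
    return dict(reduce_phase(key, values) for key, values in grouped.items())


# Hypothetical input data, chosen only for illustration.
documents = [
    "big data needs big clusters",
    "data moves to code or code moves to data",
]
counts = word_count(documents)
print(counts)
```

The key design point is that neither `map_phase` nor `reduce_phase` shares state between calls, which is what allows a framework to run many of them at the same time on separate machines.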

Lecturer

• PD Dr. Martin Schultz

Additional Material

Git Repository with Jupyter Notes

Lessons

Intro Part I - Data science and big data analytics

Web accessible data & data publications

Python's requests library

Some hints for good data management

The netCDF file format

The role of metadata

Work with netCDF data in Python

Intro Part II - Data science and big data analytics

Types of data in Earth system science

5 "V" of Earth system data types

How to cope with > 1 TByte of data

Intro Part III - Data science and big data analytics

Challenges of large-scale data analysis and data system architectures

Data structures, data models & data patterns

Classic design patterns

Modern design patterns

Hadoop & MapReduce
