Mydra logo
Data
Data
IEBS Digital School 7 logo

IEBS Digital School 7

Distributed Architectures in Big Data

  • up to 1 month
  • Intermediate

Explore the essentials of distributed architectures in Big Data, focusing on evaluating tools and combining them to enhance data processing. Gain practical skills in architectures like Lambda and Kappa, and understand their application in real-world scenarios.

  • Distributed Systems
  • Big Data Processing
  • Data Architecture Design

Overview

This course provides an in-depth understanding of distributed data architectures, essential for handling large-scale data efficiently. Learn about parallelization, Map Reduce, Lambda and Kappa architectures, and how to manage resources using tools like HDFS, HIVE, and YARN. Practical case studies, such as Netflix's data architecture, are included to illustrate the application of these concepts.

  • Web Streamline Icon: https://streamlinehq.com
    Online
    course location
  • Layers 1 Streamline Icon: https://streamlinehq.com
    Spanish
    course language
  • Professional Certification
    upon course completion
  • Self-paced
    course format
  • Pre-recorded classes
    delivered online

Who is this course for?

Engineers

Engineers aiming to update their knowledge and develop new skills within Artificial Intelligence.

Programmers

Programmers looking to expand their knowledge and capabilities in the world of data science for their professional development.

Experienced Data Analysts

Data analysts who want to delve into deeper aspects of technology.

Why should you take this course?

Gain practical expertise in Big Data architectures to enhance your data processing capabilities. Learn through real-world applications like Netflix's data architecture and become proficient in tools such as HDFS, HIVE, and YARN. Ideal for engineers, programmers, and data analysts looking to advance in the field of Big Data.

Pre-Requisites

1 / 3

  • Basic understanding of data structures

  • Familiarity with programming concepts

  • Interest in Big Data technologies

What will you learn?

Introduction to Distributed Architectures: Parallelization and Map Reduce
Dive into the fundamentals of distributed architectures, focusing on parallelization and the Map Reduce programming model. Understand key concepts such as scalability, fault tolerance, and cluster geographic distribution.
Lambda and Kappa Architectures. Batch vs Streaming
Explore the differences between Lambda and Kappa architectures and the concepts of batch and streaming data processing. Analyze the advantages and disadvantages of each architecture with a real case study of Netflix.
Resource Management in Distributed Architectures
Master the use of Apache Foundation tools like HDFS, HIVE, YARN, and ZooKeeper. Learn about resource distribution and optimization in distributed environments through practical examples and a detailed study of Netflix's architecture.

Meet your instructor

  • Alejandro Pérez Pérez

    Instructor, IEBS Digital School

    Alejandro Pérez Pérez is an instructor at IEBS Digital School passionate about pioneering ethical AI solutions grounded in empathy and responsibility. He advocates for responsible AI integration, engaging global audiences in discussions on ethical considerations.

Upcoming cohorts

  • Dates

    start now

$510