Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

FutureLearn

Introduction to Big Data Analytics with Hadoop

Packt via FutureLearn

Overview

Learn how to use the Hadoop ecosystem

Understanding Hadoop is a highly valuable skill for anyone working with large amounts of data. Companies such as Amazon, eBay, Facebook, Google, LinkedIn, Spotify, and Twitter use Hadoop in some way to process huge chunks of data.

On this three-week course, you’ll become familiar with Hadoop’s ecosystem and understand how to apply Hadoop skills in the real world.

Exploring the history and key terminology of Hadoop, you’ll then walk-through the installation process on your desktop to help you get started.

Explore Hadoop Distributed File System (HDFS)

With a solid introduction to Hadoop, you’ll learn how to manage big data on a cluster with Hadoop Distributed File System (HDFS).

You’ll also discover MapReduce to understand what it is and how it’s used before moving onto programming Hadoop with Pig and Spark.

With this knowledge, you’ll be able to start analysing data on Hadoop.

Understand MySQL and NoSQL

Next, you’ll learn how to do more with your data as you understand how to store and query data. To help you do this, you’ll learn how to use applications such as Sqoop, Hive, MySQL, Phoenix, and MongoDB.

Develop core data analyst skills

Finally, you’ll hone your data analyst skills by learning how to query data interactivity. You’ll also gain an overview of Presto and learn how to install it to ensure you can quickly query data of any size.

By the end of the course, you’ll have the skills to effectively work with big data using Hadoop and be able to streamline your processes.

This course is designed for anyone who works with big data.

You don’t need any prior experience of using Hadoop as you’ll start with the very basics.

On this course, we’ll show you how to install the Hadoop environment on your operating system.

Syllabus

  • Introduction to Hadoop and using the HDFS
    • Introduction to the course
    • Introduction to Hadoop
    • Using the Hadoop Distributed File System (HDFS)
    • Using the Hadoop's Core: MapReduce
    • Using the Hadoop’s Core: Activities and challenge exercise
    • Wrap up
  • Programming Hadoop with Pig and Spark
    • Introduction to Week 2
    • Introduction to programming Hadoop with Pig
    • Pig continued
    • Programming Hadoop with Spark
    • Data sets and Spark 2.0
    • Wrap up
  • Using relational and non-relational data stores with Hadoop
    • Introduction to Week 3
    • Using relational data-stores with Hadoop part 1
    • Using relational data-stores with Hadoop part 2
    • Using non-relational data stores with Hadoop (1)
    • Using non-relational data stores with Hadoop (2)
    • Using non-relational data stores with Hadoop (3)
    • Wrap up

Taught by

Charlie Travis

Reviews

Start your review of Introduction to Big Data Analytics with Hadoop

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.