Flume and Sqoop for Ingesting Big Data

In the Hadoop ecosystem, Flume and Sqoop play key roles in data ingestion.

$99.00

More Information:

  • Modality: On Demand
  • Learning Style: Course
  • Difficulty: Intermediate
  • Duration: 2 Hours

Course Information

About this Course:

In the Hadoop ecosystem, Flume and Sqoop play key roles in data ingestion. They collect data from sources such as local file systems, HTTP, MySQL and Twitter, and deliver it to data stores such as HDFS, HBase and Hive. Both tools provide much of this functionality out of the box and remove many of the obstacles users face when transferring data between these systems.

Flume: Flume agents move data produced by a streaming application to data stores such as HDFS or HBase.

Sqoop: Sqoop imports data in bulk from a traditional RDBMS into Hadoop storage such as HDFS or Hive.

Learning Objectives:

By the end of this course, students should be able to apply the following skills in practice across these sources and data stores:

  • Sources: Twitter, MySQL, Spooling Directory, HTTP
  • Data Stores: HDFS, HBase, Hive

Flume Components:

  • Flume Agents
  • Flume Events
  • Event bucketing
  • Channel selectors
  • Interceptors
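
As a rough illustration of how these components fit together, the sketch below renders a properties file for a single Flume agent with a spooling directory source, a memory channel and an HDFS sink. The agent name, component names, directory and NameNode address are hypothetical placeholders, and the date-based sink path is only one simple way to bucket events. The helper is written in Python purely to keep the example self-contained; Flume itself reads the resulting properties text.

```python
def render_flume_agent(agent, spool_dir, hdfs_path):
    """Return Flume properties text for a spooldir -> memory -> HDFS agent.

    All names passed in (agent, paths) are illustrative assumptions.
    """
    props = {
        # Declare the agent's components.
        f"{agent}.sources": "src1",
        f"{agent}.channels": "ch1",
        f"{agent}.sinks": "sink1",
        # Source: pick up files dropped into a spooling directory.
        f"{agent}.sources.src1.type": "spooldir",
        f"{agent}.sources.src1.spoolDir": spool_dir,
        f"{agent}.sources.src1.channels": "ch1",
        # Channel: buffer events in memory between source and sink.
        f"{agent}.channels.ch1.type": "memory",
        # Sink: write events to HDFS; escape sequences in the path
        # bucket events by date, using the agent's local time.
        f"{agent}.sinks.sink1.type": "hdfs",
        f"{agent}.sinks.sink1.hdfs.path": hdfs_path,
        f"{agent}.sinks.sink1.hdfs.useLocalTimeStamp": "true",
        f"{agent}.sinks.sink1.channel": "ch1",
    }
    return "\n".join(f"{key} = {value}" for key, value in props.items())


config_text = render_flume_agent(
    "agent1",
    "/var/log/incoming",                               # hypothetical directory
    "hdfs://namenode:8020/flume/events/%Y-%m-%d",      # hypothetical cluster
)
print(config_text)
```

Saving the printed text to a file and starting the agent with flume-ng would be the usual next step; channel selectors and interceptors would be configured as additional properties on the source.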

Sqoop Components:

  • Sqoop import from MySQL
  • Incremental imports using Sqoop jobs
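
To give a feel for what an incremental import looks like, the sketch below assembles a Sqoop command line that appends only rows whose check column exceeds a given value. The JDBC URL, user name, table, column and target directory are hypothetical placeholders. The command is built in Python only so it can be inspected before being run; it is a minimal sketch, not the course's own example.

```python
import shlex


def sqoop_incremental_import(jdbc_url, username, table, target_dir,
                             check_column, last_value):
    """Build the argument list for an append-mode incremental Sqoop import."""
    return [
        "sqoop", "import",
        "--connect", jdbc_url,
        "--username", username,
        "-P",                               # prompt for the password at run time
        "--table", table,
        "--target-dir", target_dir,
        "--incremental", "append",          # import only newly added rows
        "--check-column", check_column,     # column used to detect new rows
        "--last-value", str(last_value),    # rows above this value are imported
    ]


cmd = sqoop_incremental_import(
    "jdbc:mysql://dbhost/shop",    # hypothetical MySQL database
    "etl_user",                    # hypothetical user
    "orders",                      # hypothetical table
    "/data/orders",                # hypothetical HDFS directory
    "id",
    0,
)
print(shlex.join(cmd))
```

When the same import is wrapped in a saved Sqoop job (created with sqoop job --create), Sqoop records the last imported value after each run, so it does not have to be supplied by hand on later executions.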

Audience:

This course is aimed at engineers who need to design an application that uses HDFS, HBase or Hive as its data store. It is also useful for engineers who want to port data from a legacy data store to HDFS.

Prerequisites:

Participants must know how to use HDFS and should have a basic understanding of the HBase and Hive shells. Because most of the examples require HDFS, it is recommended to have it installed beforehand.
