Big Data using Hadoop and Spark

Big Data using Hadoop and Spark

Course Chapters and Description

  • Characteristics of Big Data Technology
  • Introduction to Hadoop
  • Hadoop Configuration
  • Hadoop Core Components – HDFS and MapReduce
  • HDFS Architecture
  • Data Types in Hadoop
  • Hadoop MapReduce – Features and Processes
  • Advanced MapReduce and HDFS
  • Introduction to PIG
  • Introduction to HIVE
  • HIVE – Characteristics 
  • Hive Query Language
  • Data Models and Data Types in HIVE
  • Introduction to HBASE
  • Characteristics and Architecture of HBASE
  • Cloudera – Introduction
  • Cloudera Distribution and Manager
  • Comparison of PIG, HIVE and MapReduce
  • Introduction to Zoo Keeper, Sqoop and Oozie
  • PySpark: Spark with Python
  • Loading Data in PySpark shell
  • Python Functional programming using filter() and map()
  • Programming in PySpark RDD’s
  • PySpark SQL and Dataframes
  • Machine Learning with PySpark MLlib
  • Introduction to Spark in R using sparklyr

Similar IT Training Courses

Data Science using Python

Complete data science training with python for data analysis

Data Science using R Training Course

Learn and master data science concepts using R.

Microsoft Courses

Find the IT training you need.  Learn more about our Microsoft courses

Reach Us

Call or use the form to request a free initial consultation.

Office 1.05, 1st Floor, Building 2,Croxely Business Park, Watford, WD18 8YA

enquiries@bsbitraining.co.uk

    Leave A Message

    10 + 4 =