Course Provider
What will you learn in BDA Foundation course?
- Hadoop - History Architecture
- Hadoop Components HDFS Architecture
- HDFS Operations
- Hands-on Exercises on Jigsaw Lab
- MapReduce Concept
- MapReduce Architecture
- YARN MapReduce Internals Hands-on Exercises
- History; Hive Architecture and Components
- Data Storage in Hive
- Data Types in Hive
- Hive Query Language Features
- Partitions in Hive Joins in Hive
- Advanced features - Handling JSON and XML format in Hive
- Hands-on Exercises
- HBase Overview and Architecture
- Data Model, Bulk Data Upload to Hbase
- Oozie Introduction and Overview Workflow
- Coordinator in Oozie
- Need_for_Visualizations-1
- Importance_of_Big_Data_Visualization
- Big_Data_Visualization_Tools_Tableau_Products-1
- Tableau_Installation_Workspace_Overview-1
- Working with Tableau, creating interactive dashboards, Integrating Tableau with Hadoop
BDA Foundation
-
Skill Type
Emerging Tech
- Domain
Big Data Analytics
- Course Category
Foundation course
- Certificate Earned Joint Co-Branded Participation Certificate
- Course Covered under GoI Incentive
Yes
-
- Course Price
INR 4,999+ 18% GST
- Course Duration
73 Hours
- Course Price
Why you should take BDA Foundation course?
- Know what Hadoop is, Hadoop Distributed File System, and MapReduce is, have working knowledge of how to use them, be able to work with HDFS and all basic operations Run MapReduce jobs with a given pre-compiled jar file and check the output
- Get introduced to Spark concepts and set up the environment
- A good understanding of RDD and working knowledge of RDD operations
- Participants will have an overview of Spark architecture, concepts of performance tuning, job submission and job management
- Understand and know how to use Spark Streaming, Spark SQL, Dataframes, the APIs. Understanding of Spark MLLib with practical examples.
Who should take BDA Foundation course?
All IT people who want to switch to data Engineering
Curriculum
- HDFS Architecture
- HDFS Operations
- MapReduce Concept
- MapReduce Architecture
- YARN, MapReduce Internals
- Hands-on Exercises
- Introduction to Spark and Set up, Spark Ecosystem and Abstractions
- Programming in Scala
- Spark Properties & Use Cases
- Basics of RDD Operations
- Transformations in RDD
- Actions in RDD
- Advanced RDD Operations
- RDD Persistence
- Overview of Shared Variables - Accumulators, Job Submission & Execution, Performance Tuning, Job Scheduling and Management
- Spark Streaming Architecture
- Structured Streaming
- Spark SQL
- Dataframes
- Machine Learning with Spark
- Machine Learning Case Studies with Spark.
Tools you will learn in BDA Foundation course
- Core Spark
- Dataframes
- RDD Operations
- Shared Variables
- Spark MLlib
- Spark SQL