Data Engineering Kickstart Course
In This Course,
You'll Learn:
Class - 1 (Introduction to Distributed Compute & Storage, Big Data, MapReduce)
Class - 2 (Understanding Big Data Ecosystem)
Class - 3 (Data Engineering File Formats - Parquet & Avro, ZooKeeper intro)
Class - 4 (Introduction to Spark with Batch & Stream Processing)
Class - 5 (Spark Ecosystem, its Development, Architecture & Execution)
Class - 6 (Install Spark on Laptop and AWS EC2 Instances)
Class - 7 (Working with Spark Standalone Cluster on Laptop)
Class - 8 (Working with Spark on AWS EMR)
Class - 9 (Spark Application Deployment Modes - Cluster & Client)
Class - 10 (Spark DataFrames, DAG & PySpark Module)
Class - 11 (What is SparkSession)
Class - 12 (DataFrame Reader & Writer to read/write from/to Sources & Targets)
Class - 13 (DataFrame details, File Partitions & Spark Tasks)
Class - 14 (Introduction to Basic Transformations - Select, Distinct, Where, Filter)
Class - 15 (Introduction to Spark Actions)
Class - 16 (Details of Spark UI & how to use it)
Class - 17 (Additional Spark Transformations - agg, alias, groupby)
Class - 18 (Additional Transformations - orderBy, selectExpr)
Class - 19 (Spark Shuffle Process (explore using SparkUI))
Class - 20 (Input Partitions in Spark)
Class - 21 (Output Partitions in Spark )
Class - 22 (Exploring Spark UI Further)
Class - 23 (How to Read Explain Plan and its Output)
Class - 24 (Analytics using Spark DATE Functions)
Class - 25 (Analytics using Spark WINDOW Functions)
Class - 26 (HANDS-ON PROJECT - Flight Layover Analysis, Travel Activity, Environment Impact)
Class - 27 (Execute Spark Applications on AWS EMR)
Class - 28 (Submit Spark code to AWS EMR Cluster from Laptop)