People Trained
Recorded Course




Soumyadeep Dey is a Senior Solutions Architect (Data & AI) at Microsoft, where he helps organizations build and scale intelligent, data-driven solutions.
Previously, he worked at Amazon Web Services (AWS India) as a Data & AI Architect, partnering with diverse customers to modernize their data and AI workloads.
With experience across 50+ customers and multiple industries, he brings real-world insights into his courses and projects, bridging the gap between learning and practical implementation.
This course is ideal for students, working professionals, and tech enthusiasts who want to learn Big Data, Spark, PySpark, and Data Engineering through real, hands-on projects.
Basic Python knowledge helps, but everything from setup to advanced Spark operations is explained step-by-step — so beginners are welcome too!
Unlike typical theory-heavy courses, this one is completely project-based with real datasets (100GB–500GB) and focuses on practical, industry-relevant learning.
You’ll work with:
Apache Spark & PySpark
AWS EMR setup
Big Data ecosystem fundamentals
Spark DataFrames, Window functions, Transformations & Actions