Professional Documents
Culture Documents
ForumDE_AzureDataEngineer_Curriculum
ForumDE_AzureDataEngineer_Curriculum
ForumDE_AzureDataEngineer_Curriculum
Trainer Details:-
Name-Arun Kumar
Ex-TCS,Ex-Harman,
Ex-Tredence Analytics
1
AZURE DATA ENGINEER PROGRAM
ETL CONCEPT
Data Warehouse Concept
SPARK
Architecture of Spark
Pyspark concepts
SparkSQL concepts
Lazy Evaluation
Immutability Concept
operation on RDD
operation on dataframe
Narrow Transformation
Wide Transformation
Accumulator
Broadcast variable
Partition By
partition pruning
cache vs persist
2
AZURE
Furnishing basic information about services of Azure with perspective of Data Engineering and how it
is used in industry and real time scenario.
Azure Databricks
Clusters
Widgets
Service Principal
Mount Point
dbutils commands
Databricks Runtime
Introductory notebooks
3
Azure Data Factory (ADF)
Pipelines
Activities
Datasets
Linked Services
Integration Runtimes
Pipeline Run
Parameters
Variables
Triggers
Global Parameters
ARM Template
Copy filtered data of one table from My SQL(on-premise) to Azure SQL Server.
4
Azure Key-Vault Service
How to connect AKV from ADF(Azure Data Factory) and read the secrets of AKV.
How to connect AKV from ADB(Azure Databricks) and read the secrets of AKV.
How to send mail alert for pipeline failure and success using logic app.
Software Installations
Installation of MySQL
Installation of Azure Data Studio and connection with Azure SQL Server
Intallation of Notepad++
In addition to this we will do one end to end data pipeline on Azure using all the major components
of Azure using Python as programming language.
We will take mock interview to prepare you for real time interview.
Batch Details:
The classes will commence on 28 July 2023 and the expected duration of the course will be 2
months.
5
The timing of classes will be 8:00 pm to 9:15 pm.
Classroom video recording will be provided to all the candidates till 6 months from the date of
course completion.