Download as docx, pdf, or txt
Download as docx, pdf, or txt
You are on page 1of 2

GLUE ETL

1. Overview of cloud and AWS

Overview of AWS web services and Free tier Account

Creating an AWS Account

Exploring Web Console

Overview of AWS CLI tool, SDKs and APIs

Overview of EC2 instance

2. Data Storage on AWS

Overview of AWS Storage Service

Overview of S3, Glacier

Creating S3 Bucket

Properties of S3 bucket

Working with RDS databases

Overview of No-SQL database (DynamoDB)

3. Introduction To Glue

Glue Basics

Features of Glue

Glue Components

Securing with IAM

Setting up environment

Pointing to specific data stores and endpoints

4. Glue Data Catalogue

What and why of Data Catalogue

Crawlers

Connecting to your data store


Using Crawlers for Catalogue tables.

5. Working with Glue Jobs

Overview and working of Glue Jobs

Adding new jobs in Glue

Editing scripts in Glue

Triggering Jobs and their scheduling

6. Administering Glue

Scheduling Jobs

Scheduling Crawlers

Logging and monitoring Glue

7. ETL scripts and Glue APIs

ETL scripts in Python

Various Glue APIs

Common Data Types and Exceptions

Use-cases and Benefits

8. Troubleshooting on Glue

Where to look?

Connection issues

Most common issues

Best Practices for Glue

You might also like