AWS Glue Pipeline

AWS Glue is a serverless, fully managed, cloud-optimised extract, transform, and load (ETL) service from Amazon. In this project I read structured data from S3, create a data catalogue with a Glue crawler, transform the data with Python and PySpark in a Glue notebook, and finally upload the cleaned data back to S3. Find the slides here.
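
The steps described above (read the catalogued table, transform it with PySpark, write the result back to S3) could look roughly like the sketch below. It is only an illustration, not the code from the slides: the database, table, and bucket names are hypothetical, the cleaning steps (snake_case column names, dropping duplicates and fully empty rows) are assumed examples of a transformation, and Parquet is an assumed output format. It also assumes the crawler has already run, so the table exists in the Data Catalog.

```python
import re


def clean_column_name(name: str) -> str:
    """Normalise a raw column header to snake_case (pure Python, no Spark needed)."""
    return re.sub(r"[^0-9a-zA-Z]+", "_", name.strip()).strip("_").lower()


def run_job(database: str, table: str, output_path: str) -> None:
    """Read a catalogued table, clean it, and write the result back to S3."""
    # Imported lazily: these modules are only available inside a Glue job or notebook.
    from awsglue.context import GlueContext
    from pyspark.context import SparkContext

    glue_context = GlueContext(SparkContext.getOrCreate())

    # Read the table that the crawler registered in the Glue Data Catalog.
    dyf = glue_context.create_dynamic_frame.from_catalog(
        database=database, table_name=table
    )

    # Convert to a Spark DataFrame to use the regular PySpark API.
    df = dyf.toDF()

    # Assumed cleaning steps: tidy the column names, then drop duplicate
    # rows and rows where every column is null.
    df = df.toDF(*[clean_column_name(c) for c in df.columns])
    df = df.dropDuplicates().na.drop(how="all")

    # Write the cleaned data back to S3 (Parquet is an assumed format).
    df.write.mode("overwrite").parquet(output_path)


# Hypothetical call from a Glue notebook cell:
# run_job("sales_db", "raw_orders", "s3://example-bucket/cleaned/orders/")
```

Keeping the column-name helper free of Spark imports means it can be checked locally, while `run_job` only works where the Glue libraries are installed.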

[Slides 1 to 20: Slide1.jpeg through Slide20.jpeg]