top of page
AWS Glue Pipeline
AWS Glue is a Serverless, Fully managed and Cloud-optimised Extract Transform and Load ETL Service by Amazon. In this project I will read a structured data from S3, create a data-catalogue using glue crawler, perform transformation with python and pyspark on glue notebook and finally upload the cleaned data back to S3. Find the slides here.
bottom of page