AWS EMR Configurations

A guide to configuring Amazon EMR clusters, focusing on instance types, auto-scaling, and spot instance strategies.

July 3, 2021 · 3 min

Building a Data Lake on AWS - Essential Components and Best Practices

Explore how to build an efficient and scalable data lake on AWS using key services and best practices.

October 23, 2020 · 4 min

Mastering AWS Glue - A Deep Dive into Glue Catalog, Jobs, and Best Practices

This post explores AWS Glue’s powerful ETL capabilities, focusing on Glue Catalog, Glue Jobs, and practical tips to optimize your data workflows.

September 12, 2020 · 5 min

ETL Options in AWS

A comparison of three AWS ETL options (EMR, Glue, and Lambda), highlighting their strengths, use cases, and best practices.

August 27, 2020 · 3 min