Exploring Ollama

Run and Manage Large Language Models Locally with Ease

March 5, 2025 · 3 min

PySpark Cheat Sheet

A quick guide to PySpark and AWS Glue for efficient data processing and transformation.

January 23, 2023 · 4 min

Understanding and Managing Technical Debt

Insights on identifying and effectively handling technical debt in development.

June 3, 2022 · 5 min

Computational Complexity

This post covers computational complexity using Python examples.

September 27, 2021 · 7 min

AWS EMR Configurations

A guide to configuring Amazon EMR clusters, focusing on instance types, auto-scaling, and spot instance strategies.

July 3, 2021 · 3 min

Building a Data Lake on AWS - Essential Components and Best Practices

Explore how to build an efficient and scalable data lake on AWS using key services and best practices.

October 23, 2020 · 4 min

Mastering AWS Glue - A Deep Dive into Glue Catalog, Jobs, and Best Practices

This post explores AWS Glue’s powerful ETL capabilities, focusing on Glue Catalog, Glue Jobs, and practical tips to optimize your data workflows.

September 12, 2020 · 5 min

ETL Options in AWS

A comparison of the AWS ETL services EMR, Glue, and Lambda, highlighting their strengths, use cases, and best practices.

August 27, 2020 · 3 min