Partitioning Practices in Apache Hive and Apache Iceberg

Partitioning Practices in Apache Hive and Apache Iceberg

# Partitioning Practices in Apache Hive and Apache Iceberg## Introduction The efficiency of query execution is paramount. One of the key strategies to optimize this efficiency is through the use of

Read More
Columnar vs. Row-based Data Structures in OLTP and OLAP Systems

Columnar vs. Row-based Data Structures in OLTP and OLAP Systems

[Follow my Data Youtube Channel](https://www.youtube.com/@alexmerceddata)The decision between using columnar and row-based data structures can significantly impact the performance and efficiency of

Read More
Introduction to Data Vault Modeling

Introduction to Data Vault Modeling

[Subscribe to my Data Youtube Channel and Podcasts, Links Here](https://bio.alexmerced.com/data)Data Vault modeling is an approach to data warehouse design that offers a unique method for handling

Read More
Table Format FUD - Thinking Through the Table Format Conversion (Apache Iceberg, Apache Hudi, Delta Lake)

Table Format FUD - Thinking Through the Table Format Conversion (Apache Iceberg, Apache Hudi, Delta Lake)

## ContextThis article is meant to be a sober reflection on the data lakehouse table format conversation I have had as a participant over the last two years. I've written the following articles ove

Read More