2025  4

May  1

🦾 Picture Perfect Match: Building an Image Similarity Search Engine with Vector Databases🤖

May 15, 2025 · 9 min · 1729 words · Vesko Vujovic

April  2

📊 The Analytics Self-Service Revolution: How Data Catalogs Empower Enterprise Teams 💡

April 24, 2025 · 13 min · 2588 words · Vesko Vujovic

🏗️ Why Data Warehouses Backed by Open Table Formats Could Completely Replace Traditional DWHs 🌊

April 5, 2025 · 13 min · 2584 words · Vesko Vujovic

February  1

🚨 The Hidden Pitfall That Sabotages SQL Performance: Functions on Indexed Columns 📉

February 5, 2025 · 6 min · 1250 words · Vesko Vujovic

2024  7

November  2

Speed Up Your Spark Jobs: The Hidden Trap in Union Operations

November 29, 2024 · 4 min · 844 words · Vesko Vujovic

AWS Lambda Event Source Mapping: The Magic Behind Kafka Offset Management

November 16, 2024 · 5 min · 1002 words · Vesko Vujovic

October  2

DuckDB Inside Postgres: The Unlikely Duo Supercharging Analytics

October 30, 2024 · 9 min · 1730 words · Vesko Vujovic

Apache Spark: Beware of Column Ordering and Data Types When Using Apache Spark’s Union Function

October 6, 2024 · 5 min · 1038 words · Vesko Vujovic

September  1

Apache Spark: Why JSON isn’t ideal format for your spark job

September 9, 2024 · 5 min · 924 words · Vesko Vujovic

August  1

AWS: Lambda Event Source Mapping with Confluent Kafka

August 4, 2024 · 5 min · 981 words · Vesko Vujovic

July  1

Apache Spark: Dataset vs Dataframe - The Tortoise and Hare

July 21, 2024 · 5 min · 976 words · Vesko Vujovic