Home
posts
tags
archives
search
Archive
2025
4
May
1
🦾 Picture Perfect Match: Building an Image Similarity Search Engine with Vector Databases🤖
May 15, 2025
· 9 min · 1729 words · Vesko Vujovic
April
2
📊 The Analytics Self-Service Revolution: How Data Catalogs Empower Enterprise Teams 💡
April 24, 2025
· 13 min · 2588 words · Vesko Vujovic
🏗️ Why Data Warehouses Backed by Open Table Formats Could Completely Replace Traditional DWHs 🌊
April 5, 2025
· 13 min · 2584 words · Vesko Vujovic
February
1
🚨 The Hidden Pitfall That Sabotages SQL Performance: Functions on Indexed Columns 📉
February 5, 2025
· 6 min · 1250 words · Vesko Vujovic
2024
7
November
2
Speed Up Your Spark Jobs: The Hidden Trap in Union Operations
November 29, 2024
· 4 min · 844 words · Vesko Vujovic
AWS Lambda Event Source Mapping: The Magic Behind Kafka Offset Management
November 16, 2024
· 5 min · 1002 words · Vesko Vujovic
October
2
DuckDB Inside Postgres: The Unlikely Duo Supercharging Analytics
October 30, 2024
· 9 min · 1730 words · Vesko Vujovic
Apache Spark: Beware of Column Ordering and Data Types When Using Apache Spark’s Union Function
October 6, 2024
· 5 min · 1038 words · Vesko Vujovic
September
1
Apache Spark: Why JSON isn’t ideal format for your spark job
September 9, 2024
· 5 min · 924 words · Vesko Vujovic
August
1
AWS: Lambda Event Source Mapping with Confluent Kafka
August 4, 2024
· 5 min · 981 words · Vesko Vujovic
July
1
Apache Spark: Dataset vs Dataframe - The Tortoise and Hare
July 21, 2024
· 5 min · 976 words · Vesko Vujovic