Published in Data Engineer Things | Understanding AWS Regions and Availability Zones: A Guide for Beginners | Amazon Web Services (AWS) has completely changed the game for how we build and manage infrastructure. Gone are the days when spinning up a… | 3d ago
Mastering Project Clarity: The Power of the RACI Matrix | Clear roles and ownership can save your team from confusion, delays, and finger-pointing | Apr 9
Published in DataDrivenInvestor | CAP and PACELC Theorems in Plain English | Modern distributed systems are all about tradeoffs. Performance, reliability, scalability, and consistency don’t come for free: you… | Jan 29
Published in Data Engineer Things | Two Archetypes of Data Engineers | Discover different archetypes of data engineers and how their collaboration drives data-driven success | Jan 18
Published in Data Engineer Things | How to Speed Up Spark Jobs on Small Test Datasets | Dealing with small datasets (under a million records) can be a peculiar challenge when you’ve chosen Apache Spark as your go-to tool… | Dec 6, 2024
Why Use `pip install --user`? | When working with Python, you’re likely familiar with the process of installing packages using the popular package manager, pip. It's a… | Dec 3, 2024
Comparing Dgraph and Neo4j Graph Databases: Key Differences and Use Cases | In modern data engineering, graph databases have gained prominence for their ability to efficiently store, query, and traverse… | Nov 19, 2024
Exploring the Power of Graph Databases | “Everything is connected to everything else.” (Leonardo da Vinci) | Nov 5, 2024
Table Selection in Software Engineering | In the world of poker, there is a strategy that goes beyond just playing the game well: it’s about choosing the right table. The idea here… | Oct 15, 2024
Senior Engineer Fatigue | “I can’t go back to yesterday because I was a different person then” (Alice, Lewis Carroll) | Oct 1, 2024