Published inData Engineer ThingsHow Not to Partition Data in S3 (And What to Do Instead)Learn the pitfalls of partitioning data by date in S3Oct 13A response icon4Oct 13A response icon4
Published inData Engineer ThingsHow the Community Turned Into a SaaS CommercialCan We Bring Back the Soul?Sep 15A response icon8Sep 15A response icon8
Published inData Engineer ThingsApache Spark Core Concepts ExplainedIf you’ve spent any time wrangling data at scale, you’ve probably heard of Apache Spark. Maybe you’ve even cursed at it once or twice —…Aug 11A response icon4Aug 11A response icon4
Published inData Engineer ThingsCluster Managers for Apache Spark: from YARN to KubernetesDeep dive into machinery that orchestrates SparkJul 21Jul 21
Published inData Engineer ThingsData Partitioning: Slice Smart, Sleep BetterEver had to migrate a petabyte-scale table because you picked the wrong partition key?Jun 30A response icon3Jun 30A response icon3
Published inData Engineer ThingsData Engineering: Now with 30% More BullshitTools don’t solve problems. People do. No buzzword replaces craftsmanship.May 20A response icon60May 20A response icon60
Published inData Engineer ThingsUnderstanding AWS Regions and Availability Zones: A Guide for BeginnersAmazon Web Services (AWS) has completely changed the game for how we build and manage infrastructure. Gone are the days when spinning up a…Apr 27Apr 27
Mastering Project Clarity: The Power of the RACI MatrixClear roles and ownership can save your team from confusion, delays, and finger-pointingApr 9Apr 9
Published inDataDrivenInvestorCAP and PACELC Theorems in Plain EnglishModern distributed systems are all about tradeoffs. Performance, reliability, scalability, and consistency don’t come for free — you…Jan 29Jan 29
Published inData Engineer ThingsTwo Archetypes of Data EngineersDiscover different archetypes of data engineers and how their collaboration drives data-driven successJan 18Jan 18