DataboltDATABOLT
CHALLENGESPATTERNSLEARNDISCUSSIONSWRITE-UPSMY WORK
DataboltDATABOLT
Home
WORK
My WorkNotebooksCode / Scripts
COMMUNITY
DiscussionsCompetitionsContributionsWrite-ups
LEARN
Learning PathsNotebook Playground
ACHIEVEMENTS
Badges
SETTINGS
Cloud (BYOC)
RECENT
ETL Pipeline - Customer Data
10m ago
ETL Speed Race
1h ago
Late-arriving data approach
2h ago
Kafka Consumer Script
4h ago
Spark Fundamentals
8h ago
LOGIN / SIGN UP

WRITE-UPS

Blogs, tutorials, news, and case studies from the community

NEW WRITE-UP
FEATURED

Building a Cost-Efficient ETL Pipeline at Scale

How we reduced our Spark cluster costs by 73% while improving throughput

Sarah ChenFeb 10, 20262,456189
BLOG8 min read
S
Sarah ChenFeb 10, 2026

Building a Cost-Efficient ETL Pipeline at Scale

How we reduced our Spark cluster costs by 73% while improving throughput

sparkcost-optimizationdelta-lakeetl
REFERENCES
Salted Join PatternIncremental LoadingETL Speed RaceCustomer ETL Notebook
342,456
ANNOUNCEMENT4 min read
D
Databolt TeamFeb 8, 2026

Databolt February Update: New Streaming Challenges & Flink Support

Plus: improved notebooks, community badges, and the winter leaderboard results

platformupdatestreamingflink
REFERENCES
Real-Time Streaming ChallengeSpark Fundamentals Path
785,891
TUTORIAL12 min read
S
Marcus JohnsonFeb 6, 2026

A Beginner's Guide to Change Data Capture with Debezium

Step-by-step tutorial for setting up CDC pipelines from PostgreSQL to Kafka

cdcdebeziumkafkapostgresstreaming
REFERENCES
CDC Pipeline PatternKafka Streaming ModuleCDC Demo Notebook
221,823
CASE STUDY6 min read
P
Alex RiveraFeb 3, 2026

How We Won the $100 Budget Challenge

Processing 100M records for $12.47 using serverless Spark and aggressive partition pruning

cost-optimizationserverlesssparkcompetition
REFERENCES
$100 Budget ChallengeIncremental LoadingBudget Challenge Pipeline
563,412
BLOG10 min read
D
Priya SharmaJan 28, 2026

Understanding Spark's Adaptive Query Execution

Deep dive into AQE internals and when it helps (and when it doesn't)

sparkaqeperformanceinternals
REFERENCES
Spark Fundamentals - Module 7Salted Join Pattern
151,245