// TOPIC

#streaming

10 articles

◆◆◆AdvancedMetaAirbnb
01

Design a Large-Scale Data Pipeline (ETL / Batch + Streaming)

Move and transform petabytes from sources into a warehouse/lake for analytics. DAG orchestration, Spark shuffles, lake vs warehouse, and idempotent, replayable jobs.

#interview#big-data#data-engineering
26 min
◆◆◆AdvancedStripePayPal
02

Design a Real-Time Fraud Detection System

Score transactions for fraud inline in milliseconds. Feature stores, streaming velocity features, rules + ML hybrids, graph fraud rings, and the label-delay problem.

#interview#ml#streaming
23 min
◆◆◆AdvancedMetaGoogle
03

Design an Ad Click Aggregator (real-time analytics)

Ingest billions of ad events, serve per-minute metrics in near-real-time, and produce exact totals for billing — the canonical streaming + lambda/kappa problem.

#interview#streaming#analytics
21 min
◆◆◆AdvancedTwitterMeta
04

Design Top-K / Trending (heavy hitters)

Find the top-K most frequent items in a massive stream without counting everything exactly. Count-Min Sketch, heavy-hitter algorithms, and approximate streaming aggregation.

#interview#streaming#probabilistic
22 min
◆◆◆AdvancedTwitchYouTube
05

Design a Live Streaming System (Twitch)

Ingest one broadcaster and fan out to millions of viewers with seconds of latency. Transcoding ladders, HLS/DASH segmenting, CDN fan-out, and live chat.

#interview#media#streaming
22 min
◆◆IntermediateNetflixLinkedIn
06

Backpressure & Flow Control

What happens when a fast producer overwhelms a slow consumer? Backpressure, bounded buffers, load shedding, and why unbounded queues are a trap.

#distributed-systems#reliability#streaming
17 min
◆◆◆AdvancedByteDanceMeta
07

Design TikTok / Reels (short-video platform)

A vertical-swipe video feed that feels infinite and clairvoyant. Two-tower retrieval, real-time ranking with Monolith, sub-200ms playback via aggressive preloading, and the For You Page that knows you in 90 seconds.

#interview#video#recommendation
22 min
◆◆IntermediateGoogleAmazon
08

Design Search Autocomplete (Typeahead)

Sub-100ms autocomplete suggestions across billions of queries — tries, top-k caching, and personalized ranking.

#interview#search#trie
15 min
◆◆◆AdvancedUberLyft
09

Design Uber / Lyft (ride hailing)

Match drivers to riders in real time at city scale. Geohashing, dispatch algorithms, surge pricing, and the realtime location pipeline.

#interview#geo#realtime
17 min
◆◆◆AdvancedGoogleNetflix
10

Design YouTube / Netflix (video streaming)

How a billion users watch ~1 billion hours of video a day. Upload pipeline, transcoding, adaptive bitrate, CDN, recommendation.

#interview#video#cdn
17 min