Tagged: distributed-systems

The words FAIR QUEUE in monospace with an orange admission arrow cutting through a row of waiting dots

Designing a fair queue at scale: lessons from high-demand ticket on-sales

Traces the distributed-systems problem behind a virtual waiting room: admission control under a thundering herd, the fairness-versus-throughput tradeoff, clock skew in queue ordering, the signed token design, and the failure modes that leak slots.

waiting-room infrastructure distributed-systems

Sat, April 25, 2026 · 24 min read

Designing a distributed crawler: frontier, dedup, politeness, and backpressure

Traces the architecture of a web-scale crawler from Mercator and the early Googlebot through IRLbot to today: the URL frontier, duplicate elimination, politeness scheduling, and how servers push back.

crawling distributed-systems infrastructure

Sun, March 29, 2026 · 21 min read

URL frontier design: from Mercator to modern priority-queue crawlers

How the URL frontier orders a crawl: the Mercator front-queue/back-queue split, per-host politeness, freshness versus coverage, and the disk-backed and gRPC designs that run at web scale today.

crawling distributed-systems infrastructure

Sat, March 28, 2026 · 22 min read

The word Bloom filters in monospace with an orange underline and the caption seen(url) returns maybe or definitely-not

Bloom filters and the URL-seen problem in web-scale crawling

A primary-source walk through the URL-seen problem in large crawlers: why naive dedup fails at scale, how Bloom filters answer it, the false-positive math, and the counting, scalable, blocked, and cuckoo variants that followed.

crawling distributed-systems algorithms

Thu, March 26, 2026 · 23 min read

Consistent hashing: the algorithm behind every modern load balancer and cache

Traces consistent hashing from Karger's 1997 ring to virtual nodes, jump hash, Maglev tables, and the bounded-load variant that Vimeo shipped in HAProxy, with the minimal-remapping math that ties them together.

infrastructure algorithms distributed-systems

Tue, December 30, 2025 · 21 min read