Tagged: web-standards

Selenium's bidirectional protocol and the WebDriver BiDi migration

How WebDriver BiDi gives the W3C automation standard the bidirectional channel that CDP had, why Selenium and Firefox are moving onto it, and what the switch changes for bot detection.

browser-automation selenium web-standards

Wed, April 15, 2026 · 22 min read

Crawl politeness: robots.txt, crawl-delay, and the unwritten rules of scale

Traces how crawl politeness works in practice: RFC 9309 robots.txt parsing, the crawl-delay split between Google, Bing, and Yandex, per-host rate limits, sitemaps, and the cryptographic verification replacing the honor system.

crawling robots-txt web-standards

Fri, March 27, 2026 · 25 min read

GREASE values in the ClientHello and why they break naive fingerprinting

Traces GREASE (RFC 8701), the reserved random values browsers inject into the TLS ClientHello to keep extension points usable, and explains why fingerprinting that fails to normalize them produces a different hash on every connection.

tls fingerprinting web-standards

Sun, March 8, 2026 · 19 min read

Encrypted Client Hello (ECH): what it hides and what it doesn't

How ECH encrypts the inner ClientHello, including SNI, with an HPKE key fetched from DNS, what the outer ClientHello still leaks, and where deployment actually stands now that RFC 9849 has shipped.

tls ech privacy web-standards

Mon, March 2, 2026 · 20 min read

The Battery Status API and the privacy disaster that got it deprecated

Traces how four read-only battery properties became a cross-site tracking vector, the 2015 Olejnik research that proved it, the in-the-wild scripts Princeton caught, and the Firefox and WebKit removals that followed.

device-fingerprinting privacy web-standards

Thu, February 5, 2026 · 21 min read

How HTTP caching headers really work: Cache-Control, Vary, and revalidation

A primary-source reference for HTTP caching: how Cache-Control directives, Expires, ETag and Last-Modified revalidation, Vary, and the stale-* extensions actually behave in private and shared caches under RFC 9111.

http caching web-standards

Mon, December 29, 2025 · 25 min read

The cookie and identity layer: SameSite, partitioning, and the third-party-cookie death

Traces the HTTP cookie from a 1994 shopping-cart hack to the web's identity layer: how SameSite reshaped it, why the third-party-cookie phase-out collapsed in 2024-2025, and what partitioning leaves behind.

cookies privacy web-standards

Sun, December 28, 2025 · 18 min read

CHIPS and partitioned cookies: the post-third-party-cookie identity model

A primary-source walk through CHIPS: the Partitioned cookie attribute, the double-keyed cookie jar, the cross-site ancestor chain bit, the 10 KiB per-partition budget, and where it sits now that Privacy Sandbox is gone.

cookies privacy web-standards

Sat, December 27, 2025 · 21 min read

How Encrypted SNI became ECH: the long road to hiding the hostname

A history of encrypting the TLS server name, from the 2018 ESNI experiment and why it failed to the ECH design that encrypts the whole inner ClientHello with HPKE, finished as RFC 9849 in 2026.

tls privacy web-standards

Fri, December 26, 2025 · 18 min read

CORS, the same-origin policy, and the long history of cross-origin trust

Traces the same-origin policy from Netscape in 1995 to RFC 6454, then how CORS relaxes it through preflights and Access-Control headers, the misconfigurations that break it, and where the model stands in 2026.

web-security web-standards browser

Wed, November 19, 2025 · 21 min read

Cookie security: HttpOnly, Secure, SameSite, and the __Host- prefix

A primary-source reference for the cookie security attributes: what HttpOnly, Secure, SameSite, Domain, and Path each enforce, why the __Host-/__Secure- prefixes exist, and the gaps each one leaves behind.

web-security cookies web-standards

Tue, November 18, 2025 · 24 min read

Content Security Policy: how CSP works and why it's so hard to deploy

A reference on CSP: the directive and source-list model, nonces, hashes and strict-dynamic, report-only mode, the Google study that showed most real-world policies were bypassable, and why retrofitting a strict policy is so painful.

web-security csp web-standards

Mon, November 17, 2025 · 21 min read

Subresource Integrity and the supply-chain risk of third-party scripts

Traces how the integrity attribute verifies a third-party script against a cryptographic hash, which compromised-CDN attacks it stops, the dynamic-resource gap it cannot close, and why adoption stayed in single digits.

web-security supply-chain web-standards

Sun, November 16, 2025 · 20 min read

The history of web scraping: from wget to headless Chrome, 1994-2026

Traces automated web extraction from the 1993 Wanderer and JumpStation through wget, Perl LWP, the API era, Scrapy, Selenium, the headless-Chrome shift, and the AI-training wave, with the legal landmarks along the way.

history crawling web-standards

Sat, November 1, 2025 · 25 min read

A history of the robots.txt standard, from 1994 consensus to RFC 9309

Traces robots.txt from Martijn Koster's 1994 mailing-list proposal through 25 years as a de-facto standard, Google's 2019 push, RFC 9309 in 2022, and the 2024-2025 AI-crawler revolt and llms.txt debate.

history robots-txt web-standards

Fri, October 31, 2025 · 22 min read

The history of HTTP: from 0.9 to HTTP/3, told through its RFCs

Traces HTTP from Berners-Lee's one-line 1991 protocol through RFC 1945, the RFC 2068/2616/7230 era of HTTP/1.1, Google's SPDY, HTTP/2 (RFC 7540/9113), and HTTP/3 over QUIC (RFC 9114).

history http web-standards

Tue, October 28, 2025 · 22 min read

The history of the cookie: from Lou Montulli's 1994 hack to SameSite

Traces the HTTP cookie from Lou Montulli's 1994 design at Netscape through RFC 2109, 2965, and 6265, the third-party tracking era, and the SameSite phase-out endgame that never quite arrived.

history cookies web-standards

Sat, October 25, 2025 · 22 min read

The history of Selenium and WebDriver: from 2004 to the W3C standard

Traces Selenium from Jason Huggins's 2004 JavaScriptTestRunner through Selenium RC's proxy hack, the 2009 WebDriver merger, and WebDriver becoming a W3C Recommendation in 2018.

history browser-automation web-standards

Sat, October 18, 2025 · 22 min read

The history of QUIC: from Google's 2012 experiment to RFC 9000

How QUIC went from a 2012 Google experiment in Chrome and YouTube to a standardized IETF transport, traced through gQUIC, the TLS 1.3 redesign, HTTP/3, and the May 2021 publication of RFC 9000.

history quic web-standards

Fri, October 10, 2025 · 22 min read