Tagged: browser-automation — page 2

The lifecycle of a stealth patch: discovery, fix, detection, and re-discovery

Traces how a single browser-automation stealth patch moves through its life: a signal is found, the patch hides it, the patch itself becomes a fingerprint, and a new signal replaces the old one. Includes real examples and the economics of the treadmill.

browser-automation stealth anti-bot

Mon, March 30, 2026 · 19 min read

Parsing at scale: when to use a real browser vs an HTTP client

A decision framework for choosing between a headless browser and a plain HTTP client at extraction scale: JS-dependence, per-page cost, fingerprint surface, brittleness, and the hybrid path most large crawlers actually take.

crawling browser-automation infrastructure

Tue, March 17, 2026 · 18 min read

The headless-browser tax: memory, CPU, and why HTTP clients win when they can

Traces the real resource cost of driving headless Chrome at scale: per-instance RAM, the multi-process tax, container failure modes, concurrency math, and the cost gap that pushes teams back to HTTP clients.

crawling browser-automation infrastructure

Mon, March 16, 2026 · 22 min read

Handling JavaScript-rendered content without a browser: API discovery and XHR replay

How to pull JavaScript-rendered data without launching a browser: finding the backend JSON, XHR, and GraphQL endpoints a page calls, replaying them, handling tokens and request signatures, and where the approach stops working.

crawling browser-automation reverse-engineering

Fri, March 13, 2026 · 20 min read

The navigator object as a fingerprint: every property a detector reads

A reference for the navigator object's fingerprinting surface: userAgent, platform, languages, hardwareConcurrency, deviceMemory, vendor, productSub, and webdriver, plus the cross-property consistency checks that catch a spoof.

device-fingerprinting fingerprinting browser-automation

Sat, February 7, 2026 · 21 min read

The history of Selenium and WebDriver: from 2004 to the W3C standard

Traces Selenium from Jason Huggins's 2004 JavaScriptTestRunner through Selenium RC's proxy hack, the 2009 WebDriver merger, and WebDriver becoming a W3C Recommendation in 2018.

history browser-automation web-standards

Sat, October 18, 2025 · 22 min read

The history of Puppeteer and the headless-Chrome era it launched

Traces Puppeteer from the April 2017 headless-Chrome announcement through its CDP foundation, the stealth-plugin arms race, the team's departure to build Playwright, and the long shadow it cast over scraping.

history browser-automation cdp

Fri, October 17, 2025 · 19 min read

The history of anti-detect browsers: from multi-accounting to fingerprint spoofing

How anti-detect browsers grew out of carding forums and affiliate multi-accounting into a commercial tool category, and why the work moved from JavaScript spoofing into the browser engine itself.

history anti-detect-browser browser-automation

Mon, October 6, 2025 · 20 min read