crawlee
Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Puppeteer, Playwright, Cheerio, JSDOM, and raw HTTP. Both headful and headless mode. With proxy rotation.
TL;DR · 30-second scan
crawlee (TypeScript) — Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Puppeteer, Playwright, Cheerio, JSDOM, and raw HTTP. Both headful and headless mode. With proxy rotation.
You need to crawl hundreds of thousands of pages reliably in TypeScript or Node.
Node-native crawler from the Apify team. Built-in queues, retries, proxy rotation, headless browser pool — production patterns out of the box. Switches between Playwright, Puppeteer, and plain HTTP based on the page. Best in class for serious Node-based scraping at scale.
You need to crawl hundreds of thousands of pages reliably in TypeScript or Node.
A 50-line Playwright script will do — Crawlee is overhead for a quick scrape.
Add this badge to your README to show your project is curated on StackPicks. Free, lightweight (180×28 SVG), and gives your visitors a one-click way to see honest take + alternatives.
[](https://stackpicks.dev/repo/apify-crawlee)
<a href="https://stackpicks.dev/repo/apify-crawlee"><img src="https://stackpicks.dev/api/badge/apify-crawlee" alt="Featured on StackPicks" width="180" height="28" /></a>
Are you the maintainer of apify/crawlee? Add the badge and we'll feature your project in the next weekly newsletter (~2,000 builders).