crawlee

Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Puppeteer, Playwright, Cheerio, JSDOM, and raw HTTP. Both headful and headless mode. With proxy rotation.

23.7k stars1.4k forks130 watchers137 open issuesTypeScriptApache-2.0

View on GitHub Visit homepage

TL;DR · 30-second scan

What it is

crawlee (TypeScript) — Crawlee—A web scraping and browser automation library for Node.js to build reliable crawlers. In JavaScript and TypeScript. Extract data for AI, LLMs, RAG, or GPTs. Download HTML, PDF, JPG, PNG, and other files from websites. Works with Puppeteer, Playwright, Cheerio, JSDOM, and raw HTTP. Both headful and headless mode. With proxy rotation.

What it does for you

You need to crawl hundreds of thousands of pages reliably in TypeScript or Node.

23.7k GitHub starsLicense: Apache-2.0Last updated 1 month ago

EDITOR'S DEEP TAKE

Node-native crawler from the Apify team. Built-in queues, retries, proxy rotation, headless browser pool — production patterns out of the box. Switches between Playwright, Puppeteer, and plain HTTP based on the page. Best in class for serious Node-based scraping at scale.

Use this if

You need to crawl hundreds of thousands of pages reliably in TypeScript or Node.

Skip if

A 50-line Playwright script will do — Crawlee is overhead for a quick scrape.

Topics

web-scrapingweb-crawlingnpmheadless-chromepuppeteerautomationapifyscrapingcrawlingcrawlerheadlessscraperweb-crawlerjavascriptnodejsplaywrighttypescript

Maintainer? Embed our badge

Add this badge to your README to show your project is curated on StackPicks. Free, lightweight (180×28 SVG), and gives your visitors a one-click way to see honest take + alternatives.

Preview

Markdown (for GitHub README)

[![Featured on StackPicks](https://stackpicks.dev/api/badge/apify-crawlee)](https://stackpicks.dev/repo/apify-crawlee)

HTML (for blogs / docs)

<a href="https://stackpicks.dev/repo/apify-crawlee"><img src="https://stackpicks.dev/api/badge/apify-crawlee" alt="Featured on StackPicks" width="180" height="28" /></a>

Are you the maintainer of apify/crawlee? Add the badge and we'll feature your project in the weekly curator newsletter.

Created 27 Aug 2016

Last push 1 month ago

Stats refreshed 1 month ago