Preview mode — 111 repos, zero database
The open-source stack,
curated by builders.
Tell us what you're building or what you need. We'll surface the right repo, with an honest take on whether to use it.
Matches for "python"
2 reposunclecode
crawl4ai
Python-native LLM-friendly crawler. Strong at extracting structured data (JSON schemas) from messy HTML using an embedded LLM. Heavier setup than Firecrawl but more control over extraction prompts. Be…
Scraping & CrawlingAI & ML
You're in Python, want self-hosted, and need extraction with an exact JSON schema.
You don't need LLM-driven extraction — Scrapy or Crawlee will be cheaper and faster.
unclecode/crawl4aiView
scrapy
scrapy
The Python scraping veteran. Mature ecosystem, plugins for everything (caching, proxies, middlewares), and a years-honed pipeline architecture. Steeper learning curve than the modern alternatives but…
Scraping & Crawling
You're a Python team scraping at scale and want middleware/pipeline patterns out of the box.
You're scraping JS-heavy SPAs — Scrapy needs Playwright integration which is awkward; Crawlee is cleaner.
scrapy/scrapyView
Don't see what you need?
We'll add it in 60 minutes.
Tell us what tool or use case is missing. We'll research the best repo for it, write an honest take, add it to the directory, and email you the link. No paywall, no signup required.