Back to browse
GitHub Repository

Open-source web crawler in Rust

1 starsHTML

I built a production-ready web crawler in Rust with TTL and anti-dupe

by qmay-rust·Jun 19, 2026·2 points·1 comment

AI Analysis

MidShip It

Competent Rust crawler when Scrapy and crawlee already solve this.

Strengths
  • Tokio-based async architecture handles concurrent requests efficiently
  • Pause/resume preserves queue state without losing progress
Weaknesses
  • Web crawling is a solved category with mature open-source alternatives
  • Only 3 commits and zero stars suggests early-stage without traction
Target Audience

Developers building search indexes or content aggregators

Similar To

Scrapy · crawlee · Heritrix

Similar Projects

Developer Tools●●Solid

Fullbleed – Rust HTML/CSS-to-PDF with Deterministic Output+Python CLI

It skips headless Chromium entirely and implements an HTML/CSS-to-PDF pipeline in Rust, exposing a Python wheel and CLI that releases the GIL and uses Rayon for parallel batch renders. The deterministic bits — fixed-point base unit, --repro-record/--repro-check, SHA256 outputs and vendored assets — are a clear, practical play for audited VDP/transactional workflows; what's still unknown is CSS spec coverage and whether subtle print-layout quirks will require hand-holding.

WizardryNiche Gem
krflol
204mo ago

Production-Ready Django and React Auth Starter

Wires the annoying auth plumbing — JWT with refresh/rotation, email activation, password reset, UUID user model, rate limiting and Swagger out of the box — so you can skip weeks of setup. The frontend is modern (TS + Vite, Tailwind, Radix) and includes route guards, but this is an incremental starter in a crowded space; I'd like to see deployment/CI examples or opinionated infra choices to make it truly turnkey.

Niche GemSolve My Problem
okchildhood
114mo ago