simon987 / Architeuthis
MITM HTTP(S) proxy with integrated load-balancing, rate-limiting and error handling. Built for automated web scraping.
☆41 · Updated 5 years ago
Alternatives and similar repositories for Architeuthis
Users who are interested in Architeuthis are comparing it to the repositories listed below.
- Convert HTTP Archive (HAR) -> Web Archive (WARC) format ☆54 · Updated 7 years ago
- A server to collect & archive websites that also supports video downloads ☆86 · Updated 2 years ago
- Distributed crawler, database and web frontend for public directories indexing ☆141 · Updated 5 years ago
- A UDP torrent tracker scraper library written in Python 3 ☆52 · Updated 2 years ago
- Open directory indexer ☆10 · Updated 2 years ago
- web-based epub indexer ☆87 · Updated last year
- 🧠 AI powered image tagger backed by DeepDetect ☆250 · Updated 7 years ago
- Subtitle Download Service ☆16 · Updated 2 years ago
- Support for writing WARC files with Scrapy ☆23 · Updated 5 years ago
- Like Tor2Web, but not just HTTP ( using IPv6 ) ☆127 · Updated 4 years ago
- The Seesaw pipeline grab script for the URLTeam (terroroftinytown) project ☆28 · Updated 4 months ago
- Easily archive important Reddit post threads onto your computer ☆63 · Updated 3 years ago
- A self-hosted drag-and-drop, nosql yet fully-featured file-scanning server. ☆30 · Updated 3 years ago
- Configure, launch, and work in Dockerized environments ☆32 · Updated 5 years ago
- WarcMiddleware lets users seamlessly download a mirror copy of a website when running a web crawl with the Python web crawler Scrapy. ☆47 · Updated 7 years ago
- Reverse image/video search for reddit ☆11 · Updated 5 years ago
- Scrape posts, threads from forums, news aggregators, mail archives, export to JSONL, mailbox, WARC ☆105 · Updated last year
- A DHT crawler and torrent indexer ☆109 · Updated 7 years ago
- Filtering reverse HTTP proxy ☆176 · Updated 2 years ago
- 📒 Easy and effective communication for any team or community. ☆33 · Updated 3 years ago
- Easy to use rclone mount/unmount scripts ☆10 · Updated 6 years ago
- Search in multiple torrent sites from your CLI ☆69 · Updated 2 years ago
- Scrapes an arbitrary number of lines from a Discord channel ☆24 · Updated 6 years ago
- Dynamic image server for web and print ☆88 · Updated 3 years ago
- Fast full text search for email ☆34 · Updated 4 years ago
- collected yatb sources + small fixes ☆14 · Updated 3 years ago
- 📃 Media Request System ☆19 · Updated 5 years ago
- Automated database snapshots of the nZEDb IRC predb bot. ☆31 · Updated last year
- Download any song mp3 with no dependencies except ffmpeg ☆130 · Updated 5 years ago
- A terminal client to access srrdb.com ☆21 · Updated 3 years ago