Site Hound (previously THH) is a Domain Discovery Tool
☆24 · Feb 10, 2026 · Updated last month
Alternatives and similar repositories for sitehound-frontend
Users who are interested in sitehound-frontend are comparing it to the libraries listed below.
- extract difference between two html pages ☆33 · Feb 10, 2026 · Updated last month
- a tor socks proxy docker image ☆12 · Feb 10, 2026 · Updated last month
- Show summary of a large number of URLs in a Jupyter Notebook ☆19 · Feb 10, 2026 · Updated last month
- A component that tries to avoid downloading duplicate content ☆28 · Feb 10, 2026 · Updated last month
- Broad crawler for domain discovery ☆20 · Feb 10, 2026 · Updated last month
- Adaptive crawler which uses Reinforcement Learning methods ☆169 · Feb 10, 2026 · Updated last month
- Simple heuristic for measuring web page similarity (& data set) ☆91 · Feb 23, 2026 · Updated last month
- A generic crawler ☆79 · Feb 10, 2026 · Updated last month
- [UNMAINTAINED] Deploy, run and monitor your Scrapy spiders. ☆12 · Feb 23, 2026 · Updated last month
- Scrapy middleware for the autologin ☆37 · Feb 10, 2026 · Updated last month
- Frontera backend to guide a crawl using PageRank, HITS or other ranking algorithms based on the link structure of the web graph, even whe… ☆55 · May 21, 2024 · Updated last year
- A queue-controlled browser automation tool for improving web crawl quality ☆64 · Aug 13, 2025 · Updated 7 months ago
- Web Crawling UI and HTTP API, based on Scrapy and Tornado ☆161 · Feb 10, 2026 · Updated last month
- Extract text from HTML ☆135 · Feb 10, 2026 · Updated last month
- Highlight and select phrases in HTML pages. ☆24 · Nov 4, 2019 · Updated 6 years ago
- Send data about Celery events to statsd ☆29 · May 20, 2023 · Updated 2 years ago
- ☆20 · Oct 2, 2024 · Updated last year
- Modules for the Stratos ERP project ☆13 · May 15, 2023 · Updated 2 years ago
- Youtube comments topics modeling and sentiment analyzer ☆16 · Oct 25, 2022 · Updated 3 years ago
- Python scripts to scrape Metadata and Comments of Youtube Videos ☆19 · Apr 6, 2017 · Updated 8 years ago
- [Deprecated] Docker image to run an out-of-the-box Memcached server ☆12 · Mar 31, 2017 · Updated 8 years ago
- Paginating the web ☆37 · Feb 11, 2014 · Updated 12 years ago
- A fork of http://pydispatcher.sourceforge.net/ with PyPy support ☆16 · Jul 3, 2017 · Updated 8 years ago
- C++ library to parse WARC files ☆11 · Jan 27, 2019 · Updated 7 years ago
- Trough: Big data, small databases. ☆42 · Jul 25, 2024 · Updated last year
- The missing datasets manager. Like Homebrew but for datasets. CLI tool to search and discover datasets! ☆41 · May 29, 2017 · Updated 8 years ago
- WarcMiddleware lets users seamlessly download a mirror copy of a website when running a web crawl with the Python web crawler Scrapy. ☆48 · Mar 19, 2018 · Updated 8 years ago
- Detect and classify pagination links ☆107 · Feb 10, 2026 · Updated last month
- This plugin provides a useful feature for multi-language ☆14 · Jul 15, 2022 · Updated 3 years ago
- Minimal web-based client for NewsBlur. ☆20 · Dec 7, 2014 · Updated 11 years ago
- Create your custom Qt + PyQt SDK for multiple platforms ☆10 · Jun 7, 2019 · Updated 6 years ago
- Wrapper to run 2to3 automatically at import time ☆13 · Dec 9, 2011 · Updated 14 years ago
- A curated list of amazing libraries, services and resources to work with PDF files ☆16 · Jan 28, 2026 · Updated 2 months ago
- ☆18 · Feb 18, 2016 · Updated 10 years ago
- Common methods to help create fabric deployment scripts for django ☆35 · Jan 28, 2010 · Updated 16 years ago
- A “Hello World” of calling Rust code from a Python program with CFFI, in order to show packaging issues ☆11 · Jul 14, 2016 · Updated 9 years ago
- MySQL backend for Django based on the PyMySQL database adapter ☆24 · Nov 30, 2012 · Updated 13 years ago
- A model field to store a file size, whose edition and display shows units (KB, MB, ...) ☆18 · Jun 29, 2023 · Updated 2 years ago
- Recurrent Neural Networks for Speaker and Turn Taking Classification ☆12 · Aug 29, 2018 · Updated 7 years ago