HyperionGray / starbelly
Streaming web crawler with WebSocket API
☆44 · Updated last year
Alternatives and similar repositories for starbelly:
Users who are interested in starbelly are comparing it to the libraries listed below.
- A component that tries to avoid downloading duplicate content ☆27 · Updated 6 years ago
- https://mimesniff.spec.whatwg.org/ implementation for Python ☆14 · Updated last year
- Scrapy middleware that allows crawling only new content ☆80 · Updated 2 years ago
- A collaborative platform for creating, editing and sharing JSON objects. ☆73 · Updated 2 months ago
- List of sanctions and most-wanted lists ☆26 · Updated 7 years ago
- A generic crawler ☆78 · Updated 6 years ago
- A simple Python tool that generates a requests/bs4-based web scraper ☆26 · Updated 2 years ago
- Extract the differences between two HTML pages ☆32 · Updated 6 years ago
- A project that attempts to automatically log in to a website given a single seed ☆123 · Updated 2 years ago
- Server monitoring and data-collection daemon ☆10 · Updated 5 years ago
- This is the facade for installation and access to the individual components ☆15 · Updated 6 years ago
- Processes data from images which are tagged with the specified Instagram tag. ☆13 · Updated 11 years ago
- API client for Aleph, supports bulk entity and document upload. ☆28 · Updated 5 months ago
- Collect email addresses by crawling search engine results. ☆29 · Updated 2 years ago
- darknet.py is a network application with no dependencies other than Python and Tor, useful to anonymize the traffic of Linux servers and … ☆69 · Updated 3 years ago
- With Selenium headless browsing and CAPTCHA solving ☆44 · Updated 2 years ago
- A modern code-injection framework for Python. Like Pyrasite but Kubernetes-aware. ☆60 · Updated 4 months ago
- Async dnsbl spam lists checker based on asyncio/aiodns. ☆51 · Updated 6 months ago
- DomainClassifier is a Python (2/3) library to extract and classify Internet domains/hostnames/IP addresses from raw unstructured text fil… ☆77 · Updated last year
- AnyAPI is a library that helps you write any API wrapper with ease and in a Pythonic way. ☆132 · Updated 3 years ago
- Automates the process of repeatedly searching for a website via scraped proxy IP and search keywords ☆44 · Updated last year
- ☆15 · Updated 6 years ago
- Detect phishing by fetching Certificate Transparency logs ☆20 · Updated 4 years ago
- Bot for operating snscrape in #archivebot on EFnet ☆10 · Updated 5 years ago
- Homoglyphs: get similar letters, convert to ASCII, detect possible languages and UTF-8 group. ☆80 · Updated 4 years ago
- A Python implementation of our efficient Bloom filter library. ☆30 · Updated 5 years ago
- Napkin is a simple tool to produce statistical analysis of a text ☆12 · Updated last year
- Proof of concept implementation of a cyber threat intelligence and incident handling platform ☆11 · Updated 2 years ago
- Extract social media links and account names from websites. ☆38 · Updated 4 years ago
- Parse domains using the TLD list maintained by publicsuffix.org ☆61 · Updated 4 years ago