INNOVINATI / microwler

A micro-framework for asynchronous deep crawls and web scraping with Python

☆13

Related projects ⓘ

Alternatives and complementary repositories for microwler

scrapy / xtractmime
https://mimesniff.spec.whatwg.org/ implementation for Python
☆14Updated 10 months ago
roshanlam / SearchGar
SearchGar - An actual Search Engine made using Python
☆18Updated 2 years ago
python-ruia / awesome-ruia
A list of awesome project for Ruia
☆13Updated 2 years ago
KoustavCode / pyenvcomp
A simple commandline utility to visually compare two python virtual environments.
☆15Updated 4 years ago
adipasquale / techcrunch-incremental-scrapy-spider-with-mongodb
Techcrunch Incremental Scrapy Spider With MongoDB
☆16Updated 5 years ago
chuanconggao / html2json
Lightweight library that converts a HTML webpage to JSON data using a template defined in JSON.
☆21Updated 4 years ago
terror / paragon
A lightweight command line benchmarking utility
☆13Updated 3 years ago
HyperionGray / starbelly
Streaming web crawler with WebSocket API
☆44Updated last year
pamoroso / spacestills
A NASA TV still frame viewer
☆14Updated 11 months ago
scrapy-plugins / scrapy-headless
☆29Updated 3 years ago
rootVIII / proxy_web_crawler
Automates the process of repeatedly searching for a website via scraped proxy IP and search keywords
☆42Updated last year
fangfufu / reddit-regex-counter
Count the number of matches for a regex string in a subreddit
☆11Updated 4 years ago
itmaster921 / monitoring
Server monitoring and data-collection daemon
☆10Updated 5 years ago
pyrustic / shared
Triptych for data exchange and persistence
☆23Updated 8 months ago
gsmecher / awaitless
ipython + REPL + coroutines - suffering
☆16Updated 2 months ago
M4cs / jsonsty
Free Cloud JSON Storage Written in Python
☆14Updated last year
verifid / ner-d
Python module for Named Entity Recognition (NER) using natural language processing.
☆14Updated 3 years ago
alttch / atasker
Python library for modern thread / multiprocessing pooling and task processing via asyncio
☆15Updated 3 years ago
piccolo-orm / targ
Python CLI using type hints and docstrings.
☆20Updated 4 months ago
xtream1101 / scraperx
Library for scraping websites or apis at any scale
☆54Updated 9 months ago
Girbons / mercury-parserpy
python api wrapper for https://mercury.postlight.com/web-parser/
☆23Updated last year
mkrd / Flask-Squeeze
Automatically minify JS/CSS and compress all responses with brotli, defalte or gzip, with caching for static assets
☆11Updated 5 months ago
sanic-org / tracerite
Tracebacks for Humans (in Jupyter notebooks)
☆12Updated 7 months ago
purarue / autotui
quickly create UIs to interactively prompt, validate, and persist python objects to disk (JSON/YAML) and back using type hints
☆13Updated 3 weeks ago
rednafi / fork-purger
Delete all of your forked repositories on Github
☆33Updated last year