rmax / databrewer
The missing datasets manager. Like Homebrew, but for datasets. A CLI tool to search for and discover datasets!
☆41 Updated 7 years ago
Alternatives and similar repositories for databrewer:
Users who are interested in databrewer are comparing it to the libraries listed below.
- Find elements in HTML by matching them with a skeleton ☆25 Updated 2 years ago
- Faster replacement for Python's urlparse module ☆46 Updated 6 years ago
- A component that tries to avoid downloading duplicate content ☆27 Updated 6 years ago
- Find which links on a web page are pagination links ☆29 Updated 8 years ago
- 🕷 Configuration based html scraper ☆22 Updated 7 months ago
- A high-performance distributed web crawling & scraping framework written with golang and python. ☆30 Updated 8 years ago
- Small set of utilities to simplify writing Scrapy spiders. ☆49 Updated 9 years ago
- Context manager to maintain your temporary directories/files. ☆17 Updated last year
- Modularly extensible semantic metadata validator ☆83 Updated 9 years ago
- A native web-based client for Slack. ☆23 Updated 7 years ago
- templatemaker is a Python library that can extract data from files with a similar format, like HTML pages. ☆63 Updated 4 years ago
- ☆46 Updated 7 years ago
- Perform lexical analysis on words, one word at a time. ☆64 Updated 6 years ago
- High Level Kafka Scanner ☆19 Updated 7 years ago
- Exporters is an extensible export pipeline library that supports filter, transform and several sources and destinations ☆40 Updated 7 months ago
- Analysis engine for movie watching habits ☆22 Updated 6 years ago
- a better repr for closures ☆11 Updated 8 years ago
- Simple CLI tool to inspect your Python modules ☆20 Updated 8 years ago
- A scrapy extension to store requests and responses information in storage service ☆26 Updated 2 years ago
- Restrict crawl and scraping scope using matchers. ☆25 Updated 8 years ago
- A fork of http://pydispatcher.sourceforge.net/ with PyPy support ☆16 Updated 7 years ago
- Python implementation of the Parsley language for extracting structured data from web pages ☆92 Updated 7 years ago
- Paginating the web ☆37 Updated 10 years ago
- Commit Counter Chart is a Python Flask app to view git history using D3.js ☆38 Updated 8 years ago
- HTTP client for Open API ☆59 Updated 8 years ago