rmax / databrewer
The missing datasets manager. Like Homebrew, but for datasets. A CLI tool to search and discover datasets!
☆41 Updated 8 years ago
Alternatives and similar repositories for databrewer
Users who are interested in databrewer are comparing it to the libraries listed below.
- Find elements in HTML by matching them with a skeleton ☆25 Updated 3 years ago
- Find which links on a web page are pagination links ☆29 Updated 8 years ago
- 🕷Configuration based html scraper ☆23 Updated 7 months ago
- Faster replacement for Python's urlparse module ☆46 Updated 7 years ago
- Modularly extensible semantic metadata validator ☆84 Updated 9 years ago
- A tiny library for Python text normalisation. Useful for ad-hoc text processing. ☆155 Updated last month
- Restrict crawl and scraping scope using matchers. ☆26 Updated 9 years ago
- View requests objects with style ☆42 Updated 8 years ago
- Perform lexical analysis on words, one word at a time. ☆64 Updated 7 years ago
- Easy way for HTML parsing and building XPath ☆135 Updated 3 years ago
- Simple library to cleanup and prettify url patterns and emails ☆137 Updated 3 years ago
- Highly optimized geolocation inference package for spatial approximation ☆87 Updated 2 years ago
- PyQuery-based scraping micro-framework. ☆118 Updated 3 years ago
- Share localhost through SSH. Local/Remote port forwarding made safe and easy. ☆109 Updated 3 years ago
- A component that tries to avoid downloading duplicate content ☆27 Updated 7 years ago
- Ice - WSGI on the rocks ☆60 Updated 8 years ago
- Python implementation of the Parsley language for extracting structured data from web pages ☆92 Updated 7 years ago
- Easy Python packages creation. ☆249 Updated 5 years ago
- Flask extension that creates a simple interface to the Bitmapist analytics library. ☆37 Updated 4 years ago
- Python library with common functionality for writing web scrapers ☆102 Updated 10 years ago
- imgspy finds the metadata (type, size) of an image given its url by fetching as little as needed ☆55 Updated 5 years ago
- Command-Line utilities for Click (extracted from Clint). ☆30 Updated 8 years ago
- Paginating the web ☆37 Updated 11 years ago
- HTTP client for Open API ☆60 Updated 9 years ago
- Python library for Hoverfly (now obsolete) ☆79 Updated 2 years ago
- A Python parser for data that only looks like JSON ☆65 Updated 2 years ago
- An extended version of the official Elasticsearch Python client. ☆63 Updated 9 years ago
- Simple plotting for Python. Python wrapper for D3xter - render charts in the browser with simple Python syntax. ☆31 Updated 7 years ago
- butterdb is a Python object mapper for Google Drive Spreadsheets. Still in development, but usable. ☆340 Updated 10 years ago
- python library for extracting html microdata ☆166 Updated 2 years ago