rmax / databrewer
The missing datasets manager. Like Homebrew but for datasets. A CLI tool for searching and discovering datasets!
☆41 Updated 8 years ago
Alternatives and similar repositories for databrewer
Users that are interested in databrewer are comparing it to the libraries listed below
- Find which links on a web page are pagination links ☆29 Updated 9 years ago
- Find elements in HTML by matching them with a skeleton ☆25 Updated 3 years ago
- Restrict crawl and scraping scope using matchers. ☆26 Updated 9 years ago
- Faster replacement for Python's urlparse module ☆45 Updated 7 years ago
- Python implementation of the Parsley language for extracting structured data from web pages ☆92 Updated 8 years ago
- Modularly extensible semantic metadata validator ☆84 Updated 10 years ago
- Simple to use python library for Buffer App ☆23 Updated 3 years ago
- View requests objects with style ☆42 Updated 9 years ago
- Easy Python packages creation. ☆248 Updated 5 years ago
- Python library with common functionality for writing web scrapers ☆102 Updated 10 years ago
- A tiny library for Python text normalisation. Useful for ad-hoc text processing. ☆157 Updated 4 months ago
- 🕷 Configuration based html scraper ☆23 Updated 3 months ago
- Command-Line utilities for Click (extracted from Clint). ☆30 Updated 8 years ago
- Python3 SOAP client built with lxml and requests. ☆43 Updated 5 years ago
- Perform lexical analysis on words, one word at a time. ☆64 Updated 7 years ago
- csvcat ☆22 Updated 9 years ago
- Share localhost through SSH. Local/Remote port forwarding made safe and easy. ☆109 Updated 3 years ago
- 🌆 TouristFriend API lets you query Google Places, Yelp and Foursquare at the same time, with Bayesian rankings! ☆29 Updated 7 years ago
- An (unofficial) command line interface for Google APIs ☆31 Updated 2 years ago
- Detect and classify pagination links ☆15 Updated 5 years ago
- Automatic Item List Extraction ☆86 Updated 9 years ago
- Small set of utilities to simplify writing Scrapy spiders. ☆49 Updated 10 years ago
- Simple library to cleanup and prettify url patterns and emails ☆137 Updated 3 years ago
- A component that tries to avoid downloading duplicate content ☆27 Updated last week
- python library for extracting html microdata ☆167 Updated 2 years ago
- Easy way for HTML parsing and building XPath ☆135 Updated 3 years ago
- Highly optimized geolocation inference package for spatial approximation ☆87 Updated 2 years ago
- A command-line script to get all the contributors for one or more GitHub projects. ☆33 Updated 4 years ago
- xmldataset: xml parsing made easy 🗃️ ☆80 Updated 5 years ago
- feedparser but faster and worse ☆104 Updated 4 years ago