rmax / databrewerLinks
The missing datasets manager. Like hombrew but for datasets. CLI-tool for search and discover datasets!
β41Updated 8 years ago
Alternatives and similar repositories for databrewer
Users that are interested in databrewer are comparing it to the libraries listed below
Sorting:
- Find elements in HTML by matching them with a skeletonβ25Updated 3 years ago
- π·Configuration based html scraperβ23Updated 3 months ago
- Faster replacement for Python's urlparse moduleβ45Updated 7 years ago
- Find which links on a web page are pagination linksβ29Updated 9 years ago
- Modularly extensible semantic metadata validatorβ84Updated 10 years ago
- Restrict crawl and scraping scope using matchers.β26Updated 9 years ago
- Detect and classify pagination linksβ15Updated 5 years ago
- A Python parser for data that only looks like JSONβ65Updated 2 years ago
- A tiny library for Python text normalisation. Useful for ad-hoc text processing.β157Updated 4 months ago
- Python implementation of the Parsley language for extracting structured data from web pages