lugensa / scorched
Sunburnt offspring solr client
☆27 Updated 3 years ago
Alternatives and similar repositories for scorched
Users who are interested in scorched are comparing it to the libraries listed below.
- Modularly extensible semantic metadata validator ☆84 Updated 9 years ago
- Skinfer is a tool for inferring and merging JSON schemas ☆139 Updated last year
- Manage and load dataprotocols.org Data Packages ☆27 Updated 9 years ago
- A tiny library for Python text normalisation. Useful for ad-hoc text processing. ☆153 Updated 3 weeks ago
- [UNMAINTAINED] Deploy, run and monitor your Scrapy spiders. ☆11 Updated 10 years ago
- Python to Gremlin Graph Abstraction Layer ☆55 Updated 8 years ago
- python library for extracting html microdata ☆166 Updated 2 years ago
- Streaming newline delimited JSON I/O. ☆12 Updated 2 years ago
- Python library for class-based schema definition, object serialization and data validation ☆61 Updated 9 years ago
- Extract, parse and populate templates from strings ☆27 Updated 6 years ago
- Python implementation of the Parsley language for extracting structured data from web pages ☆92 Updated 7 years ago
- Framework for making good Python API client libraries using urllib3. ☆88 Updated 6 years ago
- mltk - Moz Language Tool Kit ☆12 Updated 10 years ago
- Find which links on a web page are pagination links ☆29 Updated 8 years ago
- Faster replacement for Python's urlparse module ☆46 Updated 6 years ago
- Tool to flatten stream of JSON-like objects, configured via schema ☆33 Updated 5 years ago
- Makes it easy to respect rate limits. ☆96 Updated 8 years ago
- Regular Expression based parsers for extracting data from natural languages ☆70 Updated 8 years ago
- Pure Python wrapper to the Yajl C Library ☆84 Updated 8 months ago
- DEPRECATED: Video data for Python related conferences ☆106 Updated 9 years ago
- Restrict crawl and scraping scope using matchers. ☆26 Updated 9 years ago
- Lightweight data validation and adaptation Python library. ☆263 Updated 2 years ago
- A slim, non-SWIG Python adapter to CTesseract (Tesseract OCR for C). ☆24 Updated 11 years ago
- Utilities for data cleaning and ETL processing ☆23 Updated 7 years ago
- Street address parser and formatter ☆91 Updated 5 years ago
- csvcat ☆22 Updated 9 years ago
- A series of tubes. ☆56 Updated 11 months ago
- An attempt at creating a silver/gold standard dataset for backtesting yesterday & today's content-extractors ☆35 Updated 10 years ago
- ArchiveKit manages data and documents during ETL processes, either on a local file system or on S3. ☆15 Updated 10 years ago
- Unicode transliteration in Python (clone of Tomaž Šolc repository at zemanta.com) ☆114 Updated 9 years ago