paulsmith / templatemaker
templatemaker is a Python library that can extract data from files with a similar format, like HTML pages.
☆63Updated 4 years ago
Alternatives and similar repositories for templatemaker:
Users that are interested in templatemaker are comparing it to the libraries listed below
- feedparser but faster and worse☆103Updated 3 years ago
- unofficial git mirror of http://svn.whoosh.ca svn repo☆49Updated 15 years ago
- Readability/Boilerpipe extraction in Python☆55Updated 8 years ago
- Efficiently search the most similar strings against the query in Python.☆18Updated last month
- Modularly extensible semantic metadata validator☆84Updated 9 years ago
- MapReduce platform in python☆34Updated 9 years ago
- Pythonic interface to redis-py☆98Updated 7 years ago
- Python bindings for the Google's FarmHash☆38Updated 8 months ago
- Python's missing statistical Swiss Army knife☆15Updated 9 years ago
- Extract, parse and populate templates from strings☆27Updated 6 years ago
- A Python parser for data that only looks like JSON☆65Updated last year
- The missing datasets manager. Like hombrew but for datasets. CLI-tool for search and discover datasets!☆39Updated 7 years ago
- Regular Expression based parsers for extracting data from natural languages☆70Updated 7 years ago
- A powerful analytics python library for Redis.☆36Updated 10 years ago
- Find which links on a web page are pagination links☆29Updated 8 years ago
- Python library for creating word clouds from text☆51Updated 5 years ago
- embedded graph datastore☆185Updated 6 years ago
- A module for querying the DOM tree and writing XPath expressions using native Python syntax.☆127Updated 6 years ago
- A high-performance distributed web crawling & scraping framework written with golang and python.☆30Updated 8 years ago
- High Level Kafka Scanner☆19Updated 7 years ago
- Splicer - adds relation querying (SQL) to any python project☆72Updated 3 years ago
- A Cython implementation of the affine gap string distance☆57Updated 2 years ago
- Useful tools for working with iterators☆167Updated 8 years ago
- Lightweight, multilingual natural language processing☆63Updated 12 years ago
- ... just because nltk is too heavy☆35Updated 14 years ago
- A Python implementation of the Double Metaphone algorithm☆61Updated 14 years ago
- Convert URL's to a normalized unicode format☆67Updated 7 years ago
- RGP -- Redis Graph via Python☆30Updated 9 years ago
- Simple plotting for Python. Python wrapper for D3xter - render charts in the browser with simple Python syntax.☆31Updated 6 years ago
- Utilities to help fight and prevent memory leaks☆174Updated 2 years ago