santinic / htmlmatch
Python tool for automatic data scraping from Html templates
☆19Updated 8 years ago
Related projects ⓘ
Alternatives and complementary repositories for htmlmatch
- A Scrapy crawler for http://books.toscrape.com☆27Updated 7 years ago
- A scraper focused on organizational Github accounts and their members.☆40Updated 2 years ago
- Python and pandas tools to perform various analyses on different types of word lists☆16Updated 9 years ago
- Stylometric framework in Python☆13Updated 9 years ago
- Command Line Application for Job Search☆13Updated 7 years ago
- (Deprecated - please use https://github.com/gmarmstrong/python-datamuse) Python wrapper for the Datamuse API☆15Updated 6 years ago
- Lightweight library that converts a HTML webpage to JSON data using a template defined in JSON.☆21Updated 4 years ago
- Functional composable pipelines allowing clean separation of the business logic and its implementation☆11Updated 5 months ago
- Processes data from images which are tagged with the specified Instagram tag.☆13Updated 10 years ago
- https://mimesniff.spec.whatwg.org/ implementation for Python☆14Updated 10 months ago
- 💻 Terminal-like Python input( ) function.☆19Updated 5 years ago
- Automatically install missing Python modules using pip at import time.☆18Updated 10 months ago
- Demo of the Newspaper article extraction library.☆29Updated 10 years ago
- List of libraries, tools and APIs for web scraping and data processing.☆13Updated 9 years ago
- bamboolib - template for creating your own binder notebook☆21Updated 2 years ago
- Find rss, atom, xml, and rdf feeds on webpages☆30Updated last month
- Virtual patent marking crawler at iproduct.epfl.ch☆14Updated 7 years ago
- Programmable browser for functional black-box tests☆21Updated 5 months ago
- Scraping Assisted by Learning☆35Updated 2 months ago
- Python 3 implementation and documentation of the Hermina-Janos local graph clustering algorithm.☆21Updated last year
- Example nteract notebooks with links to execution on mybinder.org☆27Updated last year
- Simple tools for summarizing .mbox email archives.☆10Updated 4 years ago
- Dataset of 125,000 Medium Blog Post Titles and Subtitles (with Categories)☆21Updated 5 years ago
- Proxy-list management application for Django☆23Updated 6 years ago
- An HTTP log monitoring tool for your terminal☆21Updated 4 years ago
- Smarties is a Text Classifier using an innovative method based on Wikipedia to classify any documents/text. We use a Machine Learning and…☆21Updated 6 years ago
- Fast extraction of all external links from wikipedia☆10Updated 6 years ago
- A Python command line tool that creates a Table of Contents for Markdown documents☆92Updated 6 years ago