santinic / htmlmatchLinks
Python tool for automatic data scraping from Html templates
☆19Updated 9 years ago
Alternatives and similar repositories for htmlmatch
Users that are interested in htmlmatch are comparing it to the libraries listed below
Sorting:
- A scraper focused on organizational Github accounts and their members.☆42Updated 2 years ago
- Scraping Assisted by Learning☆35Updated 2 months ago
- A simple Web crawler for stackshare.io using scrapy .☆9Updated 6 years ago
- sync a website or local spreadsheet with a google sheet☆35Updated 2 years ago
- Turn your IPython console into a cross-database SQL client☆31Updated 9 years ago
- Example nteract notebooks with links to execution on mybinder.org☆29Updated 2 years ago
- bringing sanity to world of messed-up data☆33Updated last year
- Information extraction and interactive visualization of textual datasets for investigative data-driven journalism and eDiscovery☆56Updated last year
- A Scrapy crawler for http://books.toscrape.com☆27Updated 8 years ago
- A financial disclosure data extraction tool.☆16Updated last year
- Using NLP to find and extract specific information from long, unstructured documents☆15Updated 7 years ago
- List of libraries, tools and APIs for web scraping and data processing.☆13Updated 9 years ago
- Automates the process of repeatedly searching for a website via scraped proxy IP and search keywords☆45Updated last year
- Word analysis, by domain, on the Common Crawl data set for the purpose of finding industry trends☆57Updated last year
- Personal Knowledge Management System. Capture your ideas using plain old text files. Make a journal that lasts 100 years.☆29Updated last year
- Ask questions about government data.☆38Updated 6 years ago
- Processes data from images which are tagged with the specified Instagram tag.☆13Updated 11 years ago
- Find rss, atom, xml, and rdf feeds on webpages☆30Updated 9 months ago
- Advanced similarity and duplicate source code proof of concept for our research efforts.☆52Updated 2 years ago
- Lightweight library that converts a HTML webpage to JSON data using a template defined in JSON.☆23Updated last month
- Python wrapper for a C++ Double Metaphone☆15Updated last week
- Colored symbols for various log levels for Python☆42Updated 11 months ago
- Demo of the Newspaper article extraction library.☆29Updated 10 years ago
- Pre-built Scrapy spiders for AutoExtract☆19Updated last year
- Console program to get global ranking for a given website or domain☆21Updated last month
- bamboolib - template for creating your own binder notebook☆21Updated 3 years ago
- Python and pandas tools to perform various analyses on different types of word lists☆16Updated 10 years ago
- Pythonic package for combinatorics☆50Updated 3 years ago
- Scraper built with Scrapy.☆18Updated 11 months ago
- Datasette showing global power plant data from https://github.com/wri/global-power-plant-database☆17Updated last month