santinic / htmlmatchLinks
Python tool for automatic data scraping from Html templates
☆19Updated 9 years ago
Alternatives and similar repositories for htmlmatch
Users that are interested in htmlmatch are comparing it to the libraries listed below
Sorting:
- A Scrapy crawler for http://books.toscrape.com☆27Updated 8 years ago
- A base library for building web scrapers for statistical data, and a helper ontology for (primarily Swedish) statistical data.☆14Updated 6 months ago
- Lightweight library that converts a HTML webpage to JSON data using a template defined in JSON.☆23Updated 3 months ago
- Colored symbols for various log levels for Python☆42Updated last year
- Automatically exported from code.google.com/p/guess-language☆52Updated last year
- CoCrawler is a versatile web crawler built using modern tools and concurrency.☆190Updated 3 years ago
- A machine readable JSON QAnon dataset, archiving all QAnon drops for research only☆28Updated 3 months ago
- Example nteract notebooks with links to execution on mybinder.org☆29Updated 2 years ago
- Automates the process of repeatedly searching for a website via scraped proxy IP and search keywords☆45Updated last year
- Word analysis, by domain, on the Common Crawl data set for the purpose of finding industry trends☆57Updated last year
- OneResumé is a data-driven resumé generator for text and Microsoft Word documents.☆14Updated 10 years ago
- Quickly turn command-line applications into RESTful webservices with a web-application front-end. You provide a specification of your com…☆131Updated 5 months ago
- useful python script and snippets of code.☆66Updated 2 years ago
- Writing a Simple DSL in Python☆22Updated 7 years ago
- Personal Knowledge Management System. Capture your ideas using plain old text files. Make a journal that lasts 100 years.☆29Updated last year
- Dump (freeze) SQL query results from a database into a selection of file formats☆92Updated 6 years ago
- (Deprecated - please use https://github.com/gmarmstrong/python-datamuse) Python wrapper for the Datamuse API☆15Updated 7 years ago
- Socrates is a thin wrapper around an early-stage [AllenNLP](https://allennlp.org/) model that enables machine reading comprehension (MRC)…☆14Updated 4 years ago
- Pythonic package for combinatorics☆49Updated 3 years ago
- Take streaming tweets, extract hashtags & usernames, create graph, export graphml for Gephi visualisation☆38Updated 12 years ago
- A pipeline for detecting novel information about entities from a stream of text, updating a knowledge base about the entities, and genera…☆32Updated 6 years ago
- Python wrapper library for the Datamuse API☆80Updated 2 years ago
- Scraping Assisted by Learning☆35Updated 3 weeks ago
- Scraper built with Scrapy.☆18Updated last year
- A Python library to load structured table data from files/strings/URL with various data format: CSV / Excel / Google-Sheets / HTML / JSON…☆108Updated 2 years ago
- A spell-checker extending Peter Norvig's with multi-typo correction, hamming distance weighting, and more.☆98Updated 4 years ago
- Extract text from HTML☆134Updated 5 years ago
- A powerful command line interface for working with DBHub.io☆48Updated last year
- A generic crawler☆78Updated 7 years ago
- Simple python workflow engine based on asyncio and a DAG structure.☆62Updated 8 years ago