santinic / htmlmatch
Python tool for automatic data scraping from HTML templates
☆19 Updated 9 years ago
Alternatives and similar repositories for htmlmatch
Users who are interested in htmlmatch are comparing it to the libraries listed below.
- Lightweight library that converts an HTML webpage to JSON data using a template defined in JSON. ☆23 Updated 4 months ago
- A Scrapy crawler for http://books.toscrape.com ☆27 Updated 8 years ago
- A Python library to load structured table data from files/strings/URL with various data formats: CSV / Excel / Google-Sheets / HTML / JSON… ☆108 Updated 2 years ago
- Colored symbols for various log levels for Python ☆42 Updated last year
- Automatically install missing Python modules using pip at import time. ☆19 Updated last year
- Binary Python bindings for poppler utils for content extraction ☆42 Updated 4 years ago
- The Python module that swims ☆65 Updated 2 years ago
- A scraper focused on organizational GitHub accounts and their members. ☆42 Updated 3 years ago
- Personal Knowledge Management System. Capture your ideas using plain old text files. Make a journal that lasts 100 years. ☆29 Updated last year
- Find the path of a key / value in a JSON hierarchy easily. ☆97 Updated 6 months ago
- Stickynotes for your desktop easily from the command line! ☆37 Updated 5 years ago
- CoCrawler is a versatile web crawler built using modern tools and concurrency. ☆189 Updated 3 years ago
- Advanced similarity and duplicate source code proof of concept for our research efforts. ☆52 Updated 3 years ago
- (Deprecated - please use https://github.com/gmarmstrong/python-datamuse) Python wrapper for the Datamuse API ☆16 Updated 7 years ago
- A powerful command line interface for working with DBHub.io ☆47 Updated last year
- Dump (freeze) SQL query results from a database into a selection of file formats ☆91 Updated 6 years ago
- Fast indexed Python HTML parser which builds a DOM node tree, providing common getElementsBy* functions for scraping, testing, modificati… ☆102 Updated 2 years ago
- Extract text from HTML ☆134 Updated 5 years ago
- Chunks of Python I've found useful. ☆63 Updated 5 years ago
- Dynamic web based reports/dashboards in Python ☆116 Updated 2 weeks ago
- Scraping Assisted by Learning ☆35 Updated last month
- A Python Instagram scraper which uses BeautifulSoup and JSON to scrape public Instagram accounts ☆27 Updated 8 years ago
- 💡✏️️ ⬇️️ JSON to Markdown converter - Generate Markdown from format independent JSON ☆76 Updated 6 years ago
- Scraping tweets quickly using Celery, RabbitMQ and Docker cluster ☆50 Updated 2 years ago
- ARGUS is an easy-to-use web scraping tool. The program is based on the Scrapy Python framework and is able to crawl a broad range of diff… ☆89 Updated 3 years ago
- Just is a wrapper to automagically read/write a file based on extension ☆51 Updated 3 months ago
- A Python command line tool that creates a Table of Contents for Markdown documents ☆94 Updated 7 years ago
- Python library for extracting text from various file formats (for indexing). ☆113 Updated 3 years ago
- Automatically exported from code.google.com/p/guess-language ☆53 Updated last year
- Programmable browser for functional black-box tests ☆21 Updated last month