santinic / htmlmatchLinks
Python tool for automatic data scraping from Html templates
☆19Updated 9 years ago
Alternatives and similar repositories for htmlmatch
Users that are interested in htmlmatch are comparing it to the libraries listed below
Sorting:
- A Scrapy crawler for http://books.toscrape.com☆27Updated 8 years ago
- Lightweight library that converts a HTML webpage to JSON data using a template defined in JSON.☆23Updated 6 months ago
- A Python library to load structured table data from files/strings/URL with various data format: CSV / Excel / Google-Sheets / HTML / JSON…☆109Updated 2 years ago
- Automatically exported from code.google.com/p/guess-language☆54Updated last month
- Functional composable pipelines allowing clean separation of the business logic and its implementation☆11Updated 3 months ago
- Binary Python bindings for poppler utils for content extraction☆42Updated 4 years ago
- Example nteract notebooks with links to execution on mybinder.org☆29Updated 2 years ago
- Scraping Assisted by Learning☆36Updated 2 months ago
- Colored symbols for various log levels for Python☆42Updated last year
- Personal Knowledge Management System. Capture your ideas using plain old text files. Make a journal that lasts 100 years.☆29Updated 2 years ago
- WarcMiddleware lets users seamlessly download a mirror copy of a website when running a web crawl with the Python web crawler Scrapy.☆47Updated 7 years ago
- Automatically install missing Python modules using pip at import time.☆19Updated last year
- [Show some by giving ], A place to post your python scripts which you think are awesome☆53Updated 2 years ago
- Trough: Big data, small databases.☆40Updated last year
- Python module to watch Twitter user pages or search-results.☆64Updated 11 years ago
- CoCrawler is a versatile web crawler built using modern tools and concurrency.☆191Updated 3 years ago
- Python pretty print on steroids☆32Updated last month
- Python package to detect and return RSS / Atom feeds for a given website. The tool supports major blogging platform including Wordpress, …☆21Updated 4 years ago
- Writing a Simple DSL in Python☆22Updated 8 years ago
- 💡✏️️ ⬇️️ JSON to Markdown converter - Generate Markdown from format independent JSON☆77Updated 6 years ago
- A Python command line tool that creates a Table of Contents for Markdown documents☆94Updated 7 years ago
- A scraper focused on organizational Github accounts and their members.☆43Updated last month
- Pyfilesystem2 for various archive filesystems☆18Updated 3 years ago
- A collection of useful Python tools☆156Updated last week
- Pythonic package for combinatorics☆49Updated 4 years ago
- API - extract a list of keywords from a text.☆18Updated 8 years ago
- A search engine built on the Unpaywall database☆20Updated last year
- Graph of related videos from YouTube☆111Updated 2 years ago
- A helper library full of URL-related heuristics.☆72Updated 2 months ago
- Bringing sanity to world of messed-up data☆66Updated 11 years ago