MrDebugger / bs2json
A Python 3 module that converts a bs4 Tag into a JSON object (dict)
☆14 Updated last year
Alternatives and similar repositories for bs2json:
Users who are interested in bs2json compare it to the libraries listed below.
- Python based Wikidata framework for easy dataframe extraction ☆43 Updated last year
- Binary Python bindings for poppler utils for content extraction ☆42 Updated 3 years ago
- API client for Aleph, supports bulk entity and document upload. ☆28 Updated 5 months ago
- A terminal user interface for searching google ☆11 Updated 3 years ago
- Processes data from images which are tagged with the specified Instagram tag. ☆13 Updated 11 years ago
- https://mimesniff.spec.whatwg.org/ implementation for Python ☆13 Updated last year
- Scrape various open data directories to create an index of what's available out there ☆36 Updated last month
- Generate Python data structures and XML parser from Xschema (Python 3 port) ☆11 Updated 10 years ago
- GitHub Action for isort, flake8 and black ☆7 Updated 3 years ago
- A scraper focused on organizational GitHub accounts and their members. ☆42 Updated 2 years ago
- Celery plugin to autoscale based on available CPU, memory, or other system attributes. ☆11 Updated 7 years ago
- A PDF classifier ensemble with REST API service ☆23 Updated 4 years ago
- Schema.org classes in pydantic ☆66 Updated 2 years ago
- Commons of stupid, simple Python micro functions. Pull requests very welcome. ☆19 Updated 2 years ago
- ETL pipeline, graphical explorer and general toolbox for investigations with Follow the Money data ☆19 Updated last year
- how hard is it to get a list of all local news sites in the United States (LOL) ☆8 Updated 4 years ago
- A helper library full of URL-related heuristics. ☆69 Updated 2 weeks ago
- A base library for building web scrapers for statistical data, and a helper ontology for (primarily Swedish) statistical data. ☆13 Updated last month
- Python functions for applied use of schema.org ☆36 Updated 3 years ago
- Curated list of awesome software and resources for Senzing, The First Real-Time AI for Entity Resolution. ☆57 Updated 3 months ago
- Functional composable pipelines allowing clean separation of the business logic and its implementation ☆11 Updated 10 months ago
- Web interface for network analysis. ☆21 Updated 2 years ago
- 🎉 A curated list of tools, libraries, patterns and projects in the Frictionless ecosystem. ☆19 Updated 3 years ago
- Named-Entity Recognition extension for OpenRefine ☆27 Updated 2 years ago
- Python tool for automatic data scraping from HTML templates ☆19 Updated 8 years ago
- Zyte Automatic Extraction integration for Scrapy ☆56 Updated 3 years ago
- A simple Python tool that generates a requests/bs4 based web scraper ☆26 Updated 2 years ago
- data wrangling simplicity, complete audit transparency, and at speed ☆34 Updated 3 weeks ago
- 🎉 A curated list of all awesome things related to CKAN ☆39 Updated 2 years ago
- Data notification service: subscribe to keywords and get notified whenever an open data source mentions that keyword. ☆24 Updated 11 years ago