MrDebugger / bs2json
A python3 module that converts your bs4 Tag into json object (dict)
☆14Updated 11 months ago
Alternatives and similar repositories for bs2json:
Users that are interested in bs2json are comparing it to the libraries listed below
- Lightweight library that converts a HTML webpage to JSON data using a template defined in JSON.☆21Updated 4 years ago
- API client for Aleph, supports bulk entity and document upload.☆28Updated 4 months ago
- A base library for building web scrapers for statistical data, and a helper ontology for (primarily Swedish) statistical data.☆13Updated last week
- Graph analysis project on Wikipedia categories & pages (Python Flask, Neo4j db, d3.js)☆8Updated 6 years ago
- A helper library full of URL-related heuristics.☆66Updated 5 months ago
- Where I keep my Python notes for starting projects☆9Updated 2 years ago
- Datasette plugin providing instructions for exporting data to Jupyter or Observable☆12Updated last year
- Python API for parsehub.com web scraping service☆45Updated 6 years ago
- Flask App - Argon Design System | AppSeed☆11Updated 4 years ago
- A scraping Master-slave system based on Google App Engine☆11Updated 4 years ago
- Scripts for Internet Archive☆12Updated 4 years ago
- Python ffmpeg wrapper for audio and video editing (trim, subtitles/overlay, concat, merge, & more!)☆23Updated 5 years ago
- A PDF classifier ensemble with REST API service☆23Updated 4 years ago
- Scraping Assisted by Learning☆35Updated last month
- scraper for facebook, gab, google and tiktok☆22Updated 8 months ago
- etl pipeline, graphical explorer and general toolbox for investigations with follow the money data☆15Updated last year
- ConTEXT Explorer is an open Web-based system for exploring and visualizing concepts (combinations of occurring words and phrases) over ti…☆9Updated 3 years ago
- Datasette plugin for uploading CSV files and converting them to database tables☆26Updated 11 months ago
- Automatically minify JS/CSS and compress all responses with brotli, defalte or gzip, with caching for static assets☆12Updated this week
- Word Religion Projections (2010-2050)☆14Updated 4 months ago
- Extract social media links and account names from websites.☆37Updated 4 years ago
- OpenRefine reconciler for Research Organization Registry☆13Updated this week
- A financial disclosure data extraction tool.☆13Updated last year
- A low-code microservices platform designed for legal engineers. Given a document, Gremlin will apply a series of Python scripts to it and…☆29Updated 2 years ago
- Binary Python bindings for poppler utils for content extraction☆42Updated 3 years ago
- A scraper focused on organizational Github accounts and their members.☆42Updated 2 years ago
- Scrapfly Python SDK for headless browsers and proxy rotation☆39Updated last month
- Ricgraph - Research in context graph☆27Updated this week
- An open-source toolkit for analyzing line-oriented JSON Twitter archives with Apache Spark.☆9Updated 2 months ago