openschemas / extractorsLinks
generic extraction recipes to get you started extracting schema.org entities for your software, data, and all things
☆14Updated 6 years ago
Alternatives and similar repositories for extractors
Users that are interested in extractors are comparing it to the libraries listed below
Sorting:
- Carles Pina Estany's 2020 Tool Fund: data managers and researchers collaborate to write the Frictionless Data packages, tabular schemas, …☆18Updated 2 years ago
- Data scraped by https://github.com/simonw/disaster-scrapers☆37Updated 2 years ago
- Open Data Portals and Sites around the world☆151Updated 3 months ago
- CKAN extension for data.gov.uk☆12Updated 2 weeks ago
- Data management service that brings continuous data validation to tabular data in your repository via Github Action☆42Updated last year
- Dataset files for the Open Data on GitHub paper☆31Updated 8 months ago
- Generate BigQuery tables, load and extract data, based on JSON Table Schema descriptors.☆18Updated 4 years ago
- Organization hierarchy - CKAN extension☆29Updated 5 months ago
- 🎉 A curated list of tools, libraries, patterns and projects in the Frictionless ecosystem.☆19Updated 4 years ago
- Remote harvesting extension for CKAN☆140Updated this week
- CKAN configuration settings available from env vars☆16Updated last year
- Scrapers for disaster data - writes to https://github.com/simonw/disaster-data☆50Updated last year
- Express Loader - quickly load data into DataStore. A replacement for DataPusher.☆54Updated 3 weeks ago
- The Federal Election Commission's web-based application that makes regulations easier to find, read and understand.☆34Updated last year
- web app for visualizing Wikidata items on a timeline☆16Updated 6 years ago
- Government of Canada CKAN Extension - Extension à CKAN du Gouvernement du Canada☆60Updated this week
- Resources for open data and enterprise data inventory management☆73Updated last week
- data - command line tool for working with data, Data Packages and the DataHub☆64Updated 2 years ago
- An easy interface for documenting data packages☆19Updated 7 years ago
- github action to run pandoc, soft-deprecated ->☆43Updated 5 years ago
- data.gov extension☆41Updated last month
- 💨🥫 A Data Factory system for running data processing pipelines built on AirFlow and tailored to CKAN. Includes evolution of DataPusher …☆33Updated 3 months ago
- Load shapefiles into a SQLite (optionally SpatiaLite) database☆32Updated 2 years ago
- Open source and open knowledge (data and content) licenses together with API and web service.☆68Updated last year
- Docker images and compose environment for local development and testing of ckan-cloud☆24Updated last year
- ☆17Updated 6 years ago
- Run Datasette on AWS serverless.☆18Updated 5 years ago
- Plugin for Intake to read from SQL servers☆15Updated 2 years ago
- The main repository of the Frictionless Data project. Website, issues, and discussions☆141Updated 3 months ago
- ☆10Updated 3 months ago