pudo-attic / archivekit
ArchiveKit manages data and documents during ETL processes, either on a local file system or on S3.
☆15 Updated 10 years ago
Alternatives and similar repositories for archivekit
Users who are interested in archivekit are comparing it to the libraries listed below.
- A contextual news development environment. ☆49 Updated 11 years ago
- Next-gen web application for public finance data warehouses, formerly OpenSpending ☆57 Updated 3 years ago
- ☆23 Updated 10 years ago
- Python library with common functionality for writing web scrapers ☆102 Updated 10 years ago
- Make for data ☆21 Updated 7 years ago
- Easily crowdsource the analysis of your documents ☆102 Updated 8 years ago
- NPR Visual's Carebot (deprecated, now in: https://github.com/thecarebot/carebot) ☆15 Updated 10 years ago
- An ArchieML parser for Python ☆11 Updated 10 years ago
- Code for Newslynx App ☆22 Updated 10 years ago
- Manage and load dataprotocols.org Data Packages ☆27 Updated 10 years ago
- An alpha project combining beneficial ownership and contracting data ☆13 Updated 4 years ago
- legacy backend for Open States ☆87 Updated 6 years ago
- Re-usable wrapper scripts for text document extractors. ☆37 Updated 9 years ago
- A data processing pipeline that schedules and runs content harvesters, normalizes their data, and outputs that normalized data to a varie… ☆42 Updated 9 years ago
- [DEPRECATED] Please use https://github.com/frictionlessdata/specs ☆17 Updated 8 years ago
- Parser and standardizer for politician, individual and organization names. ☆128 Updated 8 years ago
- Open source tool to help journalists easily mash up data based on shared geography. ☆59 Updated 10 years ago
- Python library and command line tool for converting data from one format to another ☆99 Updated 5 years ago
- Transform flat data structures into nested object graphs matching JSON schema definitions. ☆28 Updated 9 years ago
- Tools for tracking stories on news homepages ☆48 Updated 6 years ago
- Python client library for controlling Google Refine ☆83 Updated 8 years ago
- How can we improve name matching in screening tools? ☆15 Updated 5 months ago
- Open Knowledge coding standards and style guide. ☆35 Updated 6 years ago
- Akara is an open-source (Apache2 license) Web framework specialized for RESTful data services, especially involving XML and other semi-st… ☆25 Updated 12 years ago
- OpenBlock is a web application and RESTful service that allows users to browse and search their local area for "hyper-local news ☆61 Updated 4 years ago
- An attempt at creating a gold standard dataset for backtesting yesterday & today's content-extractors ☆35 Updated 10 years ago
- Docker-based CKAN environments, with bells and whistles ☆70 Updated 8 years ago
- Data Pipes for CSV ☆115 Updated 3 years ago
- Generate SQL tables, load and extract data, based on JSON Table Schema descriptors. ☆62 Updated 2 years ago
- Monitor datasets, get alerts when something happens ☆210 Updated 7 years ago