pudo-attic / archivekitLinks
ArchiveKit manages data and documents during ETL processes, either on a local file system or on S3.
☆15Updated 10 years ago
Alternatives and similar repositories for archivekit
Users that are interested in archivekit are comparing it to the libraries listed below
Sorting:
- LoadKit supports Extract, Transform, Load processes based on ArchiveKit buckets.☆11Updated 10 years ago
- A contextual news development environment.☆49Updated 10 years ago
- Simple type converters: make ints, floats, bools and dates from your strings!☆11Updated 8 years ago
- Simplifying the process of launching an open data repository. [RETIRED]☆20Updated 10 years ago
- Measure is scripts and conventions to build KPI dashboards for projects.☆17Updated 4 years ago
- Python library with common functionality for writing web scrapers☆102Updated 9 years ago
- Transform flat data structures into nested object graphs matching JSON schema definitions.☆28Updated 8 years ago
- A pastebin for tables.☆34Updated 11 years ago
- Next-gen web application for public finance data warehouses, formerly OpenSpending☆57Updated 2 years ago
- A simple python-based abstraction library for the various blob storage out there including s3, google storage and local disk.☆32Updated 7 years ago
- Data notification service: subscribe to keywords and get notified whenever an open data sources mentions that keyword.☆24Updated 11 years ago
- Utilities for working with data.☆20Updated 10 years ago
- Manage and load dataprotocols.org Data Packages☆27Updated 9 years ago
- agate-charts adds exploratory charting support to agate.☆9Updated 3 months ago
- Investigative tool for extracting relevant areas from many documents☆14Updated 9 years ago
- Machine assisted dossiers☆19Updated 7 years ago
- Re-usable wrapper scripts for text document extractors.☆37Updated 9 years ago
- An alpha project combining beneficial ownership and contracting data☆13Updated 4 years ago
- Archive of political ad data from the Federal Communications Commission☆20Updated 7 years ago
- legacy backend for Open States☆87Updated 5 years ago
- A tool to allow US addresses to be geocoded/georeferenced easily, without using Python or the command line or paid services or anything.☆18Updated 2 years ago
- Python language parser for a tabular format for structured metadata. http://metatab.org☆18Updated last year
- [DEPRECATED] Please use https://github.com/frictionlessdata/specs☆17Updated 7 years ago
- common data interchange format for document processing pipelines that apply natural language processing tools to large streams of text☆35Updated 8 years ago
- Provide partial dates and retain the date precision through processing☆13Updated 2 years ago
- Little JSON object want to be graphs, too!☆17Updated 9 years ago
- Make for data☆20Updated 6 years ago
- Versioned domain model. Python library for revisioning/versioning of databases.☆44Updated 4 years ago
- Plots various graphs for a series of plaintext files in a directory☆19Updated 9 years ago
- A simple transformation/data processing pipeline for CrisisNET☆15Updated 10 years ago