pudo-attic / archivekit
ArchiveKit manages data and documents during ETL processes, either on a local file system or on S3.
☆15 · Updated 9 years ago
Alternatives and similar repositories for archivekit:
Users interested in archivekit are comparing it to the libraries listed below.
- LoadKit supports Extract, Transform, Load processes based on ArchiveKit buckets. ☆11 · Updated 9 years ago
- Simplifying the process of launching an open data repository. [RETIRED] ☆20 · Updated 10 years ago
- Simple type converters: make ints, floats, bools and dates from your strings! ☆11 · Updated 8 years ago
- A contextual news development environment. ☆49 · Updated 10 years ago
- Data notification service: subscribe to keywords and get notified whenever an open data source mentions that keyword. ☆24 · Updated 11 years ago
- A tool to allow US addresses to be geocoded/georeferenced easily, without using Python or the command line or paid services or anything. ☆17 · Updated 2 years ago
- Provide partial dates and retain the date precision through processing ☆13 · Updated 2 years ago
- Re-usable wrapper scripts for text document extractors. ☆37 · Updated 8 years ago
- Utilities for working with data. ☆20 · Updated 9 years ago
- A pastebin for tables. ☆34 · Updated 11 years ago
- Python library with common functionality for writing web scrapers ☆102 · Updated 9 years ago
- Transform flat data structures into nested object graphs matching JSON schema definitions. ☆28 · Updated 8 years ago
- Archive of political ad data from the Federal Communications Commission ☆20 · Updated 7 years ago
- Code for Newslynx App ☆22 · Updated 9 years ago
- Investigative tool for extracting relevant areas from many documents ☆14 · Updated 9 years ago
- An alpha project combining beneficial ownership and contracting data ☆13 · Updated 3 years ago
- An ArchieML parser for Python ☆11 · Updated 9 years ago
- Tools for working with Optical Character Recognition output ☆16 · Updated 11 years ago
- Measure is scripts and conventions to build KPI dashboards for projects. ☆17 · Updated 4 years ago
- Machine assisted dossiers ☆19 · Updated 7 years ago
- How can we improve name matching in screening tools? ☆12 · Updated last month
- Next-gen web application for public finance data warehouses, formerly OpenSpending ☆57 · Updated 2 years ago
- A simple app to add OAuth-based authentication in front of an S3 bucket-based static website. ☆11 · Updated 2 years ago
- Dat python client ☆46 · Updated 8 years ago
- ☆23 · Updated 10 years ago
- Simple-to-use wrapper for accessing Google Spreadsheets in Python. ☆24 · Updated 10 years ago
- Python library and command line tool for converting data from one format to another ☆99 · Updated 4 years ago
- [DEPRECATED] Please use https://github.com/frictionlessdata/specs ☆17 · Updated 7 years ago
- Code for extracting data from a large number of PDFs, particularly FCC political ad documents ☆15 · Updated 7 years ago
- Scan a folder of document files of all types and extract the text into a CSV suitable for Overview ☆26 · Updated 8 years ago