cantabular / databaker
Command line tool to convert spreadsheets to databases, made for the UK's Office for National Statistics.
☆80Updated last year
Alternatives and similar repositories for databaker:
Users who are interested in databaker are comparing it to the libraries listed below
- Navigating around a grid of cells like XPath for spreadsheets; supports Python 3.5+☆48Updated 2 years ago
- A simple command line interface to the datamade/dedupe library.☆42Updated 2 years ago
- https://www.washingtonpost.com/graphics/2020/investigations/helicopter-protests-washington-dc-national-guard/☆23Updated 4 years ago
- Machine assisted dossiers☆19Updated 7 years ago
- Data Pipes for CSV☆116Updated 2 years ago
- Core library for the datakit CLI framework.☆55Updated 2 years ago
- ArchiveKit manages data and documents during ETL processes, either on a local file system or on S3.☆15Updated 10 years ago
- Data Driven Journalism Handbook☆21Updated 12 years ago
- Monitor datasets, get alerts when something happens☆210Updated 6 years ago
- A place to collect and share knowledge about liberating data from PDFs☆54Updated 3 years ago
- America's most comprehensive dictionary of campaign finance jargon. A free resource created by and for data journalists.☆17Updated last week
- A repository of journalists' lookup tables.☆106Updated 8 years ago
- Parser and standardizer for politician, individual and organization names.☆129Updated 7 years ago
- A collection of Jupyter notebooks demonstrating ways to analyze Census data☆51Updated 7 years ago
- An SQL loader for datasets published via Socrata☆29Updated 2 years ago
- Provide partial dates and retain the date precision through processing☆13Updated 2 years ago
- Python language parser for a tabular format for structured metadata. http://metatab.org☆18Updated last year
- A repository of materials for a proposed class on automated story bots.☆49Updated 6 years ago
- Basic cookiecutter template for Python projects☆21Updated 7 months ago
- Binary Python bindings for poppler utils for content extraction☆42Updated 3 years ago
- A template for open-source Python repositories☆23Updated last week
- 📒 Analyzing Data, the DataMade Way☆37Updated 4 years ago
- Demonstration of how dedupe might be used as geocoder☆17Updated 2 years ago
- Twitter, quick. Fetch and store tweets on short notice.☆80Updated 8 years ago
- DocumentCloud's back end source code - Please report bugs, issues and feature requests to info@documentcloud.org☆39Updated this week
- Tribe extracts a network from an email mbox and writes it to a graphml file for visualization and analysis.☆79Updated 2 years ago
- Data validation as a service. Project retired, go to the current one at frictionsless/repository☆69Updated 2 years ago
- A simple Python wrapper for U.S. Census Geocoding Services API batch service☆42Updated 5 months ago
- A tool to allow US addresses to be geocoded/georeferenced easily, without using Python or the command line or paid services or anything.☆18Updated 2 years ago
- View, visualize, clean and process data in the browser.☆148Updated 6 years ago