sensiblecodeio / databaker
Command line tool to convert spreadsheets to databases, made for the UK's Office for National Statistics.
☆78 · Updated 9 months ago
Related projects:
- Navigating around a grid of cells like XPath for spreadsheets; supports Python 3.5+ ☆47 · Updated last year
- A simple command line interface to the datamade/dedupe library. ☆42 · Updated last year
- R Shiny App created to predict the success rate of Freedom of Information Act requests. ☆16 · Updated 6 years ago
- https://www.washingtonpost.com/graphics/2020/investigations/helicopter-protests-washington-dc-national-guard/ ☆23 · Updated 4 years ago
- Core library for the datakit CLI framework. ☆53 · Updated last year
- Machine assisted dossiers ☆19 · Updated 6 years ago
- Basic cookiecutter template for Python projects ☆17 · Updated 8 months ago
- Schemas to convert common fixed-width file formats into CSV using in2csv. ☆123 · Updated 3 years ago
- Provide partial dates and retain the date precision through processing ☆13 · Updated last year
- Data on newspaper presidential endorsements ☆28 · Updated 3 years ago
- Parser and standardizer for politician, individual and organization names. ☆128 · Updated 7 years ago
- Data validation as a service. Project retired; go to the current one at frictionsless/repository ☆69 · Updated last year
- A maximum-strength name parser for record linkage. ☆29 · Updated last month
- A repository of materials for a proposed class on automated story bots. ☆50 · Updated 6 years ago
- Generate SQL tables, load and extract data, based on JSON Table Schema descriptors. ☆60 · Updated last year
- Framework for processing data packages in pipelines of modular components. ☆118 · Updated last year
- Data Pipes for CSV ☆117 · Updated last year
- A build tool for data projects. ☆48 · Updated 2 years ago
- Investigative tool for extracting relevant areas from many documents ☆14 · Updated 8 years ago
- Exploring sequential data with a sankey diagram ☆22 · Updated last year
- Python and pandas tools to perform various analyses on different types of word lists ☆16 · Updated 9 years ago
- Monitor datasets, get alerts when something happens ☆213 · Updated 5 years ago
- Tools for generating CSV and other flat versions of the structured data ☆103 · Updated this week
- Data and experiments with world population densities for comparison to addresses ☆13 · Updated 7 years ago
- Datakit plugin to help manage Github integration on data projects. ☆12 · Updated last year
- The fastest, cleanest, most reproducible ways to OCR a document. ☆27 · Updated 5 years ago
- Dexter document monitor for MMA ☆17 · Updated 4 months ago
- ☆37 · Updated this week
- The EU structural funds datasets on regional and national level (in progress). ☆26 · Updated last year
- Python library and command line tool for converting data from one format to another ☆100 · Updated 4 years ago