alecstein / dolt_datascience
notebooks used to analysis projects
☆82Updated last year
Related projects: ⓘ
- ☆46Updated last year
- ☆116Updated last year
- Codd method-chained SQL generator and Pandas data processing in Python.☆114Updated 11 months ago
- ☆43Updated 2 years ago
- a model to generate estimates of the number of outstanding votes on an election night based on the current results of the race☆37Updated this week
- Linear regression in SQL using dbt☆64Updated 3 weeks ago
- A FiveThirtyEight/The Marshall Project effort to collect comprehensive data on police misconduct settlements from 2010-19.☆145Updated 2 years ago
- Validation tool to check output files required by the price-transparency-guide☆29Updated this week
- Tool for probabilistically linking the records of individual entities (e.g. people) within and across datasets☆107Updated 4 months ago
- Cohort extractor tool which can generate dummy data, or real data against OpenSAFELY-compliant research databases☆38Updated 3 weeks ago
- https://www.washingtonpost.com/graphics/2020/investigations/helicopter-protests-washington-dc-national-guard/☆23Updated 4 years ago
- Data exploration done quick.☆19Updated 3 years ago
- Scripts to make specific datasets cleaner and more convenient☆40Updated last year
- A maximum-strength name parser for record linkage.☆29Updated last month
- ☆61Updated 4 years ago
- General programming utilities from Pew Research Center☆69Updated 2 years ago
- Monitors the TSA Published Statistics, Downloads new PDF files and Saves as .json☆31Updated last month
- Federal Crime Data Standardization and Analysis — The Trace and BuzzFeed News☆35Updated 5 years ago
- ☆30Updated 7 months ago
- 2020-election-night-model☆58Updated 3 years ago
- Free, online book "Open Forensic Science in R." This book is for anyone looking to do forensic science analysis in a data-driven and open…☆40Updated 5 years ago
- A repo containing various data (demographics, employment, etc.) in JSON form.☆57Updated 2 years ago
- Source files for the Open, Transparent, and Reproducible Data Science Handbook☆48Updated 4 months ago
- Python package to assist working with Medicare data.☆14Updated last year
- The technical implementation guide for the tri-departmental price transparency rule.☆358Updated 4 months ago
- Add website scraping abilities to Datasette☆59Updated last year
- ☆56Updated this week
- How to use Jupyter notebooks and R markdown to create living documents and reproducible reports.☆49Updated last year
- A compute graph for loading and transforming OWID's data☆76Updated this week
- This connector is a dbt project that maps Medicare CCLF claims data to the Tuva Input Layer.☆12Updated 3 weeks ago