alecstein / dolt_datascience
Notebooks used for analysis projects
☆83 Updated 2 years ago
Alternatives and similar repositories for dolt_datascience:
Users that are interested in dolt_datascience are comparing it to the libraries listed below
- ☆48 Updated last year
- Linear regression in SQL using dbt ☆68 Updated 2 months ago
- Codd method-chained SQL generator and Pandas data processing in Python. ☆117 Updated last year
- ☆43 Updated 2 years ago
- ☆116 Updated last year
- Validation tool to check output files required by the price-transparency-guide ☆32 Updated 3 months ago
- Tracking PG&E outages ☆55 Updated 2 years ago
- Scripts to make specific datasets cleaner and more convenient ☆41 Updated 2 years ago
- Tool for probabilistically linking the records of individual entities (e.g. people) within and across datasets ☆111 Updated 3 months ago
- Data exploration done quick. ☆19 Updated 3 years ago
- Federal Crime Data Standardization and Analysis — The Trace and BuzzFeed News ☆35 Updated 6 years ago
- This connector is a dbt project that maps Medicare CCLF claims data to the Tuva Input Layer. ☆13 Updated 3 months ago
- 2020-election-night-model ☆58 Updated 4 years ago
- A repo containing various data (demographics, employment, etc.) in JSON form. ☆64 Updated 2 years ago
- Data and Codes for GroceryDB ☆139 Updated 2 months ago
- How to use Jupyter notebooks and R markdown to create living documents and reproducible reports. ☆49 Updated 2 years ago
- A Go program to split large JSON files into many jsonl files ☆62 Updated 2 years ago
- Load Illinois political contribution and spending data efficiently ☆56 Updated 3 years ago
- Scout is a data discovery tool to explore open data portals worldwide. ☆34 Updated 4 months ago
- Easy and flexible data contracts ☆125 Updated last month
- A maximum-strength name parser for record linkage. ☆36 Updated last month
- https://www.washingtonpost.com/graphics/2020/investigations/helicopter-protests-washington-dc-national-guard/ ☆23 Updated 4 years ago
- Cross-filter millions (or even billions) of data entries with no interaction delay ☆97 Updated last year
- Boring ML Generated Site ☆19 Updated 2 years ago
- Scrapers for disaster data - writes to https://github.com/simonw/disaster-data ☆49 Updated last year
- convert a scikit-learn decision tree into a Keras model ☆39 Updated last year
- Cohort extractor tool which can generate dummy data, or real data against OpenSAFELY-compliant research databases ☆38 Updated last month
- A serverless duckDB deployment at GCP ☆38 Updated 2 years ago
- Commuting zones are geographic areas where people live and work and are useful for understanding local economies, as well as how they dif… ☆40 Updated last year