leeper / data-versioning
Collecting thoughts about data versioning
☆108Updated 5 years ago
Alternatives and similar repositories for data-versioning:
Users that are interested in data-versioning are comparing it to the libraries listed below
- Tools for massively parallel and multi-variate data exploration☆39Updated 9 months ago
- See https://github.com/tworavens/tworavens for current repository for this project and http://2ra.vn for project pages.☆30Updated 6 years ago
- Codebase for DIVE backend (server, worker, and ORM)☆158Updated 2 years ago
- R client for the Enigma.io API - ABANDONED☆16Updated 7 years ago
- A simple command line interface to the datamade/dedupe library.☆42Updated 2 years ago
- ggplot2-inspired d3 app to make instant interactive visualizations☆55Updated 12 years ago
- A (comprehensive) collection of open source tools used by the data community.☆51Updated 9 years ago
- Data Science box: Spark, Jupyter, R+RStudio, Zeppelin, Python 2 & 3, Java, Scala.☆39Updated 6 years ago
- ☆66Updated 7 years ago
- The Fallacy of Placing Confidence in Confidence Intervals☆37Updated 9 years ago
- App for viewing visualizations created in Vega or Vega-lite☆88Updated 4 years ago
- Inspired by John Foreman. Created by the crowds.☆54Updated last year
- Google Container Engine, JupyterHub, and Jupyter for classroom scenarios☆59Updated 7 years ago
- Visual exploration of clustered data.☆46Updated 5 years ago
- Code for generating Value-Suppressing Uncertainty Palettes for use in D3 charts.☆77Updated 4 years ago
- beer recommendation engine project for Metis☆18Updated 2 years ago
- Tools, wrappers, etc... for data science with a concentration on text processing☆206Updated 2 years ago
- A Javascript-only data library providing functionality like DataFrame in Pandas or R. (Currently in research phase - does this already ex…☆12Updated 7 years ago
- Bayesian Weighting for De-Biasing Thematic Maps☆54Updated 3 years ago
- Sample repo for luigi tasks & config☆36Updated 8 years ago
- Material for some talks I have given☆62Updated 5 months ago
- A script for rapidly sampling a proportion of lines from a file☆19Updated 9 years ago
- Framework for processing data packages in pipelines of modular components.☆120Updated 3 weeks ago
- A Topic Modeling toolbox☆92Updated 8 years ago
- Deprecated version of Voyager 2 (in Angular), please use https://github.com/vega/voyager.☆33Updated 7 years ago
- Multidimensional data explorer and visualization tool.☆55Updated 7 years ago
- A bare-bones version of the scrollytelling framework used in the Algorithms Tour☆53Updated 4 years ago
- (Deprecated) Task for the Search & Discovery data analyst job.☆21Updated 9 years ago
- ⚕ Tutorials for public health crossfilter dashboards☆46Updated 7 years ago
- T4 is now in production as Quilt 3☆64Updated 5 years ago