leeper / data-versioning
Collecting thoughts about data versioning
☆108 · Updated 5 years ago
Alternatives and similar repositories for data-versioning:
Users who are interested in data-versioning are comparing it to the repositories listed below.
- See https://github.com/tworavens/tworavens for current repository for this project and http://2ra.vn for project pages. ☆30 · Updated 6 years ago
- Google Container Engine, JupyterHub, and Jupyter for classroom scenarios ☆59 · Updated 7 years ago
- Tools for massively parallel and multi-variate data exploration ☆39 · Updated 8 months ago
- A Jupyter Lab extension for rendering tabular data ☆35 · Updated 6 years ago
- Git Wrapper for Dataset Management ☆15 · Updated last year
- A Topic Modeling toolbox ☆92 · Updated 8 years ago
- Algorithm's team Jupyter Notebooks ☆113 · Updated 8 years ago
- Web application for creating publication ready charts and graphs ☆56 · Updated 6 years ago
- A Python library for working with Data Packages. ☆191 · Updated 10 months ago
- Simple validator for submissions to DrivenData competitions ☆19 · Updated 5 years ago
- Bayesian Weighting for De-Biasing Thematic Maps ☆54 · Updated 3 years ago
- Extract place names from a URL or text, and add context to those names -- for example distinguishing between a country, region or city. ☆62 · Updated 8 years ago
- Deprecated version of Voyager 2 (in Angular), please use https://github.com/vega/voyager. ☆33 · Updated 7 years ago
- Material for some talks I have given ☆62 · Updated 4 months ago
- A Binder-compatible repo with a requirements.txt file ☆26 · Updated 7 years ago
- Quick informal survey at the Los Angeles Machine learning meetup about tools used for machine learning. ☆51 · Updated 9 years ago
- NLP pipeline software using common workflow language ☆34 · Updated 5 years ago
- Repo for experiments on pyspark and sklearn ☆79 · Updated 10 years ago
- My talk at Strata 2014 in Santa Clara, CA ☆73 · Updated 10 years ago
- Code for Pythonic visualization blog post ☆40 · Updated 7 years ago
- Tools, wrappers, etc... for data science with a concentration on text processing ☆206 · Updated 2 years ago
- Multidimensional data explorer and visualization tool. ☆55 · Updated 7 years ago
- Visual exploration of clustered data. ☆46 · Updated 4 years ago
- Deprecated, please use https://github.com/jcrist/skein or https://github.com/dask/dask-yarn instead ☆52 · Updated 6 years ago
- The Fallacy of Placing Confidence in Confidence Intervals ☆37 · Updated 9 years ago
- Autoencoders to find structure in arbitrary datasets ☆123 · Updated 9 years ago
- A repository for the "Combining DBpedia and Topic Modeling" GSoC 2016 idea ☆13 · Updated 8 years ago
- ggplot2-inspired d3 app to make instant interactive visualizations ☆55 · Updated 12 years ago
- Correlation matrix with scatter plot using d3.js ☆19 · Updated 10 years ago
- Docker container with a PyData stack and JupyterHub server ☆37 · Updated 8 years ago