leeper / data-versioningLinks
Collecting thoughts about data versioning
☆108Updated 6 years ago
Alternatives and similar repositories for data-versioning
Users that are interested in data-versioning are comparing it to the libraries listed below
Sorting:
- Codebase for DIVE backend (server, worker, and ORM)☆158Updated 2 years ago
- Material for some talks I have given☆62Updated 11 months ago
- Data Server for Topic Models☆121Updated 2 years ago
- A Topic Modeling toolbox☆92Updated 9 years ago
- Docker images for data science from Wise.io☆50Updated 9 years ago
- A simple command line interface to the datamade/dedupe library.☆42Updated 2 years ago
- ☆67Updated 8 years ago
- Open source Flotilla☆195Updated last week
- Extract place names from a URL or text, and add context to those names -- for example distinguishing between a country, region or city.☆62Updated 8 years ago
- Multidimensional data explorer and visualization tool.☆56Updated 8 years ago
- Codebase for DIVE SPA using React and Redux☆167Updated 7 years ago
- A script for rapidly sampling a proportion of lines from a file☆19Updated 10 years ago
- Deprecated version of Voyager 2 (in Angular), please use https://github.com/vega/voyager.☆32Updated 7 years ago
- Qualitative visualization of the data types of CSV files☆257Updated 10 years ago
- Algorithm's team Jupyter Notebooks☆113Updated 3 months ago
- A (comprehensive) collection of open source tools used by the data community.☆52Updated 9 years ago
- Quick informal survey at the Los Angeles Machine learning meetup about tools used for machine learning.☆51Updated 10 years ago
- A browser based R Notebook☆125Updated 12 years ago
- Google Container Engine, JupyterHub, and Jupyter for classroom scenarios☆59Updated 7 years ago
- Text Thresher crowd sourced text annotator☆17Updated 7 years ago
- ☆46Updated last month
- How data science is woven into the fabric of Stitch Fix☆169Updated 7 months ago
- An interactive tool for exploring large, tabular datasets.☆338Updated 6 years ago
- Inspired by John Foreman. Created by the crowds.☆54Updated last year
- Framework for processing data packages in pipelines of modular components.☆121Updated 2 months ago
- A Jupyter Lab extension for rendering tabular data☆35Updated 7 years ago
- T4 is now in production as Quilt 3☆64Updated 6 years ago
- The Fallacy of Placing Confidence in Confidence Intervals☆37Updated 9 years ago
- Modeling Social Data, Applied Mathematics, Columbia University (Spring 2015)☆33Updated 6 years ago
- Data Pipes for CSV☆116Updated 2 years ago