leeper / data-versioning
Collecting thoughts about data versioning
☆108Updated 6 years ago
Alternatives and similar repositories for data-versioning
Users who are interested in data-versioning are comparing it to the libraries listed below
- Material for some talks I have given☆61Updated last year
- Data Server for Topic Models☆122Updated 2 years ago
- Docker images for data science from Wise.io☆51Updated 9 years ago
- Code for Pythonic visualization blog post☆40Updated 8 years ago
- The Fallacy of Placing Confidence in Confidence Intervals☆37Updated 10 years ago
- Codebase for DIVE backend (server, worker, and ORM)☆158Updated 3 years ago
- A Topic Modeling toolbox☆92Updated 9 years ago
- Extract place names from a URL or text, and add context to those names -- for example distinguishing between a country, region or city.☆62Updated 9 years ago
- Data Science box: Spark, Jupyter, R+RStudio, Zeppelin, Python 2 & 3, Java, Scala.☆39Updated 7 years ago
- Deprecated version of Voyager 2 (in Angular), please use https://github.com/vega/voyager.☆32Updated 8 years ago
- Quick informal survey at the Los Angeles Machine learning meetup about tools used for machine learning.☆51Updated 10 years ago
- Multidimensional data explorer and visualization tool.☆56Updated 8 years ago
- ggplot2-inspired d3 app to make instant interactive visualizations☆55Updated 13 years ago
- ☆67Updated 8 years ago
- A (comprehensive) collection of open source tools used by the data community.☆52Updated 10 years ago
- ☆46Updated 6 months ago
- Text Thresher crowd sourced text annotator☆17Updated 8 years ago
- A curated list of awesome Machine Learning frameworks, libraries and software.☆20Updated 10 years ago
- See https://github.com/tworavens/tworavens for current repository for this project and http://2ra.vn for project pages.☆29Updated 7 years ago
- A simple command line interface to the datamade/dedupe library.☆43Updated 3 years ago
- Randomly sample lines from a csv, tsv, or other line-based data file☆125Updated 10 years ago
- Data for the 2014 Ebola outbreak in West Africa☆265Updated 7 years ago
- Git Wrapper for Dataset Management☆15Updated 2 years ago
- A browser based R Notebook☆125Updated 12 years ago
- Supervised learning for novelty detection in text☆78Updated 9 years ago
- PDF and python files for creating time maps and downloading tweets☆59Updated 5 years ago
- A framework (command line tool + libraries) for creating flexible compute pipelines☆56Updated 5 years ago
- Command line tool to convert spreadsheets to databases, made for the UK's Office for National Statistics.☆81Updated 2 years ago
- A simple dataset of Stack Overflow questions and tags☆109Updated 8 years ago
- ☆24Updated 7 years ago