leeper / data-versioningLinks
Collecting thoughts about data versioning
☆108Updated 6 years ago
Alternatives and similar repositories for data-versioning
Users that are interested in data-versioning are comparing it to the libraries listed below
Sorting:
- Codebase for DIVE backend (server, worker, and ORM)☆158Updated 3 years ago
- Framework for processing data packages in pipelines of modular components.☆123Updated 6 months ago
- Multidimensional data explorer and visualization tool.☆56Updated 8 years ago
- A simple command line interface to the datamade/dedupe library.☆43Updated 3 years ago
- A framework (comand line tool + libraries) for creating flexible compute pipelines☆56Updated 4 years ago
- The Fallacy of Placing Confidence in Confidence Intervals☆37Updated 10 years ago
- Data Pipes for CSV☆115Updated 2 years ago
- Data validation as a service. Project retired, got to the current one at frictionsless/repository☆69Updated 3 years ago
- Text Thresher crowd sourced text annotator☆17Updated 8 years ago
- A Python library for working with Data Packages.☆191Updated last year
- Data Server for Topic Models☆122Updated 2 years ago
- A Jupyter Lab extension for rendering tabular data☆35Updated 7 years ago
- Generate SQL tables, load and extract data, based on JSON Table Schema descriptors.☆62Updated 2 years ago
- A Topic Modeling toolbox☆92Updated 9 years ago
- Code for Pythonic visualization blog post☆40Updated 8 years ago
- Git Wrapper for Dataset Management☆15Updated 2 years ago
- Extract place names from a URL or text, and add context to those names -- for example distinguishing between a country, region or city.☆62Updated 9 years ago
- Tools to download and process name data from various sources.☆91Updated 12 years ago
- Quick informal survey at the Los Angeles Machine learning meetup about tools used for machine learning.☆51Updated 10 years ago
- A (comprehensive) collection of open source tools used by the data community.☆52Updated 10 years ago
- Material for some talks I have given☆62Updated last year
- ☆46Updated 5 months ago
- PDF and python files for creating time maps and downloading tweets☆59Updated 5 years ago
- DIT4C is a platform for hosting data analysis tools "in the cloud" using containers.☆40Updated 8 years ago
- Supervised learning for novelty detection in text☆78Updated 9 years ago
- Algorithm's team Jupyter Notebooks☆113Updated 7 months ago
- ☆67Updated 8 years ago
- Deprecated version of Voyager 2 (in Angular), please use https://github.com/vega/voyager.☆32Updated 8 years ago
- Inspired by John Foreman. Created by the crowds.☆54Updated 2 years ago
- An analysis of all 1.3 million public Jupyter Notebooks on Github in July 2017☆72Updated 7 years ago