leeper / data-versioning
Collecting thoughts about data versioning
☆108Updated 6 years ago
Alternatives and similar repositories for data-versioning
Users that are interested in data-versioning are comparing it to the libraries listed below
- Codebase for DIVE backend (server, worker, and ORM)☆158Updated 2 years ago
- Framework for processing data packages in pipelines of modular components.☆122Updated 5 months ago
- Deprecated version of Voyager 2 (in Angular), please use https://github.com/vega/voyager.☆32Updated 8 years ago
- Data validation as a service. Project retired, go to the current one at frictionsless/repository☆69Updated 2 years ago
- Extract place names from a URL or text, and add context to those names -- for example distinguishing between a country, region or city.☆62Updated 8 years ago
- Material for some talks I have given☆62Updated last year
- A simple command line interface to the datamade/dedupe library.☆42Updated 2 years ago
- Multidimensional data explorer and visualization tool.☆56Updated 8 years ago
- Codebase for DIVE SPA using React and Redux☆167Updated 7 years ago
- A Jupyter Lab extension for rendering tabular data☆35Updated 7 years ago
- App for viewing visualizations created in Vega or Vega-lite☆88Updated 5 years ago
- Open source Flotilla☆197Updated last week
- Run IPython, Pattern, NLTK, Pandas, NumPy, SciPy, Numba, Biopython inside Docker☆47Updated 11 years ago
- Quick informal survey at the Los Angeles Machine learning meetup about tools used for machine learning.☆51Updated 10 years ago
- DIT4C is a platform for hosting data analysis tools "in the cloud" using containers.☆40Updated 8 years ago
- Data Server for Topic Models☆122Updated 2 years ago
- An analysis of all 1.3 million public Jupyter Notebooks on Github in July 2017☆72Updated 7 years ago
- Tools to download and process name data from various sources.☆92Updated 12 years ago
- A Python library for working with Data Packages.☆191Updated last year
- A framework (command line tool + libraries) for creating flexible compute pipelines☆56Updated 4 years ago
- The Fallacy of Placing Confidence in Confidence Intervals☆37Updated 10 years ago
- Command line tool to convert spreadsheets to databases, made for the UK's Office for National Statistics.☆80Updated last year
- Qualitative visualization of the data types of CSV files☆258Updated 11 years ago
- See https://github.com/tworavens/tworavens for current repository for this project and http://2ra.vn for project pages.☆29Updated 7 years ago
- Data Pipes for CSV☆115Updated 2 years ago
- ☆67Updated 8 years ago
- Generate SQL tables, load and extract data, based on JSON Table Schema descriptors.☆62Updated 2 years ago
- T4 is now in production as Quilt 3☆64Updated 6 years ago
- Data Science box: Spark, Jupyter, R+RStudio, Zeppelin, Python 2 & 3, Java, Scala.☆39Updated 7 years ago
- Git Wrapper for Dataset Management☆15Updated 2 years ago