A short tutorial for data scientists on how to write tests for code + data.
☆121 · Aug 24, 2020 · Updated 5 years ago
Alternatives and similar repositories for data-testing-tutorial
Users interested in data-testing-tutorial are comparing it to the repositories listed below.
- Pandas in black and white: a collection of opinionated pandas flashcards ☆14 · Feb 15, 2019 · Updated 7 years ago
- ☆14 · Feb 8, 2016 · Updated 10 years ago
- Materials for dask talk at PyData NYC ☆15 · Nov 11, 2015 · Updated 10 years ago
- Doing Bayesian statistics in Python! ☆68 · Jan 31, 2018 · Updated 8 years ago
- ☆31 · Oct 20, 2015 · Updated 10 years ago
- Containing codes of participation in Kaggle competitions. ☆37 · Mar 7, 2016 · Updated 9 years ago
- A data science Python library aimed at adding fuzz, noise and other issues to your data for testing purposes. ☆30 · Mar 30, 2023 · Updated 2 years ago
- FDMNES ☆10 · Jan 28, 2021 · Updated 5 years ago
- An introduction to network analysis and applied graph theory using Python and NetworkX ☆1,098 · Feb 3, 2026 · Updated 3 weeks ago
- Analysis & Implementation of deep learning papers ☆12 · Jan 26, 2018 · Updated 8 years ago
- Up Your Bus Number: A Primer for Reproducible Data Science ☆69 · Feb 23, 2019 · Updated 7 years ago
- A library for defensive data analysis. ☆502 · Jan 6, 2020 · Updated 6 years ago
- ☆11 · Jul 8, 2015 · Updated 10 years ago
- Pandas pipeline in graphviz ☆11 · Mar 28, 2022 · Updated 3 years ago
- Notes and code from a network biology study group at UMD. ☆10 · Apr 11, 2015 · Updated 10 years ago
- ARCHIVED 'NEON' 'API' Client ☆11 · May 10, 2022 · Updated 3 years ago
- common data analysis and machine learning tasks using python ☆33 · May 18, 2016 · Updated 9 years ago
- The easiest way to integrate Kedro and Great Expectations ☆54 · Dec 26, 2022 · Updated 3 years ago
- Birgitta is a Python ETL test and schema framework, providing automated tests for pyspark notebooks/recipes. ☆14 · Nov 9, 2023 · Updated 2 years ago
- Code for the Kaggle Marinexplore challenge ☆17 · Apr 8, 2013 · Updated 12 years ago
- Tutorial session from PyData London, Fri 6 May 2016 ☆11 · May 6, 2016 · Updated 9 years ago
- Trigger the Google Genomics Pipeline API with CWL ☆11 · Feb 7, 2017 · Updated 9 years ago
- R package experiment ☆14 · Apr 8, 2022 · Updated 3 years ago
- NetworkL is a Python package which extends the scope of the NetworkX package to (L)arge time-varying graphs. It supports the manipulation… ☆28 · May 11, 2021 · Updated 4 years ago
- Getting a better understanding of Black-Litterman and how Betterment manages my ETF portfolio. ☆14 · Jul 29, 2015 · Updated 10 years ago
- Rstudio addin for version control and assignment management using git ☆18 · Apr 7, 2025 · Updated 10 months ago
- Cookiecutter Dash is a tool to start developing Dash apps quickly. ☆17 · Dec 17, 2018 · Updated 7 years ago
- A machine learning toolkit for economists ☆18 · Dec 4, 2015 · Updated 10 years ago
- Intro to Testing in Data Science Tutorial ☆36 · Apr 1, 2022 · Updated 3 years ago
- pybroom, the python's broom to tidy up messy fit results! ☆14 · Jun 21, 2017 · Updated 8 years ago
- Dataclass with data validation. Checks the value of its fields by their annotations. ☆13 · Jan 7, 2021 · Updated 5 years ago
- Presentations from meetups and conferences ☆18 · Sep 4, 2020 · Updated 5 years ago
- Software Lab for Advanced Machine Learning with Stochastic Algorithms in Julia ☆14 · Dec 21, 2017 · Updated 8 years ago
- Building Python Data Applications with Blaze and Bokeh Tutorial, SciPy 2015 ☆144 · Jul 8, 2015 · Updated 10 years ago
- Build intelligent data-driven applications with minimal effort. Sentence Clustering, Topics Extraction, Text Similarity, Opinion Summariz… ☆41 · Oct 22, 2019 · Updated 6 years ago
- Tools for Educators Writing Assignments in RMarkdown ☆41 · Nov 14, 2023 · Updated 2 years ago
- Amazon access control challenge ☆25 · Jun 21, 2014 · Updated 11 years ago
- PyMix - The Python mixture package ☆16 · Nov 9, 2015 · Updated 10 years ago
- Materials for the Wednesday Afternoon Machine Learning workshop ☆19 · Jun 5, 2015 · Updated 10 years ago