hackalog / easydata
A flexible template for doing reproducible data science in Python.
☆109 · Updated 10 months ago
Alternatives and similar repositories for easydata:
Users who are interested in easydata are comparing it to the repositories listed below:
- Up Your Bus Number: A Primer for Reproducible Data Science ☆68 · Updated 6 years ago
- 💫 PyScaffold extension for data-science projects ☆157 · Updated 2 weeks ago
- Bulwark is a package for convenient property-based testing of pandas dataframes. ☆224 · Updated 4 years ago
- nbconflux converts Jupyter Notebooks to Atlassian Confluence pages ☆125 · Updated 9 months ago
- Dockerized ML Cookiecutter ☆72 · Updated 2 years ago
- Marshmallow Schema generator for Pandas DataFrames ☆24 · Updated 4 years ago
- Automatically export Jupyter notebooks to various file formats (.py, .html, and more) on save. ☆75 · Updated last year
- A short tutorial for data scientists on how to write tests for code + data. ☆119 · Updated 4 years ago
- Summarise and explore Pandas DataFrames ☆99 · Updated 4 years ago
- A toolbox 🧰 for Jupyter notebooks 📙: testing, experiment tracking, debugging, profiling, and more! ☆68 · Updated 6 months ago
- Reference package for unit tests ☆49 · Updated 6 years ago
- Tries to shrink your Pandas column dtypes with no data loss so you have more spare RAM ☆83 · Updated last year
- The easy way to write your own flavor of Pandas ☆301 · Updated last month
- Fuzzy joins for python pandas - easily join different datasets ☆59 · Updated 4 years ago
- Start a data science project with modern tools ☆192 · Updated last year
- GitHub Action for testing notebooks ☆152 · Updated 3 years ago
- A web frontend for scheduling Jupyter notebook reports ☆252 · Updated 3 months ago
- Dockerfiles for images used as part of the Orbyter toolset ☆44 · Updated 10 months ago
- Tutorial for implementing data validation in data science pipelines ☆33 · Updated 2 years ago
- Automated Exploratory Data Analysis. Simplifying Data Exploration ☆34 · Updated 4 years ago
- Dask tutorial material for video tutorial series ☆87 · Updated last year
- Notebooks that support blog posts and tech talks on Dask / Coiled. ☆47 · Updated 2 weeks ago
- Pandas ExtensionDType/Array backed by Apache Arrow ☆229 · Updated 2 years ago
- Automated Jupyter notebook testing. 📙 ☆41 · Updated last year
- The easiest way to integrate Kedro and Great Expectations ☆53 · Updated 2 years ago
- A kedro-plugin for integration of mlflow capabilities inside kedro projects (especially machine learning model versioning and packaging) ☆210 · Updated last week
- Kedro Plugin to support running workflows on Kubeflow Pipelines ☆53 · Updated 6 months ago
- Data exploration library with a pandas-like API ☆74 · Updated 4 years ago
- Templates for jupyter notebooks ☆142 · Updated last year
- Kedro-Accelerator speeds up pipelines by parallelizing I/O in the background. ☆35 · Updated 2 years ago