hackalog / easydata
A flexible template for doing reproducible data science in Python.
☆109 Updated 9 months ago
Alternatives and similar repositories for easydata:
Users who are interested in easydata compare it to the libraries listed below.
- Up Your Bus Number: A Primer for Reproducible Data Science ☆68 Updated 6 years ago
- Dockerized ML Cookiecutter ☆72 Updated 2 years ago
- Bulwark is a package for convenient property-based testing of pandas dataframes. ☆224 Updated 4 years ago
- Automatically export Jupyter notebooks to various file formats (.py, .html, and more) on save. ☆75 Updated last year
- 💫 PyScaffold extension for data-science projects ☆157 Updated last week
- Marshmallow Schema generator for Pandas DataFrames ☆24 Updated 4 years ago
- GitHub Action for testing notebooks ☆152 Updated 3 years ago
- A library that unifies the API for most commonly used libraries and modeling techniques for time-series forecasting in the Python ecosyst… ☆151 Updated last year
- Dockerfiles for images used as part of the Orbyter toolset ☆44 Updated 9 months ago
- Summarise and explore Pandas DataFrames ☆99 Updated 4 years ago
- Start a data science project with modern tools ☆191 Updated last year
- JupyterLab extension to create GitHub commits & pull requests ☆117 Updated 8 months ago
- It's all in the name ☆76 Updated last year
- nbconflux converts Jupyter Notebooks to Atlassian Confluence pages ☆125 Updated 9 months ago
- stratx is a library for A Stratification Approach to Partial Dependence for Codependent Variables ☆65 Updated 10 months ago
- Data Exploration in PySpark made easy - Pyspark_dist_explore provides methods to get fast insights in your Spark DataFrames. ☆103 Updated 5 years ago
- A short tutorial for data scientists on how to write tests for code + data. ☆119 Updated 4 years ago
- Data exploration library with a pandas-like API ☆74 Updated 4 years ago
- Tutorial covering a new workflow available going from pandas to scikit-learn ☆40 Updated 2 years ago
- The easy way to write your own flavor of Pandas ☆301 Updated 3 weeks ago
- Repository with code, notebook and slides for my talk at PyConDE & PyData Berlin 2019 ☆36 Updated 2 years ago
- vtreat is a data frame processor/conditioner that prepares real-world data for predictive modeling in a statistically sound manner. Distr… ☆121 Updated 2 months ago
- Fuzzy joins for python pandas - easily join different datasets ☆59 Updated 4 years ago
- A web frontend for scheduling Jupyter notebook reports ☆252 Updated 3 months ago
- Primrose modeling framework for simple production models ☆33 Updated 11 months ago
- Cloud provider cluster managers for Dask. Supports AWS, Google Cloud, Azure and more... ☆138 Updated last month
- Notebooks that support blog posts and tech talks on Dask / Coiled. ☆47 Updated last week
- A toolbox 🧰 for Jupyter notebooks 📙: testing, experiment tracking, debugging, profiling, and more! ☆68 Updated 5 months ago
- Materials for "Parallelizing Scientific Python with Dask" ☆70 Updated 6 years ago
- The goal of pandas-log is to provide feedback about basic pandas operations. It provides simple wrapper functions for the most common fun… ☆215 Updated 3 years ago