backtick-se / cowait
Containerized distributed programming framework for Python
☆53Updated 2 years ago
Alternatives and similar repositories for cowait
Users that are interested in cowait are comparing it to the libraries listed below
- Coming soon☆61Updated last year
- Repository for making a GitHub Action for deploying to Kubeflow.☆35Updated 3 years ago
- Set-oriented Operations in Pandas☆24Updated 5 years ago
- Batteries included toolkit for data engineering.☆34Updated 6 months ago
- Dask and Spark interactions☆21Updated 8 years ago
- ☁️ Export Ploomber pipelines to Kubernetes (Argo), Airflow, AWS Batch, SLURM, and Kubeflow.☆45Updated 3 months ago
- Arrow, pydantic style☆83Updated 2 years ago
- A utility tool to automate certain tasks with Jupyter notebooks.☆9Updated last year
- ☕⛵WIP PySpark dependency management☆22Updated 6 years ago
- Slideshow template for Voilà based on RevealJS☆16Updated 3 years ago
- This repository is no longer maintained.☆25Updated 3 years ago
- Tools for MLflow☆37Updated last year
- Python binding for DataFusion☆59Updated 2 years ago
- Demonstration of using an Argo workflow for an ML application☆28Updated 6 years ago
- hooqu is a library built on top of Pandas-like Dataframes for defining "unit tests for data". This is a spiritual port of Apache Deequ to…☆29Updated 6 months ago
- A Delta Lake reader for Dask☆53Updated this week
- A helm chart for Prefect☆14Updated 5 years ago
- Distributed Bayesian Entity Resolution in Apache Spark☆57Updated 4 years ago
- 🚕 Self-contained demo using Redpanda, Materialize, River, Redis, and Streamlit to predict taxi trip durations☆45Updated 2 years ago
- A Python object protocol for projects to interchange data frame-like data without forcing pandas.DataFrame as the intermediary☆15Updated 5 years ago
- ☆29Updated last year
- Pandas Msgpack☆23Updated 2 years ago
- A framework for piping in python.☆73Updated 5 years ago
- general functions for your data .pipe()-lines.☆17Updated last year
- Notes and samples for Python performance talk☆9Updated 3 years ago
- Public repository for versioning machine learning data☆42Updated 3 years ago
- Comparisons of big data technologies for cleaning, manipulating, and generally wrangling data for the purpose of analysis and machine learning.☆65Updated 5 years ago
- Parametrize and run scripts as notebooks with jupytext and papermill☆17Updated 5 years ago
- Convert monolithic Jupyter notebooks 📙 into maintainable Ploomber pipelines. 📊☆78Updated 9 months ago
- Cloud provider cluster managers for Dask. Supports AWS, Google Cloud, Azure and more...☆143Updated last week