criteo / mlflow-yarnLinks
Backend implementation for running MLFlow projects on Hadoop/YARN.
☆11Updated 2 years ago
Alternatives and similar repositories for mlflow-yarn
Users that are interested in mlflow-yarn are comparing it to the libraries listed below
Sorting:
- Documentation and resources for deploying JupyterHub on Hadoop☆19Updated 6 years ago
- Sparrow is a boosting algorithm implementation that is optimized for training on very large datasets and/or in the limited memory setting…☆21Updated 4 years ago
- Spawn JupyterHub single user notebook servers in Hadoop/YARN containers.☆19Updated 4 months ago
- A Python package that parses sql and converts it to ibis expressions☆55Updated last year
- An extension for Jupyter Lab & Jupyter Notebook to monitor Apache Spark (pyspark) from notebooks☆55Updated 2 months ago
- Unified Distributed Execution☆56Updated 10 months ago
- Repository for makeinga a GitHub Actions for deploying to Kubeflow.☆35Updated 3 years ago
- The open-source Useful SDK. One python decorator in the Useful library allows for full observability of Python functions within an ETL.☆19Updated last year
- Unified slicing for all Python data structures.☆36Updated last month
- ☆37Updated 6 years ago
- Kedro Plugin to support running workflows on Kubeflow Pipelines☆55Updated 2 months ago
- A library on top of either pex or conda-pack to make your Python code easily available on a cluster☆45Updated this week
- Dockerized setup for testing code on realistic hadoop clusters☆27Updated 5 years ago
- Distributed persistent Task Queue running on Dask☆38Updated 2 years ago
- Package to extend Airflow functionality with CWL v1.0 support☆12Updated 6 years ago
- 🎯 aimrocks 🎸 — python & cython bindings for RocksDB. Batteries included! 🔋☆32Updated last month
- A conda-smithy repository for python-duckdb.☆13Updated 2 months ago
- IbisML is a library for building scalable ML pipelines using Ibis.☆115Updated last month
- The deepr module provide abstractions (layers, readers, prepro, metrics, config) to help build tensorflow models on top of tf estimators☆53Updated last year
- A JupyterLab extension providing, SQL formatter, auto-completion, syntax highlighting, Spark SQL and Trino☆89Updated last week
- ☆27Updated 2 years ago
- A ContentsManager wrapper for using multiple ContentsManager in Jupyter☆27Updated last year
- Mirror of Apache Arrow site☆37Updated this week
- Distributed XGBoost on Ray☆149Updated last year
- Spawn JupyterHub single-user servers with ssh☆27Updated 2 years ago
- Configurable event-logging for Jupyter applications and extensions.☆50Updated last year
- A software engineering framework to jump start your machine learning projects☆37Updated last year
- dask-pytorch-ddp is a Python package that makes it easy to train PyTorch models on dask clusters using distributed data parallel.☆59Updated 4 years ago
- ☆78Updated 4 years ago
- Ray-based Apache Beam runner☆41Updated 2 years ago