Docker-Hub-frolvlad / docker-alpine-python-machinelearning
Small Docker image with Python Machine Learning tools (~180MB) https://hub.docker.com/r/frolvlad/alpine-python-machinelearning/
☆80 Updated 3 weeks ago
Alternatives and similar repositories for docker-alpine-python-machinelearning:
Users who are interested in docker-alpine-python-machinelearning are comparing it to the repositories listed below:
- T4 is now in production as Quilt 3 ☆64 Updated 5 years ago
- Docker container to make running Luigi tasks real easy. ☆11 Updated 8 years ago
- Read and write Python objects to S3, caching them on your hard drive to avoid unnecessary IO. ☆24 Updated 7 years ago
- An opinionated template for spinning up a dask cluster based on docker. ☆14 Updated 7 years ago
- A project to help develop Luigi pipelines using Docker ✳️ ☆79 Updated 4 years ago
- CLI tool to launch Spark jobs on AWS EMR ☆67 Updated last year
- ☆54 Updated 7 years ago
- ☕⛵WIP PySpark dependency management ☆22 Updated 6 years ago
- Deprecated, please use https://github.com/jcrist/skein or https://github.com/dask/dask-yarn instead ☆52 Updated 6 years ago
- Scripts and instructions to facilitate running Deep Learning Tasks on Amazon EMR ☆63 Updated last year
- Example for an airflow plugin ☆49 Updated 8 years ago
- An extension for Jupyter notebooks that allows running notebooks inside a Docker container and converting them to runnable Docker images. ☆28 Updated last year
- Demonstration of using an Argo workflow for an ML application ☆28 Updated 6 years ago
- Example unit tests for Apache Spark Python scripts using the py.test framework ☆84 Updated 9 years ago
- PySpark for ETL jobs including lineage to Apache Atlas in one script via code inspection ☆18 Updated 8 years ago
- Python AWS Kinesis Producer with error handling and thread support. ☆45 Updated 2 years ago
- This repo demonstrates how to load a sample Parquet formatted file from an AWS S3 Bucket. A python job will then be submitted to a Apach… ☆19 Updated 8 years ago
- Pure Python wrapper for the Hadoop WebHDFS Rest API ☆52 Updated 4 years ago
- Create Parquet files from CSV ☆67 Updated 7 years ago
- An example to illustrate using Luigi to manage a data science workflow in Greenplum Database ☆12 Updated 6 years ago
- Deploy dask-distributed on google container engine using kubernetes ☆40 Updated 6 years ago
- Open source Flotilla ☆193 Updated this week
- Data science tool for creating and deploying pipelines with versioned data ☆45 Updated 9 months ago
- A single docker image that combines Neo4j Mazerunner and Apache Spark GraphX into a powerful all-in-one graph processing engine ☆46 Updated 5 years ago
- Functional Airflow DAG definitions. ☆38 Updated 7 years ago
- PySpark for Elastic Search ☆55 Updated 8 years ago
- tap-postgres ☆68 Updated 7 months ago
- Example stream processing job, written in Scala with Apache Beam, for Google Cloud Dataflow ☆30 Updated 8 years ago
- Required packages for using pandas in AWS Lambda functions ☆45 Updated 8 years ago
- Airflow plugin to transfer arbitrary files between operators ☆78 Updated 6 years ago