Using python3.6 alpine base image adds java,pandas, numpy,pyspark and spark as rundeps. This image can be used as container image when you run spark-submit on k8.
β12Nov 11, 2022Updated 3 years ago
Alternatives and similar repositories for alpine-python3-numpy-pandas-sparkContainer-spark-submit
Users that are interested in alpine-python3-numpy-pandas-sparkContainer-spark-submit are comparing it to the libraries listed below
Sorting:
- π₯ͺπΎ A sample of data from the `jaffle-shop-generator` that powers the Jaffle Shop spanning one year.β15Jan 23, 2025Updated last year
- My Webinar at ODSC, Boston 2019 for humansβ10Mar 27, 2019Updated 6 years ago
- β10Apr 25, 2021Updated 4 years ago
- Playground site for creating/validating data contractsβ11Aug 9, 2025Updated 6 months ago
- prebuilt configurations for docker-rpm-builderβ11Feb 5, 2021Updated 5 years ago
- The Data Product Specificationβ11Jan 28, 2025Updated last year
- Architecture principlesβ13May 23, 2025Updated 9 months ago
- Github action for running python unit testsβ10Jun 16, 2025Updated 8 months ago
- Manage Unity Catalog tables with Pydantic Modelsβ10Mar 5, 2025Updated 11 months ago
- A collection of CMake modules to simplify the development of Boost libraries.β10Apr 16, 2012Updated 13 years ago
- Setup Apache Airflow on Kubernetesβ10Jul 20, 2018Updated 7 years ago
- β13Nov 14, 2013Updated 12 years ago
- A web application made using Python 3, Django 2, Bootstrap and REST API. It's website about technology where user can find interesting neβ¦β12Dec 8, 2022Updated 3 years ago
- Databricks dbt factory library for creating Databricks Job definition where individual dbt models are run as separate tasks.β20Jul 13, 2025Updated 7 months ago
- An implementation of Dijkstra in Clojureβ19Aug 7, 2012Updated 13 years ago
- β12Updated this week
- Extremely low-level wrapper to the MediaWiki APIβ27Mar 15, 2017Updated 8 years ago
- β11Feb 14, 2020Updated 6 years ago
- Recommendation System for Animeβ11Apr 15, 2024Updated last year
- Fixed-width data source for Spark SQL and DataFramesβ10Oct 25, 2016Updated 9 years ago
- Python solutions to problems posted on http://codility.com/β11Nov 13, 2013Updated 12 years ago
- β12Aug 9, 2024Updated last year
- learning-by-doing data model built with dbt-coreβ15Dec 13, 2025Updated 2 months ago
- β11Jul 20, 2023Updated 2 years ago
- Docker image for a Python installation with Spark, Hadoop and Sqoop binariesβ15Jan 26, 2018Updated 8 years ago
- β12Sep 26, 2019Updated 6 years ago
- PredictHQβs Data Science documentationβ14Feb 1, 2026Updated last month
- Knowledge sharing - Cheat sheetsβ20Feb 22, 2026Updated last week
- Udacity Data Engineer Nanodegree - Capstone projectβ11Dec 19, 2019Updated 6 years ago
- An sbt plugin to resolve dependencies using Aetherβ13Apr 10, 2025Updated 10 months ago
- Object Oriented Programming using Python and C++β13Oct 20, 2021Updated 4 years ago
- Deep Learning Udacity Nanodegree - SageMaker Deployment of a Sentiment Analysis modelβ10Apr 14, 2019Updated 6 years ago
- A web app that uses logarithmic regression to predict the outcome of tennis matches. Built with Python's Scikit-learn package and Flaskβ11Jan 3, 2015Updated 11 years ago
- nuclio integration and demos with NVIDIA RAPIDSβ13Feb 2, 2020Updated 6 years ago
- emacs customizations for stevejβ35Jun 8, 2015Updated 10 years ago
- CentOS docker images, build weekly with latest security updatesβ11Feb 23, 2026Updated last week
- Repo will try to cover all the most frequently used ML algos with proper explanation and examplesβ10Apr 14, 2019Updated 6 years ago
- Trino Iceberg Metadata Insights via Streamlitβ15Apr 9, 2025Updated 10 months ago
- MLeap demo repository for use with MLeap blog postsβ11Jul 13, 2016Updated 9 years ago