getindata / jupyter-images

Recipes for publicly available Jupyter images

☆8

Alternatives and similar repositories for jupyter-images

Users who are interested in jupyter-images are comparing it to the repositories listed below.

getindata / datapill
Big Data Newsletter
☆23 Updated last year
getindata / helm-charts
GetInData Helm Charts repository
☆12 Updated 2 years ago
nezihyigitbasi / FlinkParquet
Using the Parquet file format (with Avro) to process data with Apache Flink
☆14 Updated 9 years ago
Aiven-Open / guardian-for-apache-kafka
Set of tools for creating backups, compaction and restoration of Apache Kafka® Clusters
☆21 Updated this week
cloudera-labs / cloudera.cluster
An Ansible collection for lifecycle and management of Cloudera CDP Private Cloud resources on bare metal, IaaS, and PaaS.
☆34 Updated this week
davidgasquez / kubedbt
📆 Run, schedule, and manage your dbt jobs using Kubernetes.
☆24 Updated 6 years ago
bernhard-42 / pyspark-atlas
PySpark for ETL jobs including lineage to Apache Atlas in one script via code inspection
☆18 Updated 8 years ago
godatadriven-dockerhub / hive-metastore
Hadoop/Hive/Spark container to perform CI tests
☆11 Updated 4 years ago
joerg-schneider / airtunnel
The sane way of building a data layer in Airflow
☆24 Updated 5 years ago
cyanfr / dbvis_to_hortonworks_hiveserver2
Connect DBVisualizer to Hortonworks HiveServer2
☆9 Updated 10 years ago
justhackit / spark-utils
☆10 Updated 3 years ago
debezium / debezium-kubernetes
ARCHIVED: Run Debezium/KafkaConnect CDC components in Kubernetes
☆24 Updated 6 years ago
OneCricketeer / kafka-connect-sandbox
Kafka Connect playground
☆10 Updated 5 years ago
paypal / dione
Dione - a Spark and HDFS indexing library
☆52 Updated last year
astronomer / terraform-google-astronomer-gcp
Intended for internal use: deploys all infrastructure required for Astronomer to run on GCP
☆10 Updated 2 months ago
TorchAIKC / nifi-stateless-operator
An Operator for scheduling and executing NiFi Flows as Jobs on Kubernetes
☆53 Updated 5 years ago
tlepple / data_origination_workshop
Hands-on workshop with Iceberg, Redpanda, Debezium and Kafka-Connect
☆13 Updated 9 months ago
ExpediaGroup / shunting-yard
Shunting Yard is a real-time data replication tool that copies data between Hive Metastores.
☆20 Updated 3 years ago
tspannhw / stocks-nifi-kafka
Stocks -> NiFi -> Kafka -> Profit
☆14 Updated 6 years ago
anemos-io / protobeam
☆22 Updated 6 years ago
lresende / ansible-kubernetes-cluster
Ansible roles to deploy Kubernetes, JupyterHub, Jupyter Enterprise Gateway and Spark on Kubernetes cluster
☆38 Updated 4 years ago
ExpediaGroup / beekeeper
Service for automatically managing and cleaning up unreferenced data
☆46 Updated last week
BrooksIan / Flink2Kafka
A Flink application that demonstrates reading from and writing to Apache Kafka with Apache Flink
☆20 Updated last year
hortonworks / cloudbreak-images
Saltstack scripts to bake amazon/gcc/azure/openstack images suitable for Cloudbreak
☆14 Updated this week
AbsaOSS / spline-getting-started
☆25 Updated 10 months ago
cetic / fadi
FADI - Ingest, store and analyse big data flows
☆46 Updated last year
cloudera-labs / cloudera.cloud
cloudera.cloud - an Ansible collection for Cloudera Data Platform (CDP) for Public and Private Cloud
☆20 Updated 2 weeks ago
claudiofahey / isilon-hadoop-tools
Tools to deploy Hadoop on EMC Isilon
☆17 Updated 8 years ago
jupyterhub / jupyterhub-on-hadoop
Documentation and resources for deploying JupyterHub on Hadoop
☆19 Updated 5 years ago
youngwookim / awesome-presto
A curated list of awesome PrestoDB / Trino software, libraries, tools and resources
☆17 Updated 4 years ago