A workspace to experiment with Apache Spark, Livy, and Airflow in a Docker environment.
☆38Mar 29, 2021Updated 5 years ago
Alternatives and similar repositories for spark-livy-on-airflow-workspace
Users that are interested in spark-livy-on-airflow-workspace are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Spark Standalone & Livy☆11Jul 13, 2021Updated 4 years ago
- Docker with Airflow and Spark standalone cluster☆263Aug 5, 2023Updated 2 years ago
- Docker with Airflow + Postgres + Spark cluster + JDK (spark-submit support) + Jupyter Notebooks☆24Apr 2, 2022Updated 4 years ago
- Dockerizing and Consuming an Apache Livy environment☆13Jun 29, 2022Updated 3 years ago
- Simulating safety and non-safety messages in IEEE 1609.4. Tech Stack : Linux 12.04, Omnet++ 4.6, SUMO 0.22.0, Veins 4 alpha 2, Inet 2.5☆12Jul 19, 2017Updated 8 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆41Jan 24, 2023Updated 3 years ago
- Upload watson time logs to Jira Tempo worklogs☆10Feb 15, 2023Updated 3 years ago
- Implementation of an ETL process for real-time sentiment analysis of tweets with Docker, Apache Kafka, Spark Streaming, MongoDB and Delta…☆19May 6, 2023Updated 2 years ago
- Apache Spark enhanced with native Kubernetes scheduler back-end☆15Aug 21, 2023Updated 2 years ago
- A GitHub repo with materials for preparing for DP-420: Microsoft Certified: Azure Cosmos DB Developer Specialty certification Exam.☆17Jul 16, 2024Updated last year
- ☆24Dec 21, 2020Updated 5 years ago
- Apache Spark based command line tools for ElasticSearch☆10Sep 20, 2021Updated 4 years ago
- Presenting 3 ways to run Spark over containers, this project is recommended to those who seek to explore Big Data out of a Hadoop Cluster…☆11Nov 25, 2020Updated 5 years ago
- Building a Modern Data Lake with Minio, Spark, Airflow via Docker.☆23May 11, 2024Updated last year
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- Prova de conceito - Springboot, Java, Schema Registry, Apache Avro e Apache Kafka .☆14Apr 18, 2023Updated 2 years ago
- Learn Apache Spark in Scala, Python (PySpark) and R (SparkR) by building your own cluster with a JupyterLab interface on Docker.☆507Nov 7, 2025Updated 5 months ago
- A batch processing data pipeline, using AWS resources (S3, EMR, Redshift, EC2, IAM), provisioned via Terraform, and orchestrated from loc…☆23May 14, 2022Updated 3 years ago
- A simple spark standalone cluster for your testing environment purposses☆568Mar 6, 2024Updated 2 years ago
- Online Anomaly Detection for HPC Performance Data☆11Jun 25, 2018Updated 7 years ago
- Rest API for Todobackend on top of Cassandra☆26Feb 22, 2023Updated 3 years ago
- Example of orchestrating dependent Databricks jobs using Airflow☆11Dec 19, 2019Updated 6 years ago
- Vault Plugin: Google Cloud Platform CA Service☆17Jul 20, 2021Updated 4 years ago
- Project files for the post: Running PySpark Applications on Amazon EMR using Apache Airflow: Using the new Amazon Managed Workflows for A…☆41Jul 6, 2022Updated 3 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Multilable classification of legal documents (Eur-Lex)☆14Apr 9, 2021Updated 5 years ago
- Docker image to submit Spark applications☆38Jan 15, 2018Updated 8 years ago
- ☆24Dec 4, 2023Updated 2 years ago
- Library which aim to generate kubernetes yamls templates from an Airflow dag using the Airflow Kuberntes Pod Operator☆10May 6, 2021Updated 4 years ago
- The Coherence Python Client allows Python applications to act as cache clients to an Oracle Coherence cluster using gRPC as the network t…☆12Mar 26, 2026Updated 2 weeks ago
- spark on kubernetes☆104Feb 20, 2023Updated 3 years ago
- A docker image for HDFS FileBrowser. Cloudera Hue with FileBrowser only.☆11Sep 20, 2018Updated 7 years ago
- Code for the post, 'Getting Started with IoT Analytics on AWS'☆14Aug 27, 2020Updated 5 years ago
- Mastering Convolutional Neural Networks☆11Sep 14, 2020Updated 5 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- Repository for building docker image, with open-source applications☆26Apr 23, 2024Updated last year
- This repository is for demonstrating the capability to do SQL-based UPDATES, DELETES, and INSERTS directly in the Data Lake using Amazon …☆18Aug 25, 2021Updated 4 years ago
- Vagrant multiple host configuration☆14Dec 7, 2016Updated 9 years ago
- ☆20Aug 10, 2021Updated 4 years ago
- Fully automated provisioner for servers using MCollective☆44Jan 31, 2013Updated 13 years ago
- Hyper-Scale Machine Learning with MinIO and TensorFlow☆11Mar 25, 2023Updated 3 years ago
- Deploying a Machine Learning model streaming application with Apache Kafka☆11Aug 21, 2022Updated 3 years ago