Build and run Spark Structured Streaming pipelines in Hadoop - project using PySpark.
☆13Jun 6, 2019Updated 6 years ago
Alternatives and similar repositories for spark-streaming-pyspark
Users that are interested in spark-streaming-pyspark are comparing it to the libraries listed below
Sorting:
- This is the first project where we worked on apache spark, In this project what we have done is that we downloaded the datasets from KAGG…☆22Oct 14, 2021Updated 4 years ago
- 📚 MIT Manipal Data Science Engineering: Your go-to resource hub for lab study materials, code, and more. Enhance your learning with this…☆10Sep 3, 2024Updated last year
- ☆35Jul 13, 2020Updated 5 years ago
- Improving the development of Spark applications deployed as jobs on AWS services like Glue and EMR☆10Jul 26, 2023Updated 2 years ago
- Latest version of GoFFish Distributed Graph Processing Platforms☆12Apr 30, 2018Updated 7 years ago
- Kubernetes Container Storage Interface (CSI) plug-in for Oracle ZFS Storage Appliance.☆14Jul 2, 2024Updated last year
- ☆11Jun 15, 2019Updated 6 years ago
- Files to Build a Docker Image for Facebook Prophet☆13Feb 7, 2019Updated 7 years ago
- ☆11Jun 12, 2019Updated 6 years ago
- Kubernetes Volume Snapshot Controller using Custom Resource Definition☆12Sep 20, 2017Updated 8 years ago
- ☆12Jan 25, 2018Updated 8 years ago
- This is an example of real time stream processing using Spark Streaming, Kafka & Elasticsearch.☆40Aug 31, 2016Updated 9 years ago
- Tomcat operator for Kubernetes☆12Mar 21, 2019Updated 6 years ago
- Collection of description of concepts, procedures, and simple XSLT files for text processing, e.g. simplify InDesign documents (.idml) to…☆12Jan 9, 2020Updated 6 years ago
- NVMesh Container Storage Interface (CSI) Driver for Kubernetes☆11Oct 7, 2024Updated last year
- Prediction of Premier League results using Machine Learning☆11Jul 11, 2024Updated last year
- Run Tensorflow and Keras with GPU support on Kubernetes☆13Mar 21, 2017Updated 8 years ago
- Privacy-preserving data sandbox for on-premise computation☆11Jun 15, 2021Updated 4 years ago
- 🎭 Sentiment Analysis with Neural Networks☆10Dec 4, 2016Updated 9 years ago
- Platzi - Curso Optimización de SQL☆14Jan 13, 2021Updated 5 years ago
- Errbot for Rocket.Chat - fork of unmaintained https://github.com/AoiKuiyuyou/AoikRocketChatErrbot☆12Feb 21, 2022Updated 4 years ago
- Code for the paper "Active learning for medical image segmentation with stochastic batches", published at Medical Image Analysis (2023).☆10Nov 14, 2024Updated last year
- containerized NFS Ganesha daemon☆10Aug 15, 2016Updated 9 years ago
- ☆14Feb 20, 2023Updated 3 years ago
- An example CI/CD pipeline using GitHub Actions for doing continuous deployment of AWS Glue jobs built on PySpark and Jupyter Notebooks.☆13Oct 15, 2020Updated 5 years ago
- ☆12Sep 17, 2019Updated 6 years ago
- ☆15Sep 10, 2022Updated 3 years ago
- This repository contains example patterns for storing large objects with DynamoDB.☆13Jun 19, 2024Updated last year
- Cloud Storage Kubernetes Operator with Go and Operator SDK☆12Nov 20, 2020Updated 5 years ago
- Using OPA Gatekeeper to deny admission or audit Istio and Istio-related objects☆12Nov 25, 2019Updated 6 years ago
- Simple tool in Python to help monitoring ram/cpu/io usage around ceph.☆10Jul 29, 2016Updated 9 years ago
- Kuberetes etcd network checkpointer