This repository contains code for Spark Streaming
☆26Mar 11, 2021Updated 5 years ago
Alternatives and similar repositories for spark-streaming
Users that are interested in spark-streaming are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆10Jun 29, 2021Updated 4 years ago
- Template for Scala Spark with Unit Test☆13Jul 24, 2023Updated 2 years ago
- A shell script to automate the operations of sqoop☆11Mar 29, 2021Updated 5 years ago
- ☆28Jun 14, 2022Updated 4 years ago
- A four-day course on Python, the Scientific Python stack and PySpark, adapted from a training course I gave to one of our clients in Dece…☆10Feb 3, 2016Updated 10 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆12Oct 22, 2022Updated 3 years ago
- PG-Diploma In Big Data Analytics from CDAC☆26Mar 16, 2019Updated 7 years ago
- Reusable Python classes that extend open source PySpark capabilities. Examples of implementation is available under notebooks of repo htt…☆13Nov 1, 2024Updated last year
- For my IBM Data Science Professional certificate capstone project in early 2020, I used pandas, the FourSquare API, Folium, and other Pyt…☆13Dec 31, 2020Updated 5 years ago
- A website made in hopes to recreate the no longer available Internet Wishlist. Open source project built completely by the community.☆14Dec 6, 2022Updated 3 years ago
- A Python FHIR specification parser and class generator☆19Nov 22, 2021Updated 4 years ago
- ☆15May 18, 2022Updated 4 years ago
- ☆11Mar 11, 2022Updated 4 years ago
- 📦 Starting box for Vagrant. Inside box Ubuntu 20.04 LTS with Git, Docker and Docker compose.☆19May 5, 2022Updated 4 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- AWS LocalStack + Spark Cluster + Zeppelin [Docker]☆10Jul 6, 2022Updated 3 years ago
- Material for teaching vtk python☆17Jul 21, 2015Updated 10 years ago
- Repository for Microsoft Databricks Training Events - Hosted by BlueGranite☆15Aug 22, 2019Updated 6 years ago
- R package for tracking Covid19 cases in San Francisco☆12Apr 2, 2023Updated 3 years ago
- Apache Hadoop - Docker distribution based on CentOS 7 and Oracle Java 8☆12Feb 20, 2018Updated 8 years ago
- chrome extension☆18Jun 29, 2020Updated 5 years ago
- Cloudformation template for deploying Presto on AWS☆13Jul 20, 2020Updated 5 years ago
- Experimental next-generation Terraform SDK (prototype)☆11Feb 24, 2023Updated 3 years ago
- Sons da CPI - Luide Matos☆19Jun 24, 2021Updated 4 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ETL (Extract, Transform and Load) with the Spark Python API (PySpark) and Hadoop Distributed File System (HDFS)☆17Dec 18, 2018Updated 7 years ago
- This repository will help you to learn about databricks concept with the help of examples. It will include all the important topics which…☆104Sep 26, 2025Updated 8 months ago
- Add gevent support to DataStax Python Driver for Apache Cassandra☆11Jun 10, 2020Updated 6 years ago
- DevOps☆16May 17, 2021Updated 5 years ago
- ☆11Dec 14, 2015Updated 10 years ago
- Hands-On Scala Programming [Video], published by Packt☆13Oct 31, 2022Updated 3 years ago
- Apache-Spark based Data Flow(ETL) Framework which supports multiple read, write destinations of different types and also support multiple…☆26Jun 7, 2021Updated 5 years ago
- Next-Generation Sequencing (NGS) Data Processing Tool & Library☆10May 9, 2023Updated 3 years ago
- Ansible Playbook to create LAMP in CentOS 7 with Apache, MySQL, PHP.☆10Dec 28, 2018Updated 7 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Embed program outputs in markdown☆17Jun 8, 2026Updated last week
- A custom AWS credential provider that allows your Hadoop or Spark application access S3 file system by assuming a role☆10Jan 9, 2026Updated 5 months ago
- TensorFlow Lite SSD on a Jetson Nano 28.5 FPS☆12Dec 27, 2021Updated 4 years ago
- ☆10Aug 7, 2023Updated 2 years ago
- Apache Airflow CI pipeline☆19Jun 12, 2019Updated 7 years ago
- Examples for Dashboards.jl☆16Feb 9, 2020Updated 6 years ago
- A webhook that integrates the W&B model registry with Modal Labs☆15Dec 24, 2023Updated 2 years ago