Chabane/bigdata-playground

A complete example of a big data application using: Kubernetes (kops/AWS), Apache Spark SQL/Streaming/MLlib, Apache Flink, Scala, Python, Apache Kafka, Apache HBase, Apache Parquet, Apache Avro, Apache Storm, Twitter API, MongoDB, Node.js, Angular, GraphQL

☆210

Alternatives and similar repositories for bigdata-playground

Users that are interested in bigdata-playground are comparing it to the libraries listed below.

Chabane / generator-mitosis
A micro-service infrastructure generator based on Yeoman/Chatbot, Kubernetes/Docker Swarm, Traefik, Ansible, Jenkins, Spark, Hadoop, Kafk…
☆81 · Dec 6, 2022 · Updated 3 years ago
thedataincubator / spark-structured-streaming
A short course on the new, experimental features by The Data Incubator and O'Reilly Strata.
☆16 · Jan 25, 2017 · Updated 9 years ago
ansrivas / spark-structured-streaming
Spark structured streaming with Kafka data source and writing to Cassandra
☆62 · Dec 5, 2019 · Updated 6 years ago
kaiwaehner / iiot-integration-apache-plc4x-kafka-connect-ksql-opc-ua-modbus-siemens-s7
Industrial IoT (IIoT) Integration and Data Processing with Apache PLC4X, Kafka Connect, KSQL (OPC-UA, Modbus, Siemens S7)
☆34 · Aug 30, 2019 · Updated 6 years ago
spirom / spark-streaming-with-kafka
Self-contained examples of Apache Spark streaming integrated with Apache Kafka.
☆196 · Apr 15, 2018 · Updated 8 years ago
dhatanian / aws-ec2-costs
☆10 · Jun 9, 2019 · Updated 7 years ago
wypb / FlinkForward201709
Flink Forward 201709
☆43 · Oct 20, 2017 · Updated 8 years ago
sahilbhange / spark-slowly-changing-dimension
Spark implementation of Slowly Changing Dimension type 2
☆11 · Jan 8, 2019 · Updated 7 years ago
heta-io / tap
Text Analytics Pipeline (TAP)
☆17 · Jan 4, 2026 · Updated 6 months ago
Yannael / kafka-sparkstreaming-cassandra
Docker container for Kafka - Spark Streaming - Cassandra
☆96 · Jun 17, 2019 · Updated 7 years ago
nsadawi / ELK-Stack-Primer
☆12 · Apr 21, 2021 · Updated 5 years ago
polomarcus / Spark-Structured-Streaming-Examples
Spark Structured Streaming / Kafka / Cassandra / Elastic
☆186 · Feb 7, 2023 · Updated 3 years ago
Apress / bigquery-for-data-warehousing
Source code for 'BigQuery for Data Warehousing' by Mark Mucchetti
☆16 · Sep 28, 2020 · Updated 5 years ago
bizzabo / elasticsearch_to_bigquery_data_pipeline
A generic data pipeline that maps Elasticsearch documents to BigQuery table rows
☆14 · Sep 29, 2019 · Updated 6 years ago
NicolaNardino / Blockchain.Ethereum
Java web application backed by the Ethereum-Blockchain network. Powered by RESTful web services (JAX-RS && Spring Boot), Docker, Kuberne…
☆14 · Feb 19, 2019 · Updated 7 years ago
themillhousegroup / scoup
JSoup extensions for Scala
☆12 · Jun 1, 2021 · Updated 5 years ago
soniclavier / bigdata-notebook
☆104 · Nov 26, 2019 · Updated 6 years ago
fhoffa / bigquery_patterns
BigQuery patterns
☆14 · Aug 11, 2017 · Updated 8 years ago
mozilla / telemetry-streaming
Spark Streaming ETL jobs for Mozilla Telemetry
☆18 · Dec 5, 2019 · Updated 6 years ago
Ranlot / spark-streaming-visualize
Simple demonstration of how to build a complex real-time machine learning visualization tool.
☆16 · Mar 26, 2016 · Updated 10 years ago
cn0047 / benchmark-postgres-mongo
Benchmarking read performance of PostgreSQL and MongoDB on the same data sets.
☆16 · Aug 14, 2018 · Updated 7 years ago
sergio11 / document_search_engine_architecture
📄🚀 Unleash a powerful Document Search Engine with Apache NiFi for lightning-fast, comprehensive text indexing and search.
☆30 · Nov 26, 2025 · Updated 8 months ago
daanalytics / Snowflake
Snowflake scripts and useful snippets
☆16 · Feb 2, 2025 · Updated last year
dangalavan / Optimizing-DataVault-on-Snowflake
Scripts that complement the webinar "Optimizing a Data Vault data warehouse on the Snowflake Cloud Data Platform"
☆16 · Oct 8, 2020 · Updated 5 years ago
big-data-europe / docker-flink
Apache Flink Docker image
☆196 · Jul 1, 2022 · Updated 4 years ago
searceinc / BQconvert
BigQuery Schema Conversion Tool
☆24 · Oct 6, 2020 · Updated 5 years ago
spirom / LearningSpark
Scala examples for learning to use Spark
☆442 · Sep 17, 2020 · Updated 5 years ago
holdenk / spark-structured-streaming-ml
Structured Streaming Machine Learning example with Spark 2.0
☆95 · Apr 24, 2017 · Updated 9 years ago
rootfs / cephfs-provisioner
Kubernetes CephFS PV Provisioner
☆13 · Mar 31, 2017 · Updated 9 years ago
PacktPublishing / Large-Scale-Machine-Learning-with-Spark
Code repository for Large Scale Machine Learning with Spark by Packt
☆20 · Oct 31, 2022 · Updated 3 years ago
streamnative / pulsar-spark
Spark Connector to read and write with Pulsar
☆120 · May 26, 2026 · Updated 2 months ago
aws-samples / aws-netcoreapi-aurora-cdk
☆12 · Aug 19, 2021 · Updated 4 years ago
robbidog / cqrsPythonExercises
☆11 · Nov 17, 2020 · Updated 5 years ago
IBM / akka-react-cloudant
A soccer dashboard created by scraping the EPL website, using an Akka backend, a ReactJS frontend, and IBM Cloudant for object storage. IBM Cloud…
☆20 · Feb 14, 2022 · Updated 4 years ago
aravinthsci / Spark_Delta_Lake
Delta Lake Examples
☆11 · Apr 24, 2020 · Updated 6 years ago
rockthejvm / spark-performance-tuning
The official repository for the Rock the JVM Spark Optimization 2 course
☆45 · Jun 20, 2026 · Updated last month
phatak-dev / kubernetes-spark
Docker Image and Kubernetes Configurations for Spark 2.x
☆40 · Oct 27, 2019 · Updated 6 years ago
jaceklaskowski / spark-workshop
Apache Spark™ and Scala Workshops
☆264 · Jul 29, 2024 · Updated 2 years ago
sbalagop / neo
RESTful APIs using Node.js, Express and Oracle (NEO)
☆22 · May 28, 2017 · Updated 9 years ago