sciabarra / BigDataDevKitLinks
Big Data Development Kit (Hadoop / Spark / Zeppelin / IntelliJ)
☆22Updated 9 years ago
Alternatives and similar repositories for BigDataDevKit
Users that are interested in BigDataDevKit are comparing it to the libraries listed below
Sorting:
- CDAP Applications☆44Updated 7 years ago
 - TensorFlow Processor for Spring Cloud Dataflow☆24Updated 8 years ago
 - Repository with DC/OS demos to show specific use cases, usually industry specific.☆102Updated 3 years ago
 - A docker image for testing MemSQL + MemSQL Ops☆68Updated 2 years ago
 - A DC/OS time series demo☆62Updated 9 years ago
 - Core OJAI APIs☆47Updated last year
 - [DEPRECATED] Script used to manage Hadoop and Spark instances on Google Compute Engine☆109Updated 5 years ago
 - Scripts and instructions for Zero To Cloud With NetflixOSS☆149Updated 6 years ago
 - Docker Image and Kubernetes Configurations for Spark 2.x☆41Updated 6 years ago
 - 4-day deep dive of docker + kubernetes☆34Updated 9 years ago
 - An example Apache Beam project.☆111Updated 8 years ago
 - Sample Spark Code☆91Updated 7 years ago
 - Class files for Fast Track to Python☆19Updated 11 years ago
 - ☆46Updated 7 years ago
 - Get started with Apache Beam and Flink☆43Updated 9 years ago
 - This project is no longer actively supported. It is made available as read-only. A highly available, horizontally scalable queuing and no…☆275Updated 6 years ago
 - IoT - It's the thing you want! And so here's a full-stack demo.☆62Updated 9 years ago
 - Getting started with Pulsar and Cassandra☆20Updated 4 years ago
 - Fusion demo app searching open-source project data from the Apache Software Foundation☆43Updated 7 years ago
 - A scalable, distributed Time Series Database.☆28Updated 10 years ago
 - Bullet is a streaming query engine that can be plugged into any singular data stream using a Stream Processing framework like Apache Stor…☆41Updated 2 years ago
 - This code demonstrates the deployment of OpenWhisk on Kubernetes cluster from Bluemix container service☆33Updated 3 years ago
 - Basic getting started with Kafka examples☆47Updated 6 years ago
 - Berserker is load generator with pluggable input source and configurable output.☆53Updated 2 years ago
 - A collection of examples to help show different ways to managing state in Apache Flink☆27Updated 6 years ago
 - Herd is a managed data lake for the cloud. The Herd unified data catalog helps separate storage from compute in the cloud. Manage petabyt…☆137Updated 3 years ago
 - Herd-MDL, a turnkey managed data lake in the cloud. See https://finraos.github.io/herd-mdl/ for more information.☆15Updated last year
 - Create Kafka-Connect clusters with docker . You put the Kafka, we put the Connect.☆25Updated 6 years ago
 - Reference architecture for real-time stream processing with Apache Flink on Amazon EMR, Amazon Kinesis, and Amazon Elasticsearch Service.☆72Updated last year
 - Cloudbreak Deployer Tool☆34Updated 2 years ago