ThoughtWorksInc / streaming-data-pipelineLinks
Streaming pipeline repo for data engineering training program
☆9Updated 5 years ago
Alternatives and similar repositories for streaming-data-pipeline
Users that are interested in streaming-data-pipeline are comparing it to the libraries listed below
Sorting:
- A basic example of how to read and write streaming data using Apache Spark and Kafka on HDInsight☆13Updated 2 years ago
- Terraform module for a PostgreSQL-backed Apache Airflow instance☆24Updated 7 years ago
- Pipelines Example Applications☆14Updated 5 years ago
- Code for the course Principles Of Reactive Programming, Spring 2015 session☆23Updated 8 years ago
- Deploy an IMDB sentiment analysis model using kubernetes☆13Updated 2 years ago
- AWS bootstrap scripts for Mozilla's flavoured Spark setup.☆47Updated 5 years ago
- Chaos Testing in Kubernetes☆11Updated 7 years ago
- Lighthouse is a library for data lakes built on top of Apache Spark. It provides high-level APIs in Scala to streamline data pipelines an…☆61Updated 10 months ago
- Docker compose files for various kafka stacks☆32Updated 7 years ago
- Common components used across the datamountaineer kafka connect connectors☆21Updated 4 years ago
- ☆44Updated 7 years ago
- A cookiecutter template for Apache Spark applications written in Scala☆10Updated 6 years ago
- A curated list of awesome Apache Spark packages and resources.☆40Updated 8 years ago
- A Scala framework to build derived datasets, aka batch views, of Telemetry data.☆35Updated 3 years ago
- A comparison of stream-processing frameworks with Kafka integration☆10Updated 6 years ago
- Real-world Spark pipelines examples☆83Updated 7 years ago
- AMQP data source for dstream (Spark Streaming)☆26Updated 3 years ago
- Apache Marvin-AI☆100Updated 2 years ago
- Visualize statistics from the MOOC "Functional Programming Principles in Scala" using Scala!☆202Updated 11 years ago
- Some AWS EMR examples☆16Updated 7 years ago
- Onboarding to data science by ThoughtWorks☆56Updated 5 years ago
- Python Streaming Pipelines with Beam on Flink - Demo☆14Updated 2 years ago
- AWS Big Data Certification☆25Updated 6 months ago
- Apache Spark Awesome List☆14Updated 9 years ago
- These are some code examples☆55Updated 5 years ago
- A collection of examples to help show different ways to managing state in Apache Flink☆27Updated 6 years ago
- Embedded Kafka for testing and quick prototyping.☆14Updated 9 years ago
- Code from the book Machine Learning Systems☆145Updated 6 years ago
- Geo-Located Data: Extracting Patterns from Mobile Data using Scikit-Learn and Cassandra☆29Updated 7 years ago
- The purpose of this tiny project is to put things together with the know how that i learned from the course big data expert from formacio…☆62Updated 6 years ago