ThoughtWorksInc / streaming-data-pipeline
Streaming pipeline repo for data engineering training program
☆9Updated 5 years ago
Alternatives and similar repositories for streaming-data-pipeline
Users that are interested in streaming-data-pipeline are comparing it to the libraries listed below
Sorting:
- Python Streaming Pipelines with Beam on Flink - Demo☆14Updated 2 years ago
- Sample Code for Thoughtful Data Science book☆15Updated 6 years ago
- A cookiecutter template for Apache Spark applications written in Scala☆10Updated 6 years ago
- Docker-izing Data Science Applications CodeLab for QCon AI 2018☆13Updated 7 years ago
- A basic example of how to read and write streaming data using Apache Spark and Kafka on HDInsight☆13Updated 2 years ago
- Pipelines Example Applications☆14Updated 5 years ago
- Datasets for CS109☆28Updated 11 years ago
- A curated list of awesome Apache Spark packages and resources.☆40Updated 8 years ago
- Master complex big data processing, stream analytics, and machine learning with Apache Spark☆18Updated 2 years ago
- Spark package to "plug" holes in data using SQL based rules ⚡️ 🔌☆29Updated 5 years ago
- Pachyderm/MLeap team up to provide versioned datasets + models☆10Updated 7 years ago
- A comparison of stream-processing frameworks with Kafka integration☆10Updated 6 years ago
- Labs and data files for a full-day Spark workshop☆24Updated last year
- Terraform module for a PostgreSQL-backed Apache Airflow instance☆24Updated 7 years ago
- Chaos Testing in Kubernetes☆11Updated 7 years ago
- Deploy an IMDB sentiment analysis model using kubernetes☆13Updated 2 years ago
- Testing Scala code with scalatest☆12Updated 2 years ago
- AWS Big Data Certification☆25Updated 4 months ago
- Common components used across the datamountaineer kafka connect connectors☆21Updated 4 years ago
- Examples of all Machine Learning Algorithm in Apache Spark☆15Updated 7 years ago
- Geo-Located Data: Extracting Patterns from Mobile Data using Scikit-Learn and Cassandra☆29Updated 6 years ago
- Book code for Testing in Scala on O'Reilly☆14Updated 10 years ago
- An example PySpark project with pytest☆16Updated 7 years ago
- Spark UDFs to deserialize Avro messages with schemas stored in Schema Registry.☆19Updated 7 years ago
- Mirror of Apache Beam☆10Updated 4 years ago
- Sketching data structures for scala, including t-digest☆15Updated 3 years ago
- Notes and projects for my book, “Functional Programming, Simplified"☆18Updated 7 years ago
- Apache Spark under Docker☆9Updated 9 years ago
- Examples for Fast Data Processing with Spark☆59Updated 11 years ago
- Source code for http://allaboutscala.com/scala-cheatsheet/☆9Updated 6 years ago