ThoughtWorksInc / streaming-data-pipeline
Streaming pipeline repo for data engineering training program
☆9 · Updated 5 years ago
Related projects
Alternatives and complementary repositories for streaming-data-pipeline
- Labs and data files for a full-day Spark workshop ☆24 · Updated last year
- Deploy an IMDB sentiment analysis model using kubernetes ☆13 · Updated last year
- A cookiecutter template for Apache Spark applications written in Scala ☆10 · Updated 5 years ago
- A curated list of awesome Apache Spark packages and resources. ☆40 · Updated 7 years ago
- Terraform module for a PostgreSQL-backed Apache Airflow instance ☆24 · Updated 6 years ago
- Sample Code for Thoughtful Data Science book ☆15 · Updated 5 years ago
- A basic example of how to read and write streaming data using Apache Spark and Kafka on HDInsight ☆13 · Updated last year
- These are some code examples ☆55 · Updated 4 years ago
- AWS Big Data Certification ☆25 · Updated last year
- Docker-izing Data Science Applications CodeLab for QCon AI 2018 ☆13 · Updated 6 years ago
- Lighthouse is a library for data lakes built on top of Apache Spark. It provides high-level APIs in Scala to streamline data pipelines an… ☆60 · Updated 2 months ago
- Real-world Spark pipelines examples ☆83 · Updated 6 years ago
- personal cheatsheets on various technologies ☆25 · Updated 8 years ago
- Mirror of Apache Beam ☆10 · Updated 3 years ago
- Docker compose files for various kafka stacks ☆33 · Updated 6 years ago
- Spark with Scala example projects ☆33 · Updated 5 years ago
- Datasets for CS109 ☆28 · Updated 11 years ago
- Pipelines Example Applications ☆15 · Updated 5 years ago
- Pachyderm/MLeap team up to provide versioned datasets + models ☆10 · Updated 7 years ago
- Example project which simulates an interesting analytics use case using MemSQL Pipelines. ☆14 · Updated 7 years ago
- Slowly Changing Dimension type 2 using Hive query language using exclusive join technique with ORC Hive tables, partitioned and clustered… ☆16 · Updated 5 years ago
- Sketching data structures for scala, including t-digest ☆15 · Updated 3 years ago
- Examples and explanations of how RPC systems work. ☆25 · Updated last year
- Challenge for those applying to the Software Engineer, Big Data position ☆34 · Updated 13 years ago