Data Engineering with Scala and Spark, published by Packt
☆28 · Mar 2, 2026 · Updated this week
Alternatives and similar repositories for Data-Engineering-with-Scala-and-Spark
Users who are interested in Data-Engineering-with-Scala-and-Spark are comparing it to the repositories listed below
- Data Observability for Data Engineering, published by Packt Publishing ☆11 · Jan 24, 2025 · Updated last year
- This repository contains an end-to-end data engineering project using Apache Flink, focused on performing sales analytics. The project de… ☆11 · Nov 18, 2023 · Updated 2 years ago
- ☆17 · Jun 6, 2022 · Updated 3 years ago
- A space for housing all analytics engineering resources that I've found helpful or that I think may be helpful ☆21 · Feb 20, 2024 · Updated 2 years ago
- CI/CD pipeline project ☆27 · Aug 5, 2025 · Updated 7 months ago
- ☆18 · Jun 13, 2021 · Updated 4 years ago
- This repository contains the necessary configuration files and DAGs (Directed Acyclic Graphs) for setting up a robust data engineering en… ☆25 · Jan 26, 2024 · Updated 2 years ago
- Analytics engineering with dbt - projects and developer environment ☆22 · Sep 27, 2024 · Updated last year
- Demonstrating the capabilities of DuckDB as a transformation engine for data lakes ☆33 · Oct 8, 2024 · Updated last year
- Spring Boot application demonstrating Kafka Streams stateless and stateful processing ☆29 · Jul 18, 2023 · Updated 2 years ago
- My own ETL pipeline of random users utilising Postgres for long-term storage and Redis for caching. Served up via FastAPI and Docker ☆31 · Oct 22, 2024 · Updated last year
- ☆30 · Dec 24, 2025 · Updated 2 months ago
- Python wrapper for Google Maps JavaScript API V3 and Google Earth API. ☆17 · Sep 13, 2014 · Updated 11 years ago
- Big Data on Kubernetes, published by Packt ☆36 · Oct 1, 2024 · Updated last year
- Training website builders, from beginner to pro ☆15 · Nov 26, 2023 · Updated 2 years ago
- Beyond Vibe Coding. Code, Planning, Documentation and Product Management agents. ☆70 · Feb 20, 2026 · Updated 2 weeks ago
- CODO is an ontology for the semantic representation and annotation of COVID-19 data in a machine-readable form for tracking history of th… ☆10 · Apr 19, 2022 · Updated 3 years ago
- ☆12 · Updated this week
- Visual tool for SPARQL queries on graphol graphs ☆10 · Oct 3, 2018 · Updated 7 years ago
- ☆10 · Jul 24, 2022 · Updated 3 years ago
- Demos for Nessie. Nessie provides Git-like capabilities for your Data Lake. ☆30 · Updated this week
- Repository for the dbt Semantic Layer course ☆12 · Updated this week
- Angular Frontend for the Spring Boot Microservices series ☆13 · Jun 9, 2024 · Updated last year
- Hands-on Python for DevOps, published by Packt ☆43 · Sep 1, 2025 · Updated 6 months ago
- This project shows how to capture changes from a Postgres database and stream them into Kafka ☆41 · May 17, 2024 · Updated last year
- Data Engineering with AWS, 2nd edition - Published by Packt ☆169 · Oct 31, 2023 · Updated 2 years ago
- This project sets up a real-time data pipeline utilizing Change Data Capture (CDC) to stream changes from a PostgreSQL database to a Clic… ☆12 · May 9, 2024 · Updated last year
- Elasticsearch plugin for Sentiment Analysis using Stanford CoreNLP ☆11 · Oct 17, 2018 · Updated 7 years ago
- ☆12 · Jun 1, 2020 · Updated 5 years ago
- Node backend, React and Redux Toolkit frontend ☆11 · Nov 3, 2021 · Updated 4 years ago
- Is using KoP (Kafka-On-Pulsar) a good idea? Use the scenarios implemented in this repository to check whether Pulsar with KoP enabled is … ☆12 · Nov 3, 2022 · Updated 3 years ago
- A basic DNN tutorial in PyTorch, for persons without a background in Linux, Python, or remote servers ☆10 · Apr 2, 2020 · Updated 5 years ago
- CloudPayments-SDK-Android ☆10 · Aug 9, 2023 · Updated 2 years ago
- This GitHub repository contains a project that automates the provisioning of a Kubernetes (K8s) cluster using Infrastructure as Code (IaC… ☆15 · Oct 19, 2025 · Updated 4 months ago
- dbt and ClickHouse test project with Dagster ☆12 · Aug 29, 2023 · Updated 2 years ago
- Spanish text summarization demo using CoreNLP ☆10 · Sep 13, 2014 · Updated 11 years ago
- Integrating with the Spotify API and extracting data. Deploying code on AWS Lambda for data extraction. Adding a trigger to run the extraction … ☆11 · Jul 5, 2023 · Updated 2 years ago
- A benchmark for generic, large-scale shuffle operations on continuous stream of data, implemented with state-of-the-art stream processing… ☆14 · Feb 11, 2026 · Updated 3 weeks ago
- RDF Community Discussions. Ask anything here! ☆13 · Apr 11, 2024 · Updated last year