Data Engineering with Scala, published by Packt
☆28Mar 2, 2026Updated last month
Alternatives and similar repositories for Data-Engineering-with-Scala-and-Spark
Users that are interested in Data-Engineering-with-Scala-and-Spark are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Python wrapper for Google Maps JavaScript API V3 and Google Earth API.☆17Sep 13, 2014Updated 11 years ago
- a space for housing all analytics engineering resources that i've found helpful or that i think may be helpful☆21Feb 20, 2024Updated 2 years ago
- Mastering Apache Storm, published by Packt☆13Oct 30, 2023Updated 2 years ago
- I'm a lonely, lonely lord. Pick another day to be restored. Just another fate that's going overboard.☆21Jan 27, 2023Updated 3 years ago
- will add all data science project that I'll do.☆11May 14, 2022Updated 3 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Integrating with Spotify API and extracting Data. Deploying code on AWS Lambda for Data Extraction. Adding trigger to run the extraction …☆12Jul 5, 2023Updated 2 years ago
- My own ETL pipeline of random users utilising Postgres for long term storage and Redis for caching. Served up via FastAPI and Docker☆31Oct 22, 2024Updated last year
- ☆14Dec 15, 2025Updated 4 months ago
- This project shows how to capture changes from postgres database and stream them into kafka☆42May 17, 2024Updated last year
- Get an OpenCV video capture from an YouTube video URL☆27Aug 26, 2024Updated last year
- Data Engineering with AWS, 2nd edition - Published by Packt☆170Oct 31, 2023Updated 2 years ago
- Analytics engineering with dbt - projects and developer environment☆22Sep 27, 2024Updated last year
- DataTalks Workshop Materials☆19Mar 18, 2024Updated 2 years ago
- Predicción simple de casos de Covid-19 en Mexico☆10Apr 6, 2020Updated 6 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- dbt-databend adapter plugin☆10May 30, 2024Updated last year
- Databricks ML in Action, Published by Packt☆34Mar 2, 2026Updated last month
- dbt Project for Rapid Onboarding instructors to use in instruction and learners to reference throughout the course.☆26Apr 10, 2026Updated last week
- ☆14Apr 8, 2026Updated last week
- A list of MCP services for popular data tools☆20Jul 14, 2025Updated 9 months ago
- This repository contains the necessary configuration files and DAGs (Directed Acyclic Graphs) for setting up a robust data engineering en…☆25Jan 26, 2024Updated 2 years ago
- ☆18Jun 13, 2021Updated 4 years ago
- Use dbt to manage real-time data transformations in RisingWave.☆35Apr 8, 2026Updated last week
- Linkedin Webscraper is a tool for search jobs publications (or other publications) with a keyword. Download data to excel file.☆24Feb 16, 2022Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- TON cuda and opencl miner☆15Feb 13, 2024Updated 2 years ago
- Trying out the Dataframe Polars library with Delta Lake ... feat Python.☆12Jan 29, 2025Updated last year
- dlt-dagster-demo☆13Nov 6, 2023Updated 2 years ago
- Spring Boot application demonstrating Kafka Streams stateless and stateful processing☆29Jul 18, 2023Updated 2 years ago
- Python Binding for Rust WhatLang, a language detection library☆14Jan 5, 2024Updated 2 years ago
- Code for Apache Hudi, Apache Iceberg and Delta Lake analysis☆10Feb 2, 2024Updated 2 years ago
- Demos for Nessie. Nessie provides Git-like capabilities for your Data Lake.☆31Updated this week
- CloudPayments-SDK-Android☆10Aug 9, 2023Updated 2 years ago
- Bigdata on Kubernetes, Published by Packt☆36Oct 1, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- End-to-End ELT data pipeline with Postgres, Airbyte, dbt, Dagster, Snowflake and Metabase☆11Jul 13, 2023Updated 2 years ago
- Templates for your Note Taking Markdown Workflow☆21Mar 5, 2023Updated 3 years ago
- Repository containing scripts and files for 16S gene community analysis chapter in Methods in Molecular Biology☆25May 22, 2019Updated 6 years ago
- DBT and clickhouse test project with dagster☆12Aug 29, 2023Updated 2 years ago
- A tool to generate PySpark schema from JSON.☆29Jan 21, 2024Updated 2 years ago
- A curated list of dagster code snippets for data engineers☆56Feb 26, 2024Updated 2 years ago
- ☆15Apr 29, 2024Updated last year