Series follows learning from Apache Spark (PySpark) with quick tips and workaround for daily problems in hand
☆56Sep 30, 2023Updated 2 years ago
Alternatives and similar repositories for ease-with-apache-spark
Users that are interested in ease-with-apache-spark are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Public Docker Images for popular services☆51Sep 7, 2025Updated 7 months ago
- Learn PySpark from Basics to Advanced. Checkout the YouTube Series : [PySpark - Zero to Hero]☆138Sep 7, 2025Updated 7 months ago
- Analysis of 311 Service Requests for the City of NYC (from 2010 to 2023) Tech: Prefect cloud, dbt core, BigQuery, Compute Engine, CloudRu…☆20Apr 5, 2023Updated 3 years ago
- This project showcases how to integrate the world of DevOps, focusing on Continuous Integration (CI) and Continuous Deployment (CD) with …☆14Dec 27, 2023Updated 2 years ago
- In this project, we setup and end to end data engineering using Apache Spark, Azure Databricks, Data Build Tool (DBT) using Azure as our …☆39Dec 18, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆28May 13, 2025Updated 11 months ago
- ☆70Mar 1, 2026Updated last month
- DBT and clickhouse test project with dagster☆12Aug 29, 2023Updated 2 years ago
- Automation of Databricks workflows☆13Nov 9, 2025Updated 5 months ago
- ☆18Jul 7, 2025Updated 9 months ago
- Hands-On Data Warehousing with Azure Data Factory, published by Packt☆15Jan 18, 2023Updated 3 years ago
- ☆22Feb 5, 2024Updated 2 years ago
- ☆15Mar 29, 2024Updated 2 years ago
- Simplify Big Data Analytics with Amazon EMR, published by Packt☆13Jan 18, 2023Updated 3 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- A basic Text2SQL App, powered by Langchain and OpenAI.☆12Jun 27, 2024Updated last year
- ☆14Nov 2, 2022Updated 3 years ago
- ☆29Jul 26, 2022Updated 3 years ago
- This repo consists of all important concepts for data engineers.☆11Dec 24, 2024Updated last year
- Azure Data Factory Cookbook_Second Edition, published by Packt☆19Feb 29, 2024Updated 2 years ago
- PoS crypto coin over ipfs distributed storage network (with new consensus protocol 🙌)☆16Apr 10, 2024Updated 2 years ago
- Resources for software/backend/data learning | #SE | #DE | #DS☆17Nov 16, 2025Updated 5 months ago
- Batch Processing , orchestration using Apache Airflow and Google Workflows, spark structured Streaming and a lot more☆18Jun 21, 2022Updated 3 years ago
- My Git Repo for Csv Data☆21Oct 5, 2025Updated 6 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Building an ML model to identify ROIs on marketing campaigns and their impacts on sales and customer conversions.☆17Sep 4, 2021Updated 4 years ago
- Get data from API, run a scheduled script with Airflow, send data to Kafka and consume with Spark, then write to Cassandra☆146Jul 27, 2023Updated 2 years ago
- SAM3 ROS1/ROS2 wrapper☆49Mar 9, 2026Updated last month
- This project demonstrates the use of Deep Learning to detect emotion (sad, angry, happy etc) from the images of faces.☆11Feb 14, 2020Updated 6 years ago
- A course on building Large Language Models☆13Mar 24, 2025Updated last year
- Demoing how to use Matrix and Each definitions in Azure DevOps YAML pipelines.☆19Apr 1, 2026Updated 2 weeks ago
- This repository contains the necessary configuration files and DAGs (Directed Acyclic Graphs) for setting up a robust data engineering en…☆25Jan 26, 2024Updated 2 years ago
- A buildroot based Raspberry Pi emulator platform☆22Feb 7, 2015Updated 11 years ago
- An end-to-end data engineering pipeline that orchestrates data ingestion, processing, and storage using Apache Airflow, Python, Apache Ka…☆323Feb 14, 2025Updated last year
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- The repository for my talk titled the same☆15Nov 20, 2019Updated 6 years ago
- ☆33Sep 2, 2024Updated last year
- End to End Pipeline using AWS Services such as s3, boto3, lambda, ECR, step functions, Dynamodb, Step Functions, etc☆23Jul 31, 2022Updated 3 years ago
- This repository will help you to learn about databricks concept with the help of examples. It will include all the important topics which…☆104Sep 26, 2025Updated 6 months ago
- ☆21Oct 21, 2024Updated last year
- This is a script that will use all of the accounts you have and follow a specified xbox profile☆20Jan 16, 2024Updated 2 years ago
- Apache Spark 3 for Data Engineering and Analytics with Python , By Packt publishing☆24Jul 23, 2023Updated 2 years ago