A series documenting lessons learned from Apache Spark (PySpark), with quick tips and workarounds for day-to-day problems
☆56Sep 30, 2023Updated 2 years ago
Alternatives and similar repositories for ease-with-apache-spark
Users who are interested in ease-with-apache-spark are comparing it to the libraries listed below.
- Public Docker Images for popular services☆50Sep 7, 2025Updated 6 months ago
- Analysis of 311 Service Requests for the City of NYC (from 2010 to 2023) Tech: Prefect cloud, dbt core, BigQuery, Compute Engine, CloudRu…☆20Apr 5, 2023Updated 2 years ago
- In this project, we set up an end-to-end data engineering using Apache Spark, Azure Databricks, Data Build Tool (DBT) using Azure as our …☆39Dec 18, 2023Updated 2 years ago
- ☆26May 13, 2025Updated 10 months ago
- ☆70Mar 1, 2026Updated 3 weeks ago
- Pluralsight trainings code☆10Jun 24, 2021Updated 4 years ago
- Automation of Databricks workflows☆13Nov 9, 2025Updated 4 months ago
- Create streaming data, transfer it to Kafka, modify it with PySpark, and send it to ElasticSearch and MinIO☆65Jul 21, 2023Updated 2 years ago
- ☆18Jul 7, 2025Updated 8 months ago
- Collection of code examples, snippets, demos for running Python in Snowflake☆24Aug 6, 2025Updated 7 months ago
- ☆15Mar 29, 2024Updated last year
- ☆22Feb 5, 2024Updated 2 years ago
- Ravi Azure ADB ADF Repository☆65Jan 25, 2025Updated last year
- HCM Data Loading Sample using JAVA☆11Aug 24, 2023Updated 2 years ago
- This repo consists of all important concepts for data engineers.☆11Dec 24, 2024Updated last year
- ☆15Oct 19, 2023Updated 2 years ago
- SAM3 ROS1/ROS2 wrapper☆48Mar 9, 2026Updated 2 weeks ago
- Resources for software/backend/data learning | #SE | #DE | #DS☆17Nov 16, 2025Updated 4 months ago
- Batch processing, orchestration using Apache Airflow and Google Workflows, Spark Structured Streaming, and a lot more☆18Jun 21, 2022Updated 3 years ago
- My Git Repo for Csv Data☆21Oct 5, 2025Updated 5 months ago
- Get data from API, run a scheduled script with Airflow, send data to Kafka and consume with Spark, then write to Cassandra☆145Jul 27, 2023Updated 2 years ago
- Extract, transform, and load data for analytic processing using AWS Glue☆17May 2, 2021Updated 4 years ago
- Classwork projects and homework done through the Udacity Data Engineering Nanodegree☆10Jun 6, 2021Updated 4 years ago
- This project demonstrates the use of deep learning to detect emotions (sad, angry, happy, etc.) from images of faces.☆11Feb 14, 2020Updated 6 years ago
- ☆46Jul 6, 2024Updated last year
- Sample code demonstrating how you can use Oracle Cloud Infrastructure serverless components to load data into Oracle Fusion ERP☆12Aug 24, 2023Updated 2 years ago
- A course on building Large Language Models☆12Mar 24, 2025Updated last year
- Tools for Microsoft Fabric☆25Jul 17, 2025Updated 8 months ago
- ☆24Feb 4, 2026Updated last month
- This repository contains the necessary configuration files and DAGs (Directed Acyclic Graphs) for setting up a robust data engineering en…☆25Jan 26, 2024Updated 2 years ago
- A WordPress AWS install controlled by Terraform 0.11.x☆11Nov 24, 2022Updated 3 years ago
- ☆16Feb 17, 2020Updated 6 years ago
- An example repository showing how to leverage Kafka to stream your data☆21May 11, 2024Updated last year
- Easily deploy Airflow infrastructure on an AWS VPC using Terraform.☆11Apr 9, 2019Updated 6 years ago
- Scraping of the Korean History Database☆24Feb 15, 2024Updated 2 years ago
- ☆33Sep 2, 2024Updated last year
- WorldQuant University Deep Learning for Computer Vision Certification Projects☆22Apr 3, 2025Updated 11 months ago
- This repo is for the LinkedIn Learning course: Hands-On AI: Build a Generative Language Model from Scratch☆34Dec 28, 2023Updated 2 years ago
- Fine-tune Mistral-7b on the Enlighten codebase☆28Feb 6, 2024Updated 2 years ago