Series follows learning from Apache Spark (PySpark) with quick tips and workaround for daily problems in hand
☆55Sep 30, 2023Updated 2 years ago
Alternatives and similar repositories for ease-with-apache-spark
Users that are interested in ease-with-apache-spark are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Analysis of 311 Service Requests for the City of NYC (from 2010 to 2023) Tech: Prefect cloud, dbt core, BigQuery, Compute Engine, CloudRu…☆20Apr 5, 2023Updated 3 years ago
- This project showcases how to integrate the world of DevOps, focusing on Continuous Integration (CI) and Continuous Deployment (CD) with …☆14Dec 27, 2023Updated 2 years ago
- ☆29May 13, 2025Updated 11 months ago
- ☆69Updated this week
- Automation of Databricks workflows☆13Nov 9, 2025Updated 5 months ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Create a streaming data, transfer it to Kafka, modify it with PySpark, take it to ElasticSearch and MinIO☆65Jul 21, 2023Updated 2 years ago
- ☆23Feb 5, 2024Updated 2 years ago
- ☆15Mar 29, 2024Updated 2 years ago
- HCM Data Loading Sample using JAVA☆11Aug 24, 2023Updated 2 years ago
- A simple example to showcase machine learning model deployment with an API☆10Mar 7, 2022Updated 4 years ago
- ☆15Oct 19, 2023Updated 2 years ago
- My Git Repo for Csv Data☆21Oct 5, 2025Updated 7 months ago
- Get data from API, run a scheduled script with Airflow, send data to Kafka and consume with Spark, then write to Cassandra☆146Jul 27, 2023Updated 2 years ago
- Classwork projects and home works done through Udacity data engineering nano degree☆10Jun 6, 2021Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- This project demonstrates the use of Deep Learning to detect emotion (sad, angry, happy etc) from the images of faces.☆11Feb 14, 2020Updated 6 years ago
- ☆14Nov 21, 2015Updated 10 years ago
- ☆46Jul 6, 2024Updated last year
- Build a Reddit Content Research Agent with LLMs, LangChain, SERP, Jupyter, Django, Bright Data, Celery, Django QStash, and much more.☆25Sep 11, 2025Updated 7 months ago
- ☆11Jan 14, 2024Updated 2 years ago
- ☆25Feb 4, 2026Updated 3 months ago
- This repository contains the necessary configuration files and DAGs (Directed Acyclic Graphs) for setting up a robust data engineering en…☆25Jan 26, 2024Updated 2 years ago
- A Wordpress AWS install controlled by Terraform 0.11.x☆11Apr 19, 2026Updated 2 weeks ago
- This is a Disney Plus+ Clone In Django.☆16Jun 6, 2021Updated 4 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- A fully featured banking API built with FastAPI,Docker,Celery,Redis,RabbitMQ with an AI/ML transaction analysis and fraud detection syste…☆21Sep 4, 2025Updated 8 months ago
- An end-to-end data engineering pipeline that orchestrates data ingestion, processing, and storage using Apache Airflow, Python, Apache Ka…☆323Feb 14, 2025Updated last year
- An image search application using multimodal embeddings and Azure AI search vector search.☆28Mar 27, 2026Updated last month
- This guide will demonstrate how to deploy a minimal Apache Kafka cluster on Docker and set up producers and consumers using Python. We wi…☆18Nov 15, 2020Updated 5 years ago
- springboot demo combined with scala and java☆11Dec 7, 2017Updated 8 years ago
- paraphase sentence☆11Aug 22, 2025Updated 8 months ago
- Making Desktop App with PyQt5 and matplotlib☆16Dec 19, 2019Updated 6 years ago
- Repo will try to cover all the most frequently used ML algos with proper explanation and examples☆10Apr 14, 2019Updated 7 years ago
- An example repository showing how to leverage Kafka to stream your data☆21May 11, 2024Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- An open source enterprise data warehousing and analysis platform.☆22Nov 8, 2021Updated 4 years ago
- ☆21Apr 13, 2026Updated 3 weeks ago
- End to End Pipeline using AWS Services such as s3, boto3, lambda, ECR, step functions, Dynamodb, Step Functions, etc☆23Jul 31, 2022Updated 3 years ago
- This repository will help you to learn about databricks concept with the help of examples. It will include all the important topics which…☆104Sep 26, 2025Updated 7 months ago
- ☆21Oct 21, 2024Updated last year
- Apache Spark 3 for Data Engineering and Analytics with Python , By Packt publishing☆24Jul 23, 2023Updated 2 years ago
- ELT Data Pipeline implementation in Data Warehousing environment☆30May 2, 2025Updated last year