☆14Mar 11, 2023Updated 2 years ago
Alternatives and similar repositories for de02-pyspark-optimization
Users that are interested in de02-pyspark-optimization are comparing it to the libraries listed below
Sorting:
- ☆16Feb 17, 2026Updated 2 weeks ago
- Repository to host micro service implementation patterns.☆13Jun 25, 2025Updated 8 months ago
- ☆12Jul 27, 2021Updated 4 years ago
- This solution provides the AWS CDK and AWS CloudFormation infrastructure to build an enterprise data mesh with Amazon DataZone.☆10May 7, 2025Updated 10 months ago
- ☆20Jun 29, 2022Updated 3 years ago
- ☆18Apr 20, 2025Updated 10 months ago
- ☆10May 5, 2022Updated 3 years ago
- ☆10Mar 30, 2024Updated last year
- Kafka education materials☆46Aug 22, 2025Updated 6 months ago
- Code for the paper "Active learning for medical image segmentation with stochastic batches", published at Medical Image Analysis (2023).☆10Nov 14, 2024Updated last year
- Project is in active development and has been moved to https://repository.datamart.ru/datamarts/prostore.☆17Apr 22, 2022Updated 3 years ago
- Creates fields to augment item indexes with behavioral indicators to personalize search☆14Jun 4, 2016Updated 9 years ago
- ☆11Feb 3, 2025Updated last year
- ☆17Jan 12, 2026Updated last month
- ☆15May 7, 2025Updated 10 months ago
- Query, analysis, and visualization of large video collections☆10Dec 9, 2022Updated 3 years ago
- Toy Hadoop cluster combining various SQL-on-Hadoop variants☆13Nov 16, 2017Updated 8 years ago
- ☆10Jan 18, 2024Updated 2 years ago
- This is an example of using MongoDB as both a source and sink.☆10May 21, 2020Updated 5 years ago
- 🧠 Interactive LeetCode tracker with spaced repetition system. Track coding problem progress, schedule reviews, and retain knowledge long…☆30Feb 6, 2026Updated last month
- This repository is a Challenge for the DevOps Community to get stronger in DevOps. This challenge starts on the 1st January 2023 and in …☆11Jan 4, 2024Updated 2 years ago
- Bot for scraping OKCupid user data☆12Feb 27, 2024Updated 2 years ago
- This project provides an end-to-end data processing and visualization of visa numbers in Japan using PySpark and Plotly. The spark cluste…☆12Oct 11, 2023Updated 2 years ago
- A repository containing scripts that deploy open source data tools (Airflow, Airbyte, Grafana) to Google Kubernetes Engine☆14May 31, 2024Updated last year
- Module for pipelines concept in PySpark☆16Mar 27, 2024Updated last year
- chef cookbook to install Apache Spark☆10Jul 17, 2015Updated 10 years ago
- My solution to FashionAI Key Points Detection of Apparel.☆13May 31, 2018Updated 7 years ago
- List item with awesome hover effect☆12Jul 9, 2020Updated 5 years ago
- Example application demonstrating how to integrate all of the components of Hortonworks DataFlow.☆14Jul 10, 2017Updated 8 years ago
- LoL Esports Voice Analytics Capstone Project☆13Aug 18, 2025Updated 6 months ago
- ☆13Feb 20, 2025Updated last year
- Part of the IOP Stack™ Morpheus is a toolset to have gatekeeper-free identity management and verifiable claims as a 2nd layer on top of a…☆14Feb 12, 2026Updated 3 weeks ago
- An ergonomic, opinionated memory interface for AI agents☆39Dec 18, 2025Updated 2 months ago
- Automating web browser using Selenium/ Python.☆12Apr 11, 2024Updated last year
- Generate files for dictd server [Python] (unmaintained)☆12Apr 21, 2010Updated 15 years ago
- Minimal example of how to use FastAPI and Supabase Auth. Draws reference from: https://testdriven.io/blog/fastapi-jwt-auth/ and https://w…☆15Dec 12, 2022Updated 3 years ago
- Airflow 2.0 configuration with Celery Executor based on Docker containers with Postgres and Redis broker plus Flower and Webserver UI☆15May 13, 2021Updated 4 years ago
- Airbyte deployment and configuration management tool☆12Feb 5, 2022Updated 4 years ago
- Examples on the use of covid19db database☆12Jan 25, 2021Updated 5 years ago