☆21Mar 11, 2025Updated 11 months ago
Alternatives and similar repositories for hadoop-docker
Users that are interested in hadoop-docker are comparing it to the libraries listed below
Sorting:
- ☆14Sep 14, 2021Updated 4 years ago
- ☆12Jul 27, 2021Updated 4 years ago
- ☆10Dec 23, 2023Updated 2 years ago
- Project is in active development and has been moved to https://repository.datamart.ru/datamarts/prostore.☆17Apr 22, 2022Updated 3 years ago
- Uma introdução a linguagem Go☆10Jan 11, 2024Updated 2 years ago
- Module for pipelines concept in PySpark☆16Mar 27, 2024Updated last year
- ☆13Mar 5, 2023Updated 3 years ago
- Nuvem de palavras do Plano de Governo dos candidatos à Presidência da República 2018☆11Feb 10, 2023Updated 3 years ago
- Terraform repository to deploy a fully functioning Databricks environment on top of AWS. Deploys all Databricks and AWS resources.☆16Jul 5, 2024Updated last year
- FSDS Webinar 1: Real-Time Machine Learning Inference with Spark Streaming and Kafka☆11Feb 17, 2025Updated last year
- OpenVPN PKI tools and client/server configuration generator - better than easy-rsa☆15May 16, 2025Updated 9 months ago
- Custom provider to handle Kibana API☆12Oct 2, 2023Updated 2 years ago
- Repository for in class material for Data Bootcamp☆13May 18, 2019Updated 6 years ago
- Planning Utility for Ketogenic Diets☆15Jul 7, 2012Updated 13 years ago
- ☆13Feb 18, 2022Updated 4 years ago
- Microservice that fetches Web Performance metrics☆15May 12, 2016Updated 9 years ago
- ☆17Nov 7, 2024Updated last year
- Arquivos das aulas de estatística☆12Feb 3, 2020Updated 6 years ago
- Functional Data Engineering tutorial in Python & Airflow.☆17Mar 24, 2023Updated 2 years ago
- Computing some financial measures and visualising them in Pandas☆15Sep 7, 2018Updated 7 years ago
- A place of peace and beauty where knowledge and stories are preserved.☆28Feb 27, 2026Updated last week
- A sample implementation of stream writes to an Iceberg table on GCS using Flink and reading it using Trino☆22May 30, 2022Updated 3 years ago
- 🇧🇷 Curso de introdução a Python☆15Aug 27, 2023Updated 2 years ago
- BRA responsive é um framework front-end para criação de projetos web responsivos e mobile-first.☆17Jan 21, 2017Updated 9 years ago
- Near real time ETL to populate a dashboard.☆73Sep 9, 2025Updated 6 months ago
- A new Airflow Provider for Fivetran, maintained by Astronomer and Fivetran☆23Updated this week
- Hybrid Recommender System☆23Dec 15, 2021Updated 4 years ago
- 🐋 Docker image for AWS Glue Spark/Python☆23Sep 5, 2023Updated 2 years ago
- Welcome to my data engineering projects repository! Here you will find a collection of data engineering projects that I have worked on.☆24Apr 27, 2023Updated 2 years ago
- Toolkit for Agile-driven data modeling and data loading using highly Normalized hybrid Model☆23Dec 24, 2024Updated last year
- A series of settings and useful scripts☆22Feb 4, 2026Updated last month
- ☆24Aug 8, 2021Updated 4 years ago
- ☆21Mar 26, 2023Updated 2 years ago
- Gradient boosting using categorical structure☆27May 13, 2025Updated 9 months ago
- An End-to-End ETL data pipeline that leverages pyspark parallel processing to process about 25 million rows of data coming from a SaaS ap…☆25Dec 7, 2022Updated 3 years ago
- How to use Presto (with Hive metastore) and MinIO?☆27Mar 8, 2023Updated 3 years ago
- ☆28May 23, 2024Updated last year
- ☆26Sep 28, 2023Updated 2 years ago
- Evaluate expressions in dict/json objects☆25Apr 26, 2021Updated 4 years ago