☆21Mar 11, 2025Updated last year
Alternatives and similar repositories for hadoop-docker
Users that are interested in hadoop-docker are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆13Mar 24, 2023Updated 3 years ago
- ☆13Mar 5, 2023Updated 3 years ago
- This Jupyter Notebook was used to give a workshop about Machine Learning with Python in Brazil with the support of WoMakersCode Community…☆14Nov 24, 2018Updated 7 years ago
- A sample implementation of stream writes to an Iceberg table on GCS using Flink and reading it using Trino☆22May 30, 2022Updated 3 years ago
- ☆12Jul 27, 2021Updated 4 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- how to unit test your PySpark code☆29Mar 26, 2021Updated 5 years ago
- ☆17Apr 17, 2026Updated last month
- A starter kit for static websites.☆28Jul 22, 2018Updated 7 years ago
- ☆13Feb 18, 2022Updated 4 years ago
- Near real time ETL to populate a dashboard.☆75Sep 9, 2025Updated 8 months ago
- ETL processing toolset with SQL-like language and GIS capabilities, built on core Spark. Extensible and modular. REPL included☆16May 12, 2026Updated 2 weeks ago
- ☆15May 7, 2025Updated last year
- Testing Sandbox for Hadoop Ecosystem Components☆45Updated this week
- Материалы курса Airflow 101☆15Jun 15, 2020Updated 5 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Nuvem de palavras do Plano de Governo dos candidatos à Presidência da República 2018☆11Feb 10, 2023Updated 3 years ago
- Uma introdução a linguagem Go☆10Jan 11, 2024Updated 2 years ago
- Some example projects for Data Engineers to build, end-to-end.☆39Nov 8, 2023Updated 2 years ago
- ☆16Feb 12, 2025Updated last year
- Terraform repository to deploy a fully functioning Databricks environment on top of AWS. Deploys all Databricks and AWS resources.☆17Jul 5, 2024Updated last year
- Microservices Design☆13Jan 4, 2023Updated 3 years ago
- Sample ELT project using Dagster, data load tool and Snowflake☆43Jul 20, 2024Updated last year
- Material used for the introduction to quantum programming workshop prepared by QLatvia and edited by QTurkey☆36May 2, 2021Updated 5 years ago
- End-to-end data platform leveraging the Modern data stack☆52Apr 10, 2024Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆38Apr 28, 2025Updated last year
- Microservice that fetches Web Performance metrics☆15May 12, 2016Updated 10 years ago
- Analytics Engineer Course☆20May 17, 2023Updated 3 years ago
- A new Airflow Provider for Fivetran, maintained by Astronomer and Fivetran☆25Apr 17, 2026Updated last month
- ☆21Mar 26, 2023Updated 3 years ago
- How to evaluate the Quality of your Data with Great Expectations and Spark.☆32Mar 29, 2023Updated 3 years ago
- Creation of a data lakehouse and an ELT pipeline to enable the efficient analysis and use of data☆52Dec 2, 2023Updated 2 years ago
- LSTM model for Vietnamese Named Entity Recognition☆17Jul 26, 2017Updated 8 years ago
- Advent of code - 30 challenges for learning Dagster☆27Dec 19, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- End to end data engineering project☆59Oct 27, 2022Updated 3 years ago
- Code to demonstrate data engineering metadata & logging best practices☆21Mar 12, 2024Updated 2 years ago
- An End-to-End ETL data pipeline that leverages pyspark parallel processing to process about 25 million rows of data coming from a SaaS ap…☆25Dec 7, 2022Updated 3 years ago
- edaSQL is a python library to bridge the SQL with Exploratory Data Analysis where you can connect to the Database and insert the queries.…☆10Nov 14, 2021Updated 4 years ago
- ☆27Aug 30, 2024Updated last year
- Gradient boosting using categorical structure☆27May 13, 2025Updated last year
- Материалы марафона Готовим данные☆23Oct 14, 2021Updated 4 years ago