Proof of concept of a big data cluster using open source tools
☆11Apr 10, 2024Updated last year
Alternatives and similar repositories for open-source-datalake
Users that are interested in open-source-datalake are comparing it to the libraries listed below
Sorting:
- ☆12Sep 29, 2021Updated 4 years ago
- A Node App that downloads builds from Unity Cloud Build and uploads them to Steam.☆18Jan 6, 2024Updated 2 years ago
- A noise function, that can be replaced with RNG, Squirrel Eiserloh introduced at GDC17☆19Mar 1, 2017Updated 9 years ago
- Calculadora feita com Tkinter☆14Sep 30, 2020Updated 5 years ago
- Playground for Lakehouse (Iceberg, Hudi, Spark, Flink, Trino, DBT, Airflow, Kafka, Debezium CDC)☆66Sep 23, 2023Updated 2 years ago
- 🤖 An autonomous AI agent system that collaboratively designs, implements, and manages Apache Airflow DAGs through natural language inter…☆28Aug 6, 2025Updated 7 months ago
- Send logs to Telegram chat via Telegram bot in your Laravel application☆12Dec 13, 2023Updated 2 years ago
- Modernize seu Data Warehouse☆15Nov 12, 2024Updated last year
- ☆25Mar 15, 2024Updated 2 years ago
- ☆20Jan 19, 2024Updated 2 years ago
- Spending One Hundred days on blogging about cloud computing☆14Jul 12, 2022Updated 3 years ago
- Repositório dedicado a Workshop de Data Lakehouse com Delta Lake☆17Dec 6, 2021Updated 4 years ago
- Script para ingestão de dados do Mercado Bitcoin☆11Jun 29, 2023Updated 2 years ago
- ☆11Oct 1, 2025Updated 5 months ago
- ☆19May 25, 2025Updated 9 months ago
- Capstone Project: Predicting default in P2P lending☆12Feb 27, 2017Updated 9 years ago
- Builds a jenkins docker file that can run docker builds☆26Mar 30, 2020Updated 5 years ago
- Airflow Examples: code samples for Medium articles☆14Jan 10, 2021Updated 5 years ago
- Advanced Raidfinder for Granblue Fantasy. 【グランブルファンタジー】のTwitter救援をまとめ☆16Apr 29, 2023Updated 2 years ago
- An extension of the fluent validation with a set of Brazilian validations☆34Sep 10, 2022Updated 3 years ago
- Repository of Docker builds for Oracle databases.☆18Jul 3, 2023Updated 2 years ago
- Projeto Stack de dados OSS☆12Apr 8, 2025Updated 11 months ago
- Plug regular expression models into OCR string results of document pictures to extract structured data!☆23Nov 18, 2021Updated 4 years ago
- Utilizando o GitHub para expor seus projetos de Data Science - Materiais☆17Apr 27, 2021Updated 4 years ago
- Repositório de scripts de cursos da Abraji no 14º Congresso da Abraji☆14Dec 18, 2019Updated 6 years ago
- Spark development environment for kubernetes, spark-submit and jupyter notebook☆19Nov 30, 2021Updated 4 years ago
- ☆13Jun 27, 2023Updated 2 years ago
- An ETL Orchestration using Apache Airflow to extract CSV files from a Google Drive, validate, transform, and load into a PostgreSQL datab…☆26Jun 30, 2024Updated last year
- ☆10Feb 22, 2022Updated 4 years ago
- Código para workshops Spark com ambiente de desenvolvimento em docker☆28Oct 1, 2021Updated 4 years ago
- An open synthetic population of Sao Paulo Metropolitan region for agent-based transport simulation☆16Jul 6, 2023Updated 2 years ago
- Configura containers do Spark (Master, Workers e History Server) + Jupyter☆21Jun 17, 2024Updated last year
- Personal roadmap to guide my studies.☆81May 7, 2022Updated 3 years ago
- The lazy way to run multi-statement Neo4j Cypher scripts from the web☆11Nov 21, 2016Updated 9 years ago
- ☆14Nov 5, 2025Updated 4 months ago
- Email Analysis Tool based on Hadoop☆20Apr 26, 2021Updated 4 years ago
- ☆11Jan 25, 2017Updated 9 years ago
- Simple, powerful & fast utility to migrate Oracle to Postgresql☆18May 3, 2024Updated last year
- PHP Package for Autentique API-v2 | Ref: https://docs.autentique.com.br/api/☆43Nov 29, 2024Updated last year