Este é um projeto de exemplo que demonstra um processo de ETL (Extração, Transformação e Carga) de dados usando Python, Polars e AWS LocalStack. Ele foi projetado para extrair informações de um artista musical do Spotify, transformar esses dados em diferentes formatos e carregá-los em um "datalake" local usando o LocalStack.
☆15Sep 25, 2023Updated 2 years ago
Alternatives and similar repositories for datalake-format-explorer
Users that are interested in datalake-format-explorer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Repositório central do segundo Workshop☆16Nov 15, 2023Updated 2 years ago
- ETL e visualização do Censo escolar☆10May 3, 2023Updated 2 years ago
- [ARCHIVED] Historical labs-postgresql project - no longer maintained☆23Jan 5, 2026Updated 2 months ago
- A repository to store example files and projects for my YouTube series **Docker Development Tips & Tricks**☆13Dec 1, 2021Updated 4 years ago
- openweb UI scripts☆12Jan 27, 2026Updated last month
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- "What the teacher is, is more important than what he teaches."― Karl Menninger☆16Sep 10, 2021Updated 4 years ago
- Portfólio de Data Science☆51Nov 24, 2024Updated last year
- Telegram bot with some functions☆11May 12, 2020Updated 5 years ago
- Book - Structure of Data Algorithms in JavaScript☆10Jan 30, 2024Updated 2 years ago
- ☆10Apr 10, 2025Updated 11 months ago
- ☆16Apr 1, 2025Updated 11 months ago
- Repository for mne docker images☆13Sep 1, 2025Updated 6 months ago
- Oficina Python Fluente oferecida no Garoa Hacker Clube a partir de 30/jul/2024☆10Aug 30, 2024Updated last year
- Evaluates Twitter accounts to determine whether they're bots☆16May 20, 2019Updated 6 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- This Python package provides the high-level API by which you can interact with the MQ Light runtime.☆14Apr 29, 2021Updated 4 years ago
- workshop 03 - como montar um dw pagando pouco☆35Dec 18, 2023Updated 2 years ago
- This is a mini project on FastAPI as a backend API and Jinja2 Template with Bootstrap5 and MongoDB as the Database.☆17Jan 2, 2022Updated 4 years ago
- An ETL Pipeline built over GCP and orchestrated by Mage, which involves Extracting Data from GCS Bucket, building Dimensional Model (Star…☆13Aug 26, 2023Updated 2 years ago
- Study project☆15May 5, 2020Updated 5 years ago
- Docker compose and Google Colab demo to build a CDC with Delta Lake☆15Sep 7, 2022Updated 3 years ago
- Passo a passo para instalar no linux/wsl ubuntu☆10Jan 13, 2024Updated 2 years ago
- Repositório da imersao Databricks☆203Oct 15, 2025Updated 5 months ago
- Codebase for the backend of VUTTR (Very Useful Tools to Remember)☆12Jan 24, 2023Updated 3 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Advanced Cleaning Techniques with Python☆18Aug 18, 2019Updated 6 years ago
- aprendendo a usar o git do basico ao avançado.☆12Nov 3, 2019Updated 6 years ago
- ☆13Jul 10, 2023Updated 2 years ago
- Simple MLP in Python using Numpy☆10Nov 17, 2018Updated 7 years ago
- 🎵 Pesquisa uma palavra ou frase dentro de todas as musicas de um artista.☆20Jul 31, 2023Updated 2 years ago
- Realistic OLTP data simulator for CDC testing with Debezium☆17Nov 5, 2025Updated 4 months ago
- Google Developers Student Club - Data Science Bootcamp 2022☆11May 18, 2022Updated 3 years ago
- Configura containers do Spark (Master, Workers e History Server) + Jupyter☆21Jun 17, 2024Updated last year
- RAG-based Chatbot that helps answer questions around healthy eating & lifestyle choices, based on 1200+ science-backed blog posts of Nutr…☆13Sep 15, 2025Updated 6 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Repositório do Curso Online Python Fundamentos☆20Aug 26, 2016Updated 9 years ago
- universal-datalakehouse-postgres-ingestion-deltastreamer☆11Apr 7, 2024Updated last year
- This project demonstrates an end-to-end data pipeline, integrating cloud storage, data processing, and real-time visualization. It serve…☆14Dec 2, 2024Updated last year
- ☆24Mar 31, 2025Updated 11 months ago
- New web apps for Operaton☆20Mar 11, 2026Updated 2 weeks ago
- Tutorial do processo de ETL usando python e suas bibliotecas☆17Oct 18, 2022Updated 3 years ago
- O Excel Structure Validator é um projeto Python destinado a validar a estrutura de arquivos Excel.☆13Oct 25, 2023Updated 2 years ago