Quocc1 / OpenStack
An end-to-end open-source data stack for crawling and visualizing real estate data, facilitating insights into market trends.
☆15Updated last year
Alternatives and similar repositories for OpenStack
Users who are interested in OpenStack are comparing it to the repositories listed below
- This project uses PySpark and Python to analyze a Google Play Store dataset. It covers data cleaning, duplicate removal, and visual analy…☆12Updated 3 years ago
- A Docker Compose template that builds an interactive development environment for PySpark with Jupyter Lab, MinIO as object storage, Hive M…☆47Updated last year
- This repo gives an introduction to setting up streaming analytics using open source technologies☆25Updated 2 years ago
- A portable Datamart and Business Intelligence suite built with Docker, Airflow, dbt, duckdb and Superset☆46Updated last month
- ☆12Updated 2 years ago
- Deploy a Streamlit app to train, evaluate and optimize a Prophet forecasting model visually.☆46Updated 2 years ago
- build dw with dbt☆50Updated last year
- ☆16Updated last year
- To provide a deeper understanding of how the modern, open-source data stack consisting of Iceberg, dbt, Trino, and Hive operates within a…☆44Updated last year
- Notebooks for exploring prediction markets (e.g. Kalshi, Polymarket, ForecastTrader)☆25Updated last year
- Delta-Lake, ETL, Spark, Airflow☆48Updated 3 years ago
- Building a Data Lakehouse with open-source technology. Supports an end-to-end data pipeline, from source data on AWS S3 to Lakehouse, visualize a…☆35Updated last month
- Sample Data Lakehouse deployed in Docker containers using Apache Iceberg, Minio, Trino and a Hive Metastore. Can be used for local testin…☆75Updated 2 years ago
- Data Engineering Project to Extract and Process Solana Reddit Data☆40Updated 2 years ago
- Simulation of job offers and CVs with real-time processing, classification, and analytics using Kafka, Ray, Spark, and Databricks. Includ…☆14Updated last year
- Building a Data Pipeline with an Open Source Stack☆55Updated 7 months ago
- Deploying a Machine Learning model streaming application with Apache Kafka☆11Updated 3 years ago
- Git Repo for EDW Best Practice Assets on the Lakehouse☆16Updated 2 years ago
- Example code for the dbt core Learn tutorial. The Astro dbt provider, also known as Cosmos, is a tool to automatically integrate dbt models …☆17Updated 11 months ago
- Cost Efficient Data Pipelines with DuckDB☆61Updated 8 months ago
- Dynamic batching for Document Layout and OCR, suitable for RAG, with extra tools.☆14Updated last year
- Fivetran data models for Facebook Ads built using dbt.☆44Updated 2 weeks ago
- ☆41Updated 3 years ago
- ☆21Updated last year
- trino + hive + minio with postgres in docker compose☆27Updated 2 years ago
- Simple HR Chatbot for Onepoint using Chainlit☆21Updated 2 years ago
- Code for dbt tutorial☆167Updated 4 months ago
- Streamlit application to explore Snowflake Tables☆49Updated 2 years ago
- Creating a modern data pipeline using a combination of Terraform, AWS Lambda and S3, Snowflake, DBT, Mage AI, and Dash.☆15Updated 2 years ago
- Apache Airflow advanced functionalities examples☆21Updated last year