Quocc1 / OpenStackLinks
An end-to-end open-source data stack for crawling and visualizing real estate data, facilitating insights into market trends.
☆15Updated last year
Alternatives and similar repositories for OpenStack
Users that are interested in OpenStack are comparing it to the libraries listed below
Sorting:
- Translation tests done with Helsinki-NLP☆15Updated 2 years ago
- This project uses PySpark and Python to analyze a Google Play Store dataset. It covers data cleaning, duplicate removal, and visual analy…☆12Updated 3 years ago
- A Docker Compose template that builds a interactive development environment for PySpark with Jupyter Lab, MinIO as object storage, Hive M…☆47Updated last year
- trino + hive + minio with postgres in docker compose☆27Updated 2 years ago
- This project aims at giving the best customer service ever using the power of LLM models like GPT.☆10Updated 2 years ago
- Data Engineering Project to Extract and Process Solana Reddit Data☆39Updated last year
- Simple HR Chatbot for Onepoint using Chainlit☆21Updated 2 years ago
- To provide a deeper understanding of how the modern, open-source data stack consisting of Iceberg, dbt, Trino, and Hive operates within a…☆44Updated last year
- A simple Data Engineering solution for testing or education purposes. You only need to know SQL and Python to understand this project. Da…☆28Updated 3 years ago
- A portable Datamart and Business Intelligence suite built with Docker, Airflow, dbt, duckdb and Superset☆46Updated 3 weeks ago
- Cost Efficient Data Pipelines with DuckDB☆60Updated 7 months ago
- F1 Data Pipeline☆24Updated 2 years ago
- Deploy a Streamlit app to train, evaluate and optimize a Prophet forecasting model visually.☆45Updated last year
- Live stream tweets based on keywords to database using SQLAlchemy. Tweets are assigned a sentiment score and data is presented via stream…☆43Updated 5 years ago
- This repo gives an introduction to setting up streaming analytics using open source technologies☆25Updated 2 years ago
- build dw with dbt☆51Updated last year
- A template app to build & deploy PandasAI app to make your csv files conversational☆18Updated 2 years ago
- ☆12Updated 2 years ago
- Building a Data Pipeline with an Open Source Stack☆55Updated 6 months ago
- Documentation for Ploomber Cloud☆37Updated 3 months ago
- Streamlit Dashboard over Superstore Data stored in Postgres Docker container. With SQLAlchemy + Plotly Express☆13Updated last year
- A real-time reddit data streaming pipeline for sentiment analysis of various subreddits☆141Updated 2 years ago
- New generation opensource data stack☆76Updated 3 years ago
- Production-ready Chainlit RAG application with Pinecone pipeline offering all Groq and OpenAI Models, to chat with your documents.☆11Updated 4 months ago
- ⚙️ Airflow data pipeline with Terraform, GCP BigQuery, dbt, Soda and Looker Studio.☆22Updated 2 years ago
- ☆31Updated 11 months ago
- AI leetcode interviewer that assesses tech applicants. Built on Langchain and OpenAI APIs. Recruiter-focused and tracks progress and subm…☆15Updated 2 years ago
- End-to-end data pipeline that ingests, processes, and stores data. It uses Apache Airflow to schedule scripts that fetch data from an API…☆20Updated last year
- Deploy a complete data stack in just a couple of minutes.☆15Updated last year
- Data Agents are intelligent assistants built by data engineers to help non-data professionals navigate the organization’s data infrastruc…☆19Updated 8 months ago