hicder / muopdb
MuopDB - A Vector Database
☆37Updated this week
Related projects ⓘ
Alternatives and complementary repositories for muopdb
- Vietnam stock price crawling☆18Updated last year
- Open source stack lakehouse☆25Updated 8 months ago
- How to build an awesome data engineering team☆99Updated 5 years ago
- 📚 Tech blogs & talks by companies that run Apache Flink in production☆156Updated this week
- Nyc_Taxi_Data_Pipeline - DE Project☆83Updated 3 weeks ago
- End-to-end data platform: A PoC Data Platform project utilizing modern data stack (Spark, Airflow, DBT, Trino, Lightdash, Hive metastore,…☆15Updated 3 weeks ago
- Code snippets for Data Engineering Design Patterns book☆38Updated this week
- "Nature's economy shall be the base for our own, for it is immutable, but ours is secondary. An economist without knowledge of nature is …☆18Updated 3 years ago
- This is dotfile, to Setup Development Environment as Data Engineer☆18Updated 3 months ago
- Data pipeline project☆23Updated last year
- ☆43Updated 3 months ago
- Amazon EMR Serverless and Amazon MSK Serverless Demo☆13Updated 2 years ago
- ( These solutions tested on 4 node Hortonwork cluster on my laptop. Do not test on your production environment until you test... :)☆20Updated 4 years ago
- Data Structures and Algorithms☆17Updated this week
- Automatic data extraction from SAP by using Python☆9Updated 4 years ago
- A custom end-to-end data pipeline for customer churn☆9Updated last week
- Code for Apache Hudi, Apache Iceberg and Delta Lake analysis☆9Updated 9 months ago
- Lưu trữ và xử lý dữ liệu lớn☆8Updated 3 years ago
- A repository of blogs/videos that presents how Apache Iceberg is being used in Production by various orgs☆13Updated last year
- RedditR for Content Engagement and Recommendation☆21Updated 6 years ago
- data engineering 100 days 🤖 🧲 🦾 | #DE☆37Updated last year
- Repo for everything open table formats (Iceberg, Hudi, Delta Lake) and the overall Lakehouse architecture☆28Updated last week
- ☆13Updated 5 years ago
- Journey to Become a DevOps☆15Updated 3 months ago
- A full big data pipeline (Lambda Architecture) with Spark, Kafka, HDFS and Cassandra.☆172Updated last year
- Simple stream processing pipeline☆91Updated 4 months ago
- Data engineering interviews Q&A for data community by data community☆61Updated 4 years ago
- 💜🌈📊 A Data Engineering Project that implements an ETL data pipeline using Dagster, Apache Spark, Streamlit, MinIO, Metabase, Dbt, Pola…☆17Updated 4 months ago
- ☆24Updated 3 months ago
- Udacity's Product Manager course☆17Updated 4 years ago
- Airflow training for the crunch conf☆105Updated 6 years ago