dnguyenngoc / real-time-analyticLinks

This repo gives an introduction to setting up streaming analytics using open source technologies

☆25

Alternatives and similar repositories for real-time-analytic

Users that are interested in real-time-analytic are comparing it to the libraries listed below

Sorting:

lelouvincx / goodreads-elt-pipeline
This project implements an ELT (Extract - Load - Transform) data pipeline with the goodreads dataset, using dagster (orchestration), spar…
☆35Updated 2 years ago
trannhatnguyen2 / NYC_Taxi_Data_Pipeline
Nyc_Taxi_Data_Pipeline - DE Project
☆110Updated 8 months ago
longNguyen010203 / Youtube-Recommend-Master-ETL-Pipeline
A Data Engineering Project that implements an ETL data pipeline using Dagster, Apache Spark, Streamlit, MinIO, Metabase, Dbt, Polars, Doc…
☆20Updated 7 months ago
longbuivan / dotfile
My Setup Development Environment as Data Engineer
☆27Updated 3 weeks ago
MLOpsVN / mlops-crash-course-code
☆46Updated 2 years ago
dain55788 / ELT-Data-Pipeline
ELT Data Pipeline implementation in Data Warehousing environment
☆26Updated last month
leehuwuj / olh
Open source stack lakehouse
☆25Updated last year
ysfesr / Building-Data-LakeHouse
Creation of a data lakehouse and an ELT pipeline to enable the efficient analysis and use of data
☆46Updated last year
DatacollectorVN / Airflow-Tutorial
My self-learning about Apache Airflow
☆32Updated 2 years ago
Dorianteffo / modern-data-platform
End-to-end data platform leveraging the Modern data stack
☆49Updated last year
haicheviet / fullstack-machine-learning-inference
Fullstack machine learning inference template
☆30Updated last year
airscholar / modern-data-eng-dbt-databricks-azure
In this project, we setup and end to end data engineering using Apache Spark, Azure Databricks, Data Build Tool (DBT) using Azure as our …
☆32Updated last year
Stefen-Taime / Iceberg-Dbt-Trino-Hive-modern-open-source-data-stack
To provide a deeper understanding of how the modern, open-source data stack consisting of Iceberg, dbt, Trino, and Hive operates within a…
☆35Updated last year
simardeep1792 / Data-Engineering-Streaming-Project
☆41Updated 11 months ago
dunghoang369 / feature-store
☆65Updated last year
HungNguyenDev1511 / Car-detection-serving-model
☆60Updated 10 months ago
Drissdo185 / Text-Summarization
☆28Updated last year
canhtran / dgscli
Data Guy Story commandline
☆12Updated 2 years ago
Full-Stack-Data-Science / real-time-ml-inference-with-spark-streaming-and-kafka
FSDS Webinar 1: Real-Time Machine Learning Inference with Spark Streaming and Kafka
☆10Updated 4 months ago
harrydevforlife / building-lakehouse
Building Data Lakehouse by open source technology. Support end to end data pipeline, from source data on AWS S3 to Lakehouse, visualize a…
☆30Updated last year
developershomes / SparkETL
Spark all the ETL Pipelines
☆32Updated last year
quan-dang / kubeflow-tutorials
☆42Updated last year
raashidsalih / churn-pipeline
A custom end-to-end analytics platform for customer churn
☆12Updated last month
tuanht12 / news-summarization-api
☆24Updated last year
dominikhei / Local-Data-LakeHouse
Sample Data Lakehouse deployed in Docker containers using Apache Iceberg, Minio, Trino and a Hive Metastore. Can be used for local testin…
☆72Updated last year
ToDucThanh / MLOPS-human-pose-estimation
MLOPs human pose estimation end-to-end.
☆35Updated last year
bmd1905 / Customer-Purchase-Prediction-ML-System
A turnkey MLOps pipeline demonstrating how to go from raw events to real-time predictions at scale.
☆191Updated 5 months ago
HungNguyenDev1511 / Capstone-Project-Data-Engineer
☆29Updated last year
ongxuanhong / de03-trino-dbt-spark-everything-everywhere-all-at-once
☆16Updated last year
DucLong06 / face-detection-ml-system
This project demonstrates a production-grade MLOps pipeline that deploys a YOLOv11-based face detection service on Google Kubernetes Engi…
☆34Updated 2 weeks ago