vuthanhhai2302 / Apply-machine-learning-on-data-analytics
My applied machine learning project for data analytics, using pandas, NumPy, and scikit-learn to analyze data
☆10 Updated 2 years ago
Alternatives and similar repositories for Apply-machine-learning-on-data-analytics
Users that are interested in Apply-machine-learning-on-data-analytics are comparing it to the libraries listed below
- My applied big data analytic project with pyspark. ☆10 Updated 2 years ago
- Data Engineering project to analyse my streams 💪 ☆12 Updated last year
- Multi-container environment with Hadoop, Spark and Hive ☆215 Updated last month
- ☆7 Updated 2 years ago
- More than 2000+ Data engineer interview questions. ☆1,357 Updated 5 months ago
- Sample code and documentation for very basic things that I can't remember but want to aggregate in one place ☆14 Updated 3 years ago
- A data engineering project with Kafka, Spark Streaming, dbt, Docker, Airflow, Terraform, GCP and much more! ☆716 Updated 3 years ago
- Personal Data Engineering Projects ☆942 Updated 2 years ago
- Few projects related to Data Engineering including Data Modeling, Infrastructure setup on cloud, Data Warehousing and Data Lake developme… ☆1,661 Updated 2 years ago
- The Data Pipeline and Analytics Stack is a comprehensive solution designed for processing, storing, and visualizing data. Explore a compl… ☆16 Updated last year
- The resources of the preparation course for Databricks Data Engineer Associate certification exam ☆429 Updated this week
- End-to-end Data Project (DA/DS/DE/MLOps) - retail/e-commerce - interpretable dynamic clustering ☆14 Updated this week
- Apache Hadoop docker image ☆2,265 Updated last year
- PySpark Cheat Sheet - example code to help you learn PySpark and develop apps faster ☆467 Updated 8 months ago
- Data Engineering Project: Extracting music video metrics of Twice using YouTube API, AWS, and Tableau ☆27 Updated last year
- Beginner data engineering project - batch edition ☆528 Updated 5 months ago
- ☆76 Updated 5 months ago
- ☆353 Updated 5 months ago
- An end-to-end data engineering pipeline that orchestrates data ingestion, processing, and storage using Apache Airflow, Python, Apache Ka… ☆257 Updated 4 months ago
- Stream processing pipeline from Finnhub websocket using Spark, Kafka, Kubernetes and more ☆349 Updated last year
- StreamSoft enables real-time analysis of any stock market ☆14 Updated last year
- Main repository to collect notes and scripts written during DataExpert.IO January 2025 bootcamp to help anyone interested. ☆24 Updated 2 months ago
- My Insight Data Engineering Fellowship project. I implemented a big data processing pipeline based on lambda architecture, that aggrega… ☆506 Updated 2 years ago
- My study materials for Snowflake SnowPro Core Certification ☆11 Updated last year
- This is a repo with links to everything you'd ever want to learn about data engineering ☆621 Updated 6 months ago
- Ultimate guide for mastering Spark Performance Tuning and Optimization concepts and for preparing for Data Engineering interviews ☆148 Updated last year
- ☆151 Updated 3 years ago
- ☆51 Updated last year
- ☆47 Updated last year
- Data Engineering YouTube Analysis Project by Darshil Parmar ☆195 Updated last year