anjalysam / Hadoop
This repository shows how to install Hadoop on Google Colab and how to run MapReduce in Hadoop
☆33Updated 4 years ago
Alternatives and similar repositories for Hadoop
Users who are interested in Hadoop are comparing it to the repositories listed below
- Build an interactive web app with streamlit and scikit-learn☆116Updated last year
- ☆29Updated 2 years ago
- This is the code repository for my upcoming session. Details will be updated after the session☆40Updated 2 years ago
- An end-to-end project on customer segmentation☆81Updated 2 years ago
- A starter notebook for the Kitchenware classification competition on Kaggle☆15Updated 2 years ago
- I am using a Confluent Kafka cluster to produce and consume scraped data. In this project, I've created a real-time data pipeline that uti…☆29Updated 2 years ago
- ☆34Updated last year
- The Ultimate Hands-On Hadoop - Tame your Big Data!: https://www.udemy.com/the-ultimate-hands-on-hadoop-tame-your-big-data/☆8Updated 6 years ago
- Classwork projects and homework done through the Udacity Data Engineering Nanodegree☆74Updated last year
- PyCaret deployment on Google Cloud Platform☆54Updated 2 years ago
- Notebook to walk through Bayesian testing with Kaggle data☆39Updated 3 years ago
- ☆101Updated 2 years ago
- Data Science Portfolio of Tatev Karen Aslanyan including Case Studies and Research Projects that I have completed that solve business pro…☆63Updated last year
- A pipeline to detect data drift and retrain the model when there is drift☆23Updated last year
- This repository contains the code for a real-time election voting system. The system is built using Python, Kafka, Spark Streaming, Postgr…☆37Updated last year
- ☆13Updated 4 years ago
- Produce Kafka messages, consume them and upload into Cassandra, MongoDB.☆42Updated last year
- Homework and notes for the DataTalks.Club MLOps Zoomcamp☆10Updated 2 years ago
- The getting started notebook for the DTC Zoomcamp Q&A challenge☆29Updated last year
- ☆25Updated 3 years ago
- ☆68Updated last year
- The Data Pipeline and Analytics Stack is a comprehensive solution designed for processing, storing, and visualizing data. Explore a compl…☆13Updated last year
- Repository for the book Simplifying Machine Learning with PyCaret.☆66Updated 2 years ago
- This repo contains data science code snippets☆82Updated 7 months ago
- Code for my data science articles☆46Updated 2 years ago
- Repository for Data Engineering Interview Series☆31Updated 7 months ago
- ☆34Updated 2 years ago
- Machine Learning Model Serving Patterns and Best Practices☆35Updated last year
- Turning Salesforce lead, opportunity, and sales activity data into sales predictions using pandas, scikit-learn, SQLAlchemy, Redshift, XGBoost Cl…☆27Updated 4 years ago
- Useful data science and Python code snippets at Data Science Simplified☆70Updated 3 years ago