anjalysam / HadoopLinks
This contain how to install Hadoop on google colab and how to run map-reduce in Hadoop
☆33Updated 5 years ago
Alternatives and similar repositories for Hadoop
Users that are interested in Hadoop are comparing it to the libraries listed below
Sorting:
- ☆29Updated 3 years ago
- Project for real-time anomaly detection using Kafka and python☆59Updated 3 years ago
- A master repository of all Data Science projects, concepts, tools and resources that I learn and write about on my blog.☆121Updated 5 months ago
- Classwork projects and home works done through Udacity data engineering nano degree☆75Updated 2 years ago
- Maternal Health Risk prediction MLOps pipeline☆46Updated 3 years ago
- Create a streaming data, transfer it to Kafka, modify it with PySpark, take it to ElasticSearch and MinIO☆65Updated 2 years ago
- Produce Kafka messages, consume them and upload into Cassandra, MongoDB.☆43Updated 2 years ago
- Machine Learning for Streaming Data with Python, published by Packt☆73Updated last month
- ☆25Updated 3 years ago
- Deploy ML models with FastAPI, Docker, and Heroku☆88Updated 3 years ago
- The practical use-cases of how to make your Machine Learning Pipelines robust and reliable using Apache Airflow.☆51Updated 3 years ago
- An end-to-end project on customer segmentation☆83Updated 3 years ago
- ☆63Updated 5 years ago
- Mastering Big Data Analytics with PySpark, Published by Packt☆165Updated last year
- ☆32Updated 4 years ago
- Complete PySpark Guide for the beginners... I prepared this notebook for my students.☆18Updated 6 years ago
- mlops-projects-course☆146Updated 2 years ago
- PySpark Tutorial for Beginners - Practical Examples in Jupyter Notebook with Spark version 3.4.1. The tutorial covers various topics like…☆140Updated 2 years ago
- The Machine Learning Solutions Architect Handbook, published by Packt☆147Updated last month
- ML Zoomcamp fall 2021 homework and stuff☆67Updated 3 years ago
- Different tutorials how to deploy Machine Learning models☆114Updated 3 years ago
- Databricks Certified Associate Spark Developer preparation toolkit to setup single node Standalone Spark Cluster along with material in t…☆30Updated last year
- Predict Customer Churn in Python☆46Updated 5 years ago
- Useful data science and Python code snippets at Data Science Simplified☆71Updated 4 years ago
- plan, design and implement enterprise data infrastructure solutions and create the blueprints for an organization’s data management syste…☆14Updated 2 years ago
- A Series of Notebooks on how to start with Kafka and Python☆151Updated 11 months ago
- Series follows learning from Apache Spark (PySpark) with quick tips and workaround for daily problems in hand☆56Updated 2 years ago
- NLP Cheat Sheet, Python, spacy, LexNPL, NLTK, tokenization, stemming, sentence detection, named entity recognition☆254Updated 2 years ago
- MLOps for deploying a Credit Risk model☆34Updated 2 years ago
- Salary Prediction Web App With Streamlit☆96Updated last year