anjalysam / Hadoop
This contain how to install Hadoop on google colab and how to run map-reduce in Hadoop
☆33Updated 4 years ago
Alternatives and similar repositories for Hadoop:
Users that are interested in Hadoop are comparing it to the libraries listed below
- Built a Data Pipeline for a Retail store using AWS services that collects data from its transactional database (OLTP) in Snowflake and tr…☆9Updated last year
- Resources and projects from Udacity Data Engineering with AWS nano degree programme☆25Updated 2 years ago
- ☆34Updated last year
- Classwork projects and home works done through Udacity data engineering nano degree☆74Updated last year
- PyCaret deployment on Google Cloud Platform☆54Updated 2 years ago
- Machine Learning for Streaming Data with Python, published by Packt☆70Updated last year
- ☆25Updated 2 years ago
- ☆21Updated last year
- Comet for Data Science, published by Packt☆42Updated last year
- This repo contains all the material developed during the 9-week bootcamp provided by DPhi in colaboration with DataTalks Club☆21Updated 2 years ago
- Portofolio repository for Udacity Data Scientist Nanodegree☆41Updated 4 years ago
- This project serves as a comprehensive guide to building an end-to-end data engineering pipeline using TCP/IP Socket, Apache Spark, OpenA…☆34Updated last year
- Essential PySpark for Scalable Data Analytics, published by Packt☆44Updated 2 years ago
- MLflow related work☆38Updated last year
- Final Project of the MLOps Zoomcamp hosted by DataTalksClub.☆26Updated 2 years ago
- Mastering Big Data Analytics with PySpark, Published by Packt☆158Updated 8 months ago
- Course Material - Data Science Program☆14Updated last year
- A repo for all the relevant code notebooks and datasets used in my Machine Learning tutorial videos on YouTube☆72Updated 3 years ago
- Built a movie recommender system with Streamlit and deploy in Heroku Platform.☆9Updated 3 years ago
- This repo is meant to make it really easy to analyze the interplays between health and social media use.☆43Updated 2 years ago
- IBM Data Engineering Courses from Coursera☆71Updated 2 years ago
- Data Cleaning and Exploration with Machine Learning☆53Updated 2 years ago
- ☆25Updated last month
- Step by step instructions to create a production-ready data pipeline☆45Updated 4 months ago
- A Series of Notebooks on how to start with Kafka and Python☆154Updated last month
- This repo includes all exercises for courses and projects that I have finished on datacamp.☆15Updated last year
- Machine Learning Engineering on AWS, published by Packt☆67Updated last year
- Here I will be exploring various tools and methods that are used in data engineering process with Python.☆22Updated 4 years ago
- Project for real-time anomaly detection using Kafka and python☆58Updated 2 years ago
- Analysis of 311 Service Requests for the City of NYC (from 2010 to 2023) Tech: Prefect cloud, dbt core, BigQuery, Compute Engine, CloudRu…☆20Updated 2 years ago