arun-kambhammettu / Python_Hadoop_BigData
A project using Airline data, to check aircraft details and Business class passenger details on various questions using Hadoop, Python functions, HIVE, PIG scripts, HQL.
☆9Updated 8 years ago
Alternatives and similar repositories for Python_Hadoop_BigData:
Users that are interested in Python_Hadoop_BigData are comparing it to the libraries listed below
- Analytics projects using Big Data eco-systems (Hadoop, Spark, Storm)☆16Updated 3 years ago
- This repository hosts the code/projects/demos/slides for Big Data technologies under Apache Hadoop and Apache Spark umbrella.☆42Updated 2 years ago
- Big Data Real Time Projects☆22Updated 7 years ago
- Apache Spark using SQL☆14Updated 3 years ago
- This is a guided certification project, as a part of Data Science for Social Good initiative☆17Updated 4 years ago
- Big data projects implemented by Maniram yadav☆52Updated 6 years ago
- All my projects on Big Data are provided☆27Updated 8 years ago
- Zomato Restaurants Exploratory Data Analysis, Visualization and Prediction with Sentiment Analysis of Reviews and Recommendation System☆71Updated 4 years ago
- A big data project to apply Hadoop map- reduce to derive some statistics from IMDB movie data.☆26Updated 10 years ago
- ☆10Updated 2 years ago
- Hadoop Project☆12Updated 7 years ago
- This repository focuses on gathering and making a curated list resources to learn Hadoop for FREE.☆53Updated 6 years ago
- plan, design and implement enterprise data infrastructure solutions and create the blueprints for an organization’s data management syste…☆11Updated last year
- Implementation of Spark code in Jupyter notebook. Topics include: RDDs and DataFrame, exploratory data analysis (EDA), handling multiple …☆29Updated 4 years ago
- Open Machine learning Projects☆98Updated last year
- Spark implementation of Slowly Changing Dimension type 2☆11Updated 6 years ago
- Course on Udemy by Jose Portilla☆97Updated 7 years ago
- Counting Tweets Per User in Real-Time☆41Updated 7 years ago
- ☆19Updated 5 years ago
- ☆19Updated 6 years ago
- Portofolio repository for Udacity Data Scientist Nanodegree☆41Updated 4 years ago
- This repository contains the hackerrank statistics challenge code☆51Updated 4 years ago
- UCSD Big Data Specialization General Materials and my Capstone Project.☆21Updated 6 years ago
- This repo is mostly created for pyspark and hive related interview questions.☆47Updated 3 years ago
- A repository for multiple end to end small machine learning and deep learning projects from scratch to production☆22Updated 2 years ago
- Hadoop tutorial Files. For detailed Tutorials visit www.youtube.com/learningjournalin☆26Updated 7 years ago
- This is a flask based app to scrap user reviews and comments from a retail website and generates word-cloud with CSV data available to do…☆29Updated 2 years ago
- Apache Spark Interview Question and Answers☆21Updated 4 years ago
- Personal project where I perform some analytics (including Sentiment Analysis) over a Twitter Stream using Big Data Technologies of the H…☆21Updated last year
- ☆19Updated 2 years ago