arun-kambhammettu / Python_Hadoop_BigData
A project using Airline data, to check aircraft details and Business class passenger details on various questions using Hadoop, Python functions, HIVE, PIG scripts, HQL.
☆9Updated 9 years ago
Alternatives and similar repositories for Python_Hadoop_BigData:
Users that are interested in Python_Hadoop_BigData are comparing it to the libraries listed below
- A big data project to apply Hadoop map- reduce to derive some statistics from IMDB movie data.☆26Updated 10 years ago
- This repository hosts the code/projects/demos/slides for Big Data technologies under Apache Hadoop and Apache Spark umbrella.☆42Updated 2 years ago
- All my projects on Big Data are provided☆27Updated 8 years ago
- Big data projects implemented by Maniram yadav☆51Updated 6 years ago
- Apache Spark using SQL☆14Updated 3 years ago
- Hadoop Project☆13Updated 7 years ago
- Big Data (Hadoop): Twitter Analysis☆20Updated 9 years ago
- Analytics projects using Big Data eco-systems (Hadoop, Spark, Storm)☆17Updated 3 years ago
- An analysis on Aadhaar dataset using Mapreduce and Spark☆14Updated 7 years ago
- ☆45Updated 6 years ago
- This is a repository for my data engineer course through Udacity.☆16Updated 5 years ago
- At the time of exams most of the time student share their notes via social media and after the exam gets over it become really difficut t…☆14Updated 6 years ago
- Exploratory data analysis using Python, Numpy, Pandas, Seaborn for Carsale Advertisement Dataset☆8Updated 5 years ago
- This repo is mostly created for pyspark and hive related interview questions.☆47Updated 3 years ago
- Final Project for IoT: Big Data Processing and Analytics class. Analyzing U.S nationwide temperature from IoT sensors in real-time☆70Updated 8 years ago
- Big Data Real Time Projects☆23Updated 7 years ago
- Repository related to Spark SQL and Pyspark using Python3☆37Updated 2 years ago
- Machine Learning Case study on customer segmentation and prediction of groups.☆31Updated 6 years ago
- Develop ML models predict taxi trip duration in NYC. Ranked : Top 6% | RMSLE : 0.377 (Kaggle) | #DS☆17Updated 2 years ago
- This is a guided certification project, as a part of Data Science for Social Good initiative☆17Updated 5 years ago
- ☆115Updated 4 years ago
- ☆15Updated 3 years ago
- Learning from multiple companies in Silicon Valley. Netflix, Facebook, Google, Startups☆16Updated 6 years ago
- Zomato Restaurants Exploratory Data Analysis, Visualization and Prediction with Sentiment Analysis of Reviews and Recommendation System☆74Updated 4 years ago
- This project is mainly for learning and practicing simple HIVE commands in real time scenarios. Here we have taken some sample coffee sho…☆11Updated 7 years ago
- My solutions for the Udacity Data Engineering Nanodegree☆33Updated 5 years ago
- ☆16Updated 6 years ago
- This repository focuses on gathering and making a curated list resources to learn Hadoop for FREE.☆54Updated 6 years ago
- Slowly Changing Dimension type 2 using Hive query language using exclusive join technique with ORC Hive tables, partitioned and clustered…☆16Updated 5 years ago
- Personal project where I perform some analytics (including Sentiment Analysis) over a Twitter Stream using Big Data Technologies of the H…☆21Updated 2 years ago