arun-kambhammettu / Python_Hadoop_BigData
A project that uses airline data to answer questions about aircraft details and business-class passenger details, using Hadoop, Python functions, Hive, Pig scripts, and HQL.
☆9Updated 8 years ago
Alternatives and similar repositories for Python_Hadoop_BigData:
Users who are interested in Python_Hadoop_BigData are comparing it to the libraries listed below.
- All my projects on Big Data are provided☆27Updated 8 years ago
- A big data project that applies Hadoop MapReduce to derive some statistics from IMDB movie data.☆26Updated 10 years ago
- Big data projects implemented by Maniram Yadav☆51Updated 6 years ago
- This repository hosts the code/projects/demos/slides for Big Data technologies under Apache Hadoop and Apache Spark umbrella.☆42Updated 2 years ago
- Personal project where I perform some analytics (including Sentiment Analysis) over a Twitter Stream using Big Data Technologies of the H…☆21Updated last year
- This is a guided certification project, as a part of Data Science for Social Good initiative☆17Updated 5 years ago
- This is a Flask-based app that scrapes user reviews and comments from a retail website and generates a word cloud, with CSV data available to do…☆29Updated 2 years ago
- This repository focuses on gathering a curated list of resources to learn Hadoop for FREE.☆54Updated 6 years ago
- Spark application for analyzing Apache access logs and detecting anomalies! Along with a Medium article.☆20Updated 6 years ago
- Hadoop tutorial files. For detailed tutorials, visit www.youtube.com/learningjournalin☆26Updated 7 years ago
- This project is mainly for learning and practicing simple Hive commands in real-time scenarios. Here we have taken some sample coffee sho…☆11Updated 7 years ago
- Big Data Real Time Projects☆23Updated 7 years ago
- Counting Tweets Per User in Real-Time☆42Updated 7 years ago
- Data cleaning, pre-processing, and analytics on health care data using Spark and Python.☆47Updated last year
- data engineering 100 days 🤖 🧲 🦾 | #DE☆40Updated last year
- Published by Packt☆114Updated 2 years ago
- Hadoop Project☆13Updated 7 years ago
- A project that fetches and reads tweets and shows analysis such as who is most influential☆28Updated last year
- Data cleaning, pre-processing, and Analytics on a million movies using Spark and Scala.☆94Updated 3 years ago
- Apache Spark using SQL☆14Updated 3 years ago
- Collection of Python solutions to problems from various competitive programming sites☆14Updated 2 weeks ago
- ☆47Updated 4 years ago
- HackerRank Programming Challenges☆9Updated 3 years ago
- ☆114Updated 4 years ago
- ☆12Updated 4 years ago
- Learn Machine Learning using PySpark from scratch☆19Updated 6 years ago
- Develop ML models to predict taxi trip duration in NYC. Ranked : Top 6% | RMSLE : 0.377 (Kaggle) | #DS☆17Updated 2 years ago
- Analysis of restaurant ratings data to gain insights into the performance of various restaurants.☆15Updated 2 years ago
- ☆15Updated last year
- Big Data (Hadoop): Twitter Analysis☆20Updated 9 years ago