dineshappavoo / IMDBMovieBigData
A big data project that applies Hadoop MapReduce to derive some statistics from IMDB movie data.
☆26Updated 10 years ago
Alternatives and similar repositories for IMDBMovieBigData:
Users who are interested in IMDBMovieBigData are comparing it to the repositories listed below.
- All my projects on Big Data are provided☆27Updated 8 years ago
- Big Data (Hadoop): Twitter Analysis☆20Updated 9 years ago
- Twitter Sentiment Analysis using Spark and Kafka☆115Updated 5 years ago
- A project using Airline data, to check aircraft details and Business class passenger details on various questions using Hadoop, Python f…☆9Updated 9 years ago
- This repository hosts the code/projects/demos/slides for Big Data technologies under Apache Hadoop and Apache Spark umbrella.☆42Updated 2 years ago
- Collection of Pig scripts that I use for my talks and workshops☆40Updated 11 years ago
- Map-reduce, streaming analysis, and external memory algorithms and their implementation using Hadoop and its ecosystem: HBase, Hive,…☆34Updated 8 years ago
- Spark application for analyzing Apache access logs and detecting anomalies, with an accompanying Medium article.☆20Updated 6 years ago
- Final Project for IoT: Big Data Processing and Analytics class. Analyzing U.S nationwide temperature from IoT sensors in real-time☆70Updated 8 years ago
- Churn Prediction with PySpark using MLlib and ML Packages☆56Updated 9 years ago
- Analytics projects using Big Data eco-systems (Hadoop, Spark, Storm)☆17Updated 3 years ago
- Educational notes, hands-on problems w/ solutions for the Hadoop ecosystem☆87Updated 6 years ago
- ☆37Updated 8 years ago
- Analyze and visualize Twitter Sentiment on a world map using Spark MLlib☆139Updated 3 years ago
- Final project for Udacity's ud741 — Unsupervised Learning☆52Updated 9 years ago
- Here's how to get DataQuest's Data Engineering Track missions' content to work on your localhost. Using data from my Valenbisi ARIMA mode…☆15Updated 6 years ago
- Counting Tweets Per User in Real-Time☆42Updated 7 years ago
- An analysis of the Aadhaar dataset using MapReduce and Spark☆14Updated 7 years ago
- Slowly Changing Dimension type 2 using Hive query language using exclusive join technique with ORC Hive tables, partitioned and clustered…☆16Updated 5 years ago
- ETL pipeline using pyspark (Spark - Python)☆114Updated 5 years ago
- This repository contains code examples for the course CS 20SI: TensorFlow for Deep Learning Research.☆12Updated 8 years ago
- This repository contains code files specifically IPython notebooks for the assignments in the course "Introduction to Big Data with Apach…☆115Updated 8 months ago
- Archived work from Udacity nanodegrees☆70Updated 3 years ago
- Ingest tweets with Kafka. Use Spark to track popular hashtags and trendsetters for each hashtag☆29Updated 9 years ago
- Big Data Management and Analysis Final Project☆68Updated 7 years ago
- Learn to build a data pipeline with Airflow to automate wrangling data - a Udacity Data Engineer Nanodegree project☆8Updated 5 years ago
- Personal project where I perform some analytics (including Sentiment Analysis) over a Twitter Stream using Big Data Technologies of the H…☆21Updated 2 years ago
- My submissions for the Coursera MOOC "Big Data Analysis with Scala and Spark" given by EPFL.☆52Updated 8 years ago
- Real-time Machine Learning with Apache Spark on Twitter Public Stream☆68Updated 7 years ago
- Big Data for Data Engineers Coursera Specialization from Yandex☆102Updated 2 years ago