jfchen / Spark-SQL-Twitter-AnalyzerLinks
Process large amount of Twitter data using Spark SQL (and its JSON support). Answers questions like "What are the most popular languages?", "Who is most influential?", "Which time zones are most active during a day?" and more.
☆9Updated 10 years ago
Alternatives and similar repositories for Spark-SQL-Twitter-Analyzer
Users that are interested in Spark-SQL-Twitter-Analyzer are comparing it to the libraries listed below
Sorting:
- Data and code for "Fast Data Applications with Spark and Python"☆25Updated 8 years ago
- Film recommendations with Apache Spark and Python☆61Updated 10 years ago
- Fast-Data-Processing-with-Spark-2☆22Updated 2 years ago
- Public code files for the DDL blog☆56Updated 7 years ago
- Real-time dashboard for Twitter Sentiment analysis using Spark Streaming and Watson Tone Analyzer☆31Updated 6 years ago
- tutorials and samples that show you how get the most out of IBM Analytics for Apache Spark☆79Updated 7 years ago
- Tutorial for Deploying Anaconda Cluster and PySpark on top of Red Hat Storage GlusterFS☆8Updated 10 years ago
- Spark in Kaggle competitions☆10Updated 9 years ago
- 4th Place Solution for The Hunt for Prohibited Content Competition on Kaggle (http://www.kaggle.com/c/avito-prohibited-content)☆28Updated 10 years ago
- Repo for the Insults Detection challenge on Kaggle.com☆11Updated 12 years ago
- Code for KDD 2014 paper "Mining Topics in Documents: Standing on the Shoulders of Big Data"☆21Updated 9 years ago
- Assembly of fundamental statistics implemented based on Apache Spark☆31Updated 9 years ago
- Additional files for the Otto Group Challenge hosted by Kaggle☆37Updated 10 years ago
- Machine Learning over Twitter's stream. Using Apache Spark, Web Server and Lightning Graph server.☆27Updated 9 years ago
- Apache Toree quickstart tutorial☆29Updated 9 years ago
- A demo of how to use PageRank with Hadoop and SociaLite to identify anomalies in Healthcare Data☆47Updated 9 years ago
- ☆48Updated 9 years ago
- This repository contains code files specifically IPython notebooks for the assignments in the course "Scalable Machine Learning" by UC Be…☆31Updated 9 years ago
- Zeppelin notebook examples☆25Updated 9 years ago
- ☆35Updated 2 years ago
- Training materials for Strata, AMP Camp, etc☆149Updated 9 years ago
- A simple example application that will connect to the Twitter API, run a search, gather tweets, and then calculate the sentiment of each …☆65Updated 9 years ago
- Examples of Integrating Spark Streaming, Flume, and HBase to solve Streaming problems☆19Updated 11 years ago
- Code example to predict prices of Airbnb vacation rentals, using scikit-learn on Spark with spark-sklearn, on MapR.☆44Updated 8 years ago
- A chef cookbook for deploying spark☆30Updated 12 years ago
- Computes and visualizes the sentiment analysis of tweets of US States in real-time using Storm.☆26Updated 10 years ago
- Large-scale ML & graph analytics on Giraph☆78Updated 9 years ago
- Additional useful algorithms that can be used with spark.☆24Updated 10 years ago
- A spark sbt blueprint to build your own spark apps off of (for cloud native runtime, see the kube/spark examples)☆56Updated 6 years ago
- Dato/Turi DS Conf talk on NLP and Elasticsearch analysis of reviews, plus JS implementation☆45Updated 8 years ago