jfchen / Spark-SQL-Twitter-Analyzer
Process large amounts of Twitter data using Spark SQL (and its JSON support). Answers questions like "What are the most popular languages?", "Who is most influential?", "Which time zones are most active during a day?" and more.
☆9 · Updated 9 years ago
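The first question in the description ("What are the most popular languages?") comes down to a SQL group-and-count over JSON records. The following is a minimal sketch of that query shape only, not the repository's code: it uses Python's built-in `sqlite3` as a stand-in for Spark SQL so it runs without a cluster. The sample tweets, the `tweets` table, and the `most_popular_languages` helper are all hypothetical, and it assumes each tweet carries a top-level `lang` field.

```python
import json
import sqlite3

# Hypothetical sample tweets, one JSON object per line (made up for illustration).
SAMPLE_TWEETS = """\
{"id": 1, "lang": "en", "text": "hello"}
{"id": 2, "lang": "es", "text": "hola"}
{"id": 3, "lang": "en", "text": "hi again"}
{"id": 4, "lang": "ja", "text": "konnichiwa"}
{"id": 5, "lang": "en", "text": "bye"}
{"id": 6, "lang": "es", "text": "adios"}
"""


def most_popular_languages(json_lines, limit=3):
    """Return (lang, count) pairs, most frequent first.

    sqlite3 is only a stand-in here; the SELECT below is the part that
    would carry over to a SQL engine that can read JSON records.
    """
    # Parse one JSON object per non-empty line.
    rows = [json.loads(line) for line in json_lines.splitlines() if line.strip()]

    conn = sqlite3.connect(":memory:")
    try:
        conn.execute("CREATE TABLE tweets (id INTEGER, lang TEXT)")
        conn.executemany(
            "INSERT INTO tweets VALUES (?, ?)",
            [(r["id"], r.get("lang")) for r in rows],  # assumed field name: lang
        )
        query = """
            SELECT lang, COUNT(*) AS n
            FROM tweets
            WHERE lang IS NOT NULL
            GROUP BY lang
            ORDER BY n DESC, lang ASC
            LIMIT ?
        """
        return conn.execute(query, (limit,)).fetchall()
    finally:
        conn.close()


if __name__ == "__main__":
    for lang, n in most_popular_languages(SAMPLE_TWEETS):
        print(lang, n)
```

Ties are broken alphabetically by language code so the ordering is deterministic; the other questions in the description would presumably follow the same pattern with a different grouping column.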
Alternatives and similar repositories for Spark-SQL-Twitter-Analyzer:
Users who are interested in Spark-SQL-Twitter-Analyzer are comparing it to the repositories listed below:
- A real-time streaming implementation of Markov chain-based fraud detection ☆24 · Updated 10 years ago
- Real-time dashboard for Twitter sentiment analysis using Spark Streaming and Watson Tone Analyzer ☆31 · Updated 6 years ago
- Kaggle's click-through rate prediction with Spark Pipeline API ☆23 · Updated 9 years ago
- Spark in Kaggle competitions ☆9 · Updated 8 years ago
- Fast-Data-Processing-with-Spark-2 ☆22 · Updated 2 years ago
- ☆11 · Updated 10 years ago
- Data and code for "Fast Data Applications with Spark and Python" ☆25 · Updated 8 years ago
- Code for KDD 2014 paper "Mining Topics in Documents: Standing on the Shoulders of Big Data" ☆21 · Updated 9 years ago
- Spark Tutorial at the University of Maryland ☆38 · Updated 10 years ago
- ☆20 · Updated 8 years ago
- Tutorials and samples that show you how to get the most out of IBM Analytics for Apache Spark ☆79 · Updated 6 years ago
- Film recommendations with Apache Spark and Python ☆61 · Updated 9 years ago
- GraphX example ☆24 · Updated 9 years ago
- Additional useful algorithms that can be used with Spark. ☆24 · Updated 10 years ago
- ☆42 · Updated 8 years ago
- A subproject of Predictiveworks that provides common access to Cassandra, Elasticsearch, HBase, MongoDB, Parquet, JDBC database and other… ☆13 · Updated 10 years ago
- Assembly of fundamental statistics implemented based on Apache Spark ☆31 · Updated 9 years ago
- Machine Learning over Twitter's stream. Using Apache Spark, Web Server and Lightning Graph server. ☆27 · Updated 8 years ago
- Repo for the Insults Detection challenge on Kaggle.com ☆11 · Updated 11 years ago
- Real-time Machine Learning with Apache Spark on Twitter Public Stream ☆68 · Updated 7 years ago
- Detecting outliers in a dataset using Spark ☆41 · Updated 8 years ago
- A Spark sbt blueprint to build your own Spark apps from (for cloud native runtime, see the kube/spark examples) ☆56 · Updated 5 years ago
- Some notebook examples related to Apache Spark, IPython / Jupyter, Zeppelin ☆52 · Updated 8 years ago
- Examples of integrating Spark Streaming, Flume, and HBase to solve streaming problems ☆19 · Updated 11 years ago
- ☆15 · Updated 7 years ago
- ☆21 · Updated 10 years ago
- ☆35 · Updated 2 years ago
- Time series and energy data analysis API for Spark. ☆19 · Updated 12 years ago
- This project contains the code to translate between Apache Spark and SFrame. ☆20 · Updated 8 years ago
- This repository contains code files, specifically IPython notebooks, for the assignments in the course "Scalable Machine Learning" by UC Be… ☆30 · Updated 9 years ago