jfchen / Spark-SQL-Twitter-Analyzer
Processes large amounts of Twitter data using Spark SQL (and its JSON support). Answers questions like "What are the most popular languages?", "Who is most influential?", "Which time zones are most active during a day?" and more.
☆9 Updated 9 years ago
Related projects
Alternatives and complementary repositories for Spark-SQL-Twitter-Analyzer
- Examples of Integrating Spark Streaming, Flume, and HBase to solve Streaming problems ☆19 Updated 10 years ago
- This project contains the code to translate between Apache Spark and SFrame. ☆21 Updated 8 years ago
- Spark in Kaggle competitions ☆9 Updated 8 years ago
- An Apache Spark app for moving data between Apache Hive and Apache Phoenix/HBase ☆14 Updated 8 years ago
- Kaggle's click through rate prediction with Spark Pipeline API ☆23 Updated 8 years ago
- Film recommendations with Apache Spark and Python ☆61 Updated 9 years ago
- Assembly of fundamental statistics implemented based on Apache Spark ☆31 Updated 8 years ago
- Tutorials and samples that show you how to get the most out of IBM Analytics for Apache Spark ☆79 Updated 6 years ago
- ☆15 Updated 7 years ago
- Additional useful algorithms that can be used with Spark. ☆24 Updated 9 years ago
- Spark Tutorial at the University of Maryland ☆38 Updated 10 years ago
- Examples for Fast Data Processing with Spark ☆59 Updated 11 years ago
- Example project to show how to use Spark to read and write Avro/Parquet files ☆50 Updated 11 years ago
- Real-time dashboard for Twitter Sentiment analysis using Spark Streaming and Watson Tone Analyzer ☆31 Updated 5 years ago
- ☆11 Updated 10 years ago
- Real-time log event processing using Spark, Kafka & Cassandra ☆13 Updated 9 years ago
- A real-time streaming implementation of Markov chain based fraud detection ☆24 Updated 9 years ago
- ☆48 Updated 8 years ago
- Code for KDD 2014 paper "Mining Topics in Documents: Standing on the Shoulders of Big Data" ☆21 Updated 9 years ago
- A Real-Time Analytical Processing (RTAP) example using Spark/Shark ☆51 Updated 10 years ago
- ☆41 Updated 8 years ago
- Data and code for "Fast Data Applications with Spark and Python" ☆25 Updated 8 years ago
- A subproject of Predictiveworks that provides common access to Cassandra, Elasticsearch, HBase, MongoDB, Parquet, JDBC database and other… ☆13 Updated 9 years ago
- Sample custom Nifi processor to process tcpdump ☆18 Updated 9 years ago
- An Ambari Stack service package for VNC Server with the ability to install developer tools like Eclipse/IntelliJ/Maven as well to 'remote… ☆28 Updated 8 years ago
- A Spark sbt blueprint to build your own Spark apps off of (for cloud native runtime, see the kube/spark examples) ☆55 Updated 5 years ago