jfchen / Spark-SQL-Twitter-Analyzer
Process large amount of Twitter data using Spark SQL (and its JSON support). Answers questions like "What are the most popular languages?", "Who is most influential?", "Which time zones are most active during a day?" and more.
☆9Updated 10 years ago
Alternatives and similar repositories for Spark-SQL-Twitter-Analyzer
Users that are interested in Spark-SQL-Twitter-Analyzer are comparing it to the libraries listed below
Sorting:
- A chef cookbook for deploying spark☆30Updated 12 years ago
- Machine Learning over Twitter's stream. Using Apache Spark, Web Server and Lightning Graph server.☆27Updated 8 years ago
- Tutorial for Deploying Anaconda Cluster and PySpark on top of Red Hat Storage GlusterFS☆8Updated 10 years ago
- Real-time dashboard for Twitter Sentiment analysis using Spark Streaming and Watson Tone Analyzer☆31Updated 6 years ago
- This project contains the code to translate between Apache Spark and SFrame.☆20Updated 8 years ago
- A demo of how to use PageRank with Hadoop and SociaLite to identify anomalies in Healthcare Data☆47Updated 9 years ago
- Film recommendations with Apache Spark and Python☆61Updated 9 years ago
- Assembly of fundamental statistics implemented based on Apache Spark☆31Updated 9 years ago
- Named Entity Extraction on Twitter Stream using Apache Spark Streaming and Stanford CoreNLP☆15Updated 8 years ago
- ☆20Updated 8 years ago
- Fraud Detection Online (Hadoop application)☆18Updated 11 years ago
- ☆24Updated 10 years ago
- Spark Tutorial at the University of Maryland☆38Updated 10 years ago
- tutorials and samples that show you how get the most out of IBM Analytics for Apache Spark☆79Updated 7 years ago
- Code for KDD 2014 paper "Mining Topics in Documents: Standing on the Shoulders of Big Data"☆21Updated 9 years ago
- Exploration Library in Java☆12Updated last year
- Templates for projects based on top of H2O.☆38Updated 2 months ago
- Zeppelin notebook examples☆26Updated 9 years ago
- ☆15Updated 7 years ago
- The code for the in memory data pipeline that was presented at Berlin Buzzwords 2015.☆10Updated 9 years ago
- Dato/Turi DS Conf talk on NLP and Elasticsearch analysis of reviews, plus JS implementation☆45Updated 8 years ago
- real time log event processing using spark, kafka & cassandra☆13Updated 10 years ago
- A real time streaming implementation of markov chain based fraud detection☆23Updated 10 years ago
- Data and code for "Fast Data Applications with Spark and Python"☆25Updated 8 years ago
- This repository contains code files specifically IPython notebooks for the assignments in the course "Scalable Machine Learning" by UC Be…☆30Updated 9 years ago
- A single docker image that combines Neo4j Mazerunner and Apache Spark GraphX into a powerful all-in-one graph processing engine☆46Updated 5 years ago
- PMML evaluator library for the Apache Hive data warehouse software (legacy codebase)☆13Updated 10 years ago
- Fast-Data-Processing-with-Spark-2☆22Updated 2 years ago
- Spark in Kaggle competitions☆10Updated 9 years ago
- ☆21Updated 10 years ago