jfchen / Spark-SQL-Twitter-AnalyzerLinks

Process large amount of Twitter data using Spark SQL (and its JSON support). Answers questions like "What are the most popular languages?", "Who is most influential?", "Which time zones are most active during a day?" and more.

☆9

Alternatives and similar repositories for Spark-SQL-Twitter-Analyzer

Users that are interested in Spark-SQL-Twitter-Analyzer are comparing it to the libraries listed below

Sorting:

ibm-watson-data-lab / spark.samples
tutorials and samples that show you how get the most out of IBM Analytics for Apache Spark
☆79Updated 7 years ago
ibm-watson-data-lab / Spark-Twitter-Watson-Dashboard
Real-time dashboard for Twitter Sentiment analysis using Spark Streaming and Watson Tone Analyzer
☆31Updated 6 years ago
marklit / recommend
Film recommendations with Apache Spark and Python
☆61Updated 10 years ago
giorgioinf / twitter-stream-ml
Machine Learning over Twitter's stream. Using Apache Spark, Web Server and Lightning Graph server.
☆27Updated 9 years ago
AtlasPilotPuppy / SparkAlgorithms
Additional useful algorithms that can be used with spark.
☆24Updated 10 years ago
amplab / training
Training materials for Strata, AMP Camp, etc
☆149Updated 9 years ago
DistrictDataLabs / blog-files
Public code files for the DDL blog
☆56Updated 7 years ago
ofermend / medicare-demo
A demo of how to use PageRank with Hadoop and SociaLite to identify anomalies in Healthcare Data
☆47Updated 9 years ago
dipanjanS / BerkeleyX-CS190.1x-Scalable-Machine-Learning
This repository contains code files specifically IPython notebooks for the assignments in the course "Scalable Machine Learning" by UC Be…
☆31Updated 10 years ago
DistrictDataLabs / spark-workshop
Data and code for "Fast Data Applications with Spark and Python"
☆25Updated 8 years ago
jmankoff / data
The repository for the CMU Data Pipeline course. This year's course should use branch 2017
☆40Updated 8 years ago
felixcheung / spark-notebook-examples
Some notebook examples related to Apache Spark, IPython / Jupyter, Zeppelin
☆52Updated 9 years ago
wattsteve / pyspark-tutorial
Tutorial for Deploying Anaconda Cluster and PySpark on top of Red Hat Storage GlusterFS
☆8Updated 10 years ago
ceteri / spark-exercises
Coding exercises for Apache Spark
☆104Updated 10 years ago
brett-pplx / AMC
Code for KDD 2014 paper "Mining Topics in Documents: Standing on the Shoulders of Big Data"
☆21Updated 9 years ago
bhomass / marseille
A real time streaming implementation of markov chain based fraud detection
☆23Updated 10 years ago
adobe-research / spark-gpu
GPU Acceleration for Apache Spark
☆34Updated 9 years ago
PacktPublishing / Fast-Data-Processing-with-Spark-2
Fast-Data-Processing-with-Spark-2
☆22Updated 2 years ago
ottogroup / kaggle
Additional files for the Otto Group Challenge hosted by Kaggle
☆37Updated 10 years ago
rjurney / Agile_Data_Code
Chapter-wise code for Agile Data the O'Reilly book
☆159Updated 11 years ago
turi-code / spark-sframe
This project contains the code to translate between Apache Spark and SFrame.
☆20Updated 9 years ago
amir-rahnama / pyspark-twitter-stream-mining
Real-time Machine Learning with Apache Spark on Twitter Public Stream
☆68Updated 8 years ago
datalayer-attic / zeppelin
Apache Zeppelin on Kubernetes.
☆28Updated 6 years ago
asimjalis / apache-toree-quickstart
Apache Toree quickstart tutorial
☆29Updated 9 years ago
DhruvKumar / spark-twitter-sentiment
☆35Updated 2 years ago
Leemoonsoo / zeppelin-examples
Zeppelin notebook examples
☆25Updated 9 years ago
andreiolariu / kaggle-insults
Repo for the Insults Detection challenge on Kaggle.com
☆11Updated 12 years ago
burun / BerkeleyX-Apache-Spark-Labs
☆19Updated 8 years ago
mdymczyk / iot-pipeline
☆15Updated 7 years ago
holdenk / fastdataprocessingwithsparkexamples
Examples for Fast Data Processing with Spark
☆59Updated 11 years ago