scalding-io / social-media-analytics
Social Media Data Mining and Analytics - HyperLogLog, BloomFilter and CountMinSketch with Scalding & Algebird
☆27 Updated 6 years ago
Alternatives and similar repositories for social-media-analytics:
Users who are interested in social-media-analytics are comparing it to the libraries listed below.
- Word2Vec models with Twitter data using Spark. ☆65 Updated 6 years ago
- A real-time streaming implementation of Markov-chain-based fraud detection ☆24 Updated 10 years ago
- Tweet Analysis with Spark ☆15 Updated 7 years ago
- Machine Learning over Twitter's stream. Using Apache Spark, Web Server and Lightning Graph server. ☆27 Updated 8 years ago
- ☆19 Updated 8 years ago
- This project contains the code to translate between Apache Spark and SFrame. ☆20 Updated 8 years ago
- ☆20 Updated 8 years ago
- ☆35 Updated 2 years ago
- An API for Distributed Machine Learning ☆154 Updated 8 years ago
- Additional useful algorithms that can be used with Spark. ☆24 Updated 10 years ago
- Fraud Detection Online (Hadoop application) ☆18 Updated 10 years ago
- Predicting The Stock Market using Time Series Analysis and Media ☆10 Updated 10 years ago
- Assembly of fundamental statistics implemented based on Apache Spark ☆31 Updated 9 years ago
- Named Entity Extraction on Twitter Stream using Apache Spark Streaming and Stanford CoreNLP ☆15 Updated 8 years ago
- SmallK: very fast data clustering tools ☆14 Updated 5 years ago
- Data Science in Scala - Conf. Talk Repo ☆15 Updated 8 years ago
- Spark implementation of the Google Correlate algorithm to quickly find highly correlated vectors in huge datasets ☆92 Updated 9 years ago
- Some notebook examples related to Apache Spark, IPython / Jupyter, Zeppelin ☆52 Updated 8 years ago
- Streaming tweets with Spark, language detection & sentiment analysis, dashboard with Kibana ☆103 Updated 9 years ago
- Java implementation of Microsoft's AdPredictor algorithm ☆17 Updated 6 years ago
- Experiments on English Wikipedia. GloVe and word2vec. ☆13 Updated 9 years ago
- Spark library for doing exploratory data analysis in a scalable way ☆43 Updated 9 years ago
- Code for KDD 2014 paper "Mining Topics in Documents: Standing on the Shoulders of Big Data" ☆21 Updated 9 years ago
- Examples for Fast Data Processing with Spark ☆59 Updated 11 years ago
- Distributed Streaming Quantiles (for PySpark) ☆38 Updated 11 years ago
- How to use automatic polynomial features and neural network mode in VW ☆17 Updated 10 years ago
- Spark in Kaggle competitions ☆9 Updated 8 years ago
- Templates for projects based on top of H2O. ☆37 Updated 4 months ago
- Data and code for "Fast Data Applications with Spark and Python" ☆25 Updated 8 years ago
- ☆35 Updated 8 years ago