seinecle / UmigonLinks
Sentiment analysis for Twitter and social media
☆36Updated 2 years ago
Alternatives and similar repositories for Umigon
Users that are interested in Umigon are comparing it to the libraries listed below
Sorting:
- Word2Vec models with Twitter data using Spark. Blog:☆66Updated 6 years ago
- Raw Wikipedia counts for entity linking☆19Updated 8 years ago
- A demo of how to use PageRank with Hadoop and SociaLite to identify anomalies in Healthcare Data☆47Updated 9 years ago
- Movielens collaborative filtering with Solr streaming expression☆11Updated 9 years ago
- Tweet Analysis with Spark☆15Updated 8 years ago
- A Text Classification API in Java originally developed by DigitalPebble Ltd. The API is independent from the ML implementations used and …☆48Updated 4 years ago
- Dato/Turi DS Conf talk on NLP and Elasticsearch analysis of reviews, plus JS implementation☆45Updated 9 years ago
- NLP tools developed by Emory University.☆61Updated 9 years ago
- A toolkit that wraps various natural language processing implementations behind a common interface.☆101Updated 8 years ago
- Linking Entities in CommonCrawl Dataset onto Wikipedia Concepts☆59Updated 13 years ago
- Replication software, data, and supplementary materials for the paper: O'Connor, Stewart and Smith, ACL-2013, "Learning to Extract Intern…☆27Updated 4 years ago
- ☆20Updated 9 years ago
- Build intelligent data-driven applications with minimal effort. Sentence Clustering, Topics Extraction, Text Similarity, Opinion Summariz…☆41Updated 6 years ago
- Build tables of information by extracting facts from indexed text corpora via a simple and effective query language.☆56Updated 6 years ago
- Tools for building a Lucene index for Semantic Vectors☆21Updated 10 years ago
- Named Entity Extraction on Twitter Stream using Apache Spark Streaming and Stanford CoreNLP☆15Updated 9 years ago
- Turbo topics find significant multiword phrases in topics.☆46Updated 10 years ago
- An Apache Lucene TokenFilter that uses a word2vec vectors for term expansion.☆24Updated 11 years ago
- Distributed implementation of Robust PLSA using Spark☆12Updated 4 years ago
- Code + Jupyter Notebooks for Visualizing Clusters of Clickbait Headlines Using Spark, Word2vec, and Plotly☆47Updated 5 years ago
- Experiments on english wikipedia. GloVe and word2vec.☆13Updated 9 years ago
- Spark implementation of the Google Correlate algorithm to quickly find highly correlated vectors in huge datasets☆92Updated 9 years ago
- A library of examples showing how to use the Common Crawl corpus (2008-2012, ARC format)☆65Updated 9 years ago
- A vector similarity database☆230Updated 11 years ago
- Social Media Data Mining and Analytics - HyperLogLog, BloomFilter and CountMinSketch with Scalding & Algebird☆27Updated 7 years ago
- Hadoop jobs for WikiReverse project. Parses Common Crawl data for links to Wikipedia articles.☆38Updated 7 years ago
- A collection of documents and materials for the EMNLP-2015 Semantic Similarity tutorial☆30Updated 10 years ago
- A framework for the analysis of social interaction networks (e.g. induced by Twitter mentions) in time.☆61Updated 9 years ago
- Code for the DeepScript Submission to ICFHR2016 Competition on the Classification of Medieval Handwritings in Latin Script☆17Updated 8 years ago
- framework for doing NER and other types of entity recognition, in Python☆68Updated 3 years ago