dhwajraj / spark-twitter-named-entity
Named Entity Extraction on Twitter Stream using Apache Spark Streaming and Stanford CoreNLP
☆15Updated 8 years ago
Alternatives and similar repositories for spark-twitter-named-entity:
Users that are interested in spark-twitter-named-entity are comparing it to the libraries listed below
- Simple FieldCache based query introspection Solr Search Component - solves the 'red sofa' problem☆12Updated 3 months ago
- Implicit relation extractor using a natural language model.☆25Updated 6 years ago
- An Apache Lucene TokenFilter that uses a word2vec vectors for term expansion.☆24Updated 11 years ago
- Collects multimedia content shared through social networks.☆19Updated 10 years ago
- Movielens collaborative filtering with Solr streaming expression☆11Updated 8 years ago
- Regularized latent variable mixed membership modeling☆13Updated 11 years ago
- Project for the talk on NLP using LSTM implementation from DL4J on Spark☆20Updated 8 years ago
- Tweet Analysis with Spark☆15Updated 7 years ago
- DKPro WSD: A Java framework for word sense disambiguation☆20Updated 2 years ago
- Fast and robust NLP components implemented in Java.☆52Updated 4 years ago
- Base components for Question Answering pipelines☆28Updated 2 years ago
- ☆20Updated 8 years ago
- Provided Guidance on Creating End to End Solutions for Common SILK Use Cases☆13Updated 9 years ago
- Script to perform dictionary based n-gram text tagging efficiently in apache spark☆11Updated 8 years ago
- Exploration Library in Java☆12Updated last year
- Scala port of the word2vec toolkit.☆11Updated 8 years ago
- Text similarity based on Word2Vec vectors.☆11Updated 8 years ago
- ☆20Updated 8 years ago
- Elasticsearch Latent Semantic Indexing experimentation☆33Updated 5 years ago
- Algorithms that build k-nearest neighbors graph (k-nn graph): Brute-force, NN-Descent,...☆34Updated 6 years ago
- A suite of tools for sequence tagging, including regular and "deep" CRF, as well as convolutional and recurrent neural networks.☆9Updated 9 years ago
- Java library for Concrete, a data serialization format for NLP☆6Updated 5 years ago
- Implementation of the Chinese Whispers graph clustering algorithm☆8Updated 7 years ago
- Deep learning spelling patterns with a recurrent neural network☆12Updated 7 years ago
- System for mining Wikipedia Usage data to read our collective mind☆21Updated 10 years ago
- Code for KDD 2014 paper "Mining Topics in Documents: Standing on the Shoulders of Big Data"☆21Updated 9 years ago
- Keyword extraction package for Spark.☆12Updated 8 years ago
- code and slides for my PyGotham 2016 talk, "Higher-level Natural Language Processing with textacy"☆15Updated 8 years ago
- Library for building reproducible data pipelines to support experimentation☆20Updated 9 years ago
- Linking Entities in CommonCrawl Dataset onto Wikipedia Concepts☆59Updated 12 years ago