evagian / Document-similarity-K-shingles-minhashing-LSH-pythonLinks
☆32Updated 7 years ago
Alternatives and similar repositories for Document-similarity-K-shingles-minhashing-LSH-python
Users that are interested in Document-similarity-K-shingles-minhashing-LSH-python are comparing it to the libraries listed below
Sorting:
- Collection of functions and scripts for text retrieval in Python: Document collection preprocessing, Feature Selection, Indexing, Query p…☆43Updated 12 years ago
- Extract synonyms, keywords from sentences using modified implementation of Aho Corasick algorithm☆40Updated 8 years ago
- An example on how to train supervised classifiers for multi-label text classification using sklearn pipelines☆110Updated 7 years ago
- Machine learning prediction of movies genres using Gensim's Doc2Vec and PyMongo - (Python, MongoDB)☆37Updated 2 years ago
- Entity level sentiment analysis for product reviews using deep learning☆56Updated 9 years ago
- Code base for representation learning of very short texts, such as tweets. By Cedric De Boom, IBCN, Ghent University, Belgium.☆35Updated 9 years ago
- Relatively simple text classification powered by spaCy☆41Updated 9 years ago
- Materials for O'Reilly DL 4 NLP tutorial (SF 2017)☆25Updated 8 years ago
- A library & tools to evaluate predictive language models.☆63Updated 2 years ago
- Code from http://www.ark.cs.cmu.edu/mheilman/questions/☆12Updated 12 years ago
- ☆25Updated 7 years ago
- Implements Rocchio Query Expansion - similar to "related searches:" found at popular search engines but based on relevant documents selec…☆20Updated 9 years ago
- Yelp Restaurant Photo Classification - Kaggle competition☆12Updated 6 years ago
- Text Preprocessing in Python☆19Updated 8 years ago
- Benchmarks for Kaggle's Predict Closed Questions on Stack Overflow competition☆55Updated 9 years ago
- Code to munge data between Kaggle .tsv Rotten Tomatoes Sentiment Analysis data set and Vowpal Wabbit☆24Updated 11 years ago
- Quick-Data-Science-Experiments☆19Updated 7 years ago
- Tools and services for evaluating topic models☆15Updated 9 years ago
- The top 10 solution to the "Growing Instability: Classifying Crisis Reports" challenge☆21Updated 8 years ago
- Train word embeddings with Gensim and vizualize them with TensorBoard☆34Updated 6 years ago
- ☆25Updated 9 years ago
- Document clustering in Python☆30Updated 9 years ago
- ☆18Updated 7 years ago
- Materials for O'Reilly DL 4 NLP tutorial (NYC, June 2017)☆37Updated 8 years ago
- A simple CNN implementation in Keras.☆30Updated 9 years ago
- Developing different methods for expanding a query/topic in information retrieval and choosing the best expanded query using similarity m…☆11Updated 8 years ago
- A project on achieving Named-Entity Recognition using Deep Learning.☆25Updated 6 years ago
- Fraud Detection using ensemble of Statistical, Network analysis and Machine learning approach.☆68Updated 10 years ago
- Tensorflow implementation of Facebook TagSpace☆74Updated 6 years ago
- Predicting the number of likes an instagram post will receive in 24 hours - winning solution☆57Updated 8 years ago