evagian / Document-similarity-K-shingles-minhashing-LSH-pythonLinks
☆32Updated 7 years ago
Alternatives and similar repositories for Document-similarity-K-shingles-minhashing-LSH-python
Users that are interested in Document-similarity-K-shingles-minhashing-LSH-python are comparing it to the libraries listed below
Sorting:
- Machine learning prediction of movies genres using Gensim's Doc2Vec and PyMongo - (Python, MongoDB)☆37Updated 2 years ago
- Implements Rocchio Query Expansion - similar to "related searches:" found at popular search engines but based on relevant documents selec…☆20Updated 9 years ago
- Code from http://www.ark.cs.cmu.edu/mheilman/questions/☆12Updated 12 years ago
- A library & tools to evaluate predictive language models.☆63Updated 2 years ago
- Materials for O'Reilly DL 4 NLP tutorial (SF 2017)☆25Updated 8 years ago
- Collection of functions and scripts for text retrieval in Python: Document collection preprocessing, Feature Selection, Indexing, Query p…☆43Updated 12 years ago
- Extract synonyms, keywords from sentences using modified implementation of Aho Corasick algorithm☆40Updated 8 years ago
- Entity level sentiment analysis for product reviews using deep learning☆56Updated 9 years ago
- Document clustering in Python☆30Updated 9 years ago
- Tools and services for evaluating topic models☆15Updated 9 years ago
- An example on how to train supervised classifiers for multi-label text classification using sklearn pipelines☆110Updated 7 years ago
- Relatively simple text classification powered by spaCy☆41Updated 10 years ago
- Code base for representation learning of very short texts, such as tweets. By Cedric De Boom, IBCN, Ghent University, Belgium.☆34Updated 9 years ago
- ☆18Updated 7 years ago
- Use RNNs to identify entities in news queries☆56Updated 9 years ago
- A pure python implementation of locality sensitive hashing for text documents☆85Updated 10 years ago
- LSH based high dimensional clustering for sets and points☆79Updated 10 years ago
- Text Preprocessing in Python☆19Updated 8 years ago
- Variants of Multi-Perspective Convolutional Neural Networks☆22Updated 2 years ago
- Discovers similarity between scientific papers☆62Updated 9 years ago
- Utilities for preprocessing text for deep learning with Keras☆180Updated 2 years ago
- Similarity search on Wikipedia using gensim in Python.☆60Updated 6 years ago
- Feature-Time Instability Metric☆44Updated 9 years ago
- Benchmarks for Kaggle's Predict Closed Questions on Stack Overflow competition☆55Updated 9 years ago
- Introduction to structured prediction with Python and pystruct☆18Updated 7 years ago
- Code For Medium Article "How To Create Data Products That Are Magical Using Sequence-to-Sequence Models"☆138Updated 2 years ago
- Code for Large Scale Hierarchical Text Classification competition. Final place: 3rd☆37Updated 11 years ago
- Word2Vec + Principal Component Analysis + Clustering for low-dimensional semantic representation of a set of words or compositional MWEs.☆20Updated 10 years ago
- Augmenting word embeddings with their surrounding context using bidirectional RNN☆60Updated 5 years ago
- Quick-Data-Science-Experiments☆19Updated 7 years ago