vackosar / fasttext-vector-norms-and-oov-wordsLinks
This blog post visualize vector norms of FastText embedding and evaluates use of FastText word vector norm multiplied with number of word n-grams for detecting non-english OOV words.
☆19Updated last year
Alternatives and similar repositories for fasttext-vector-norms-and-oov-words
Users that are interested in fasttext-vector-norms-and-oov-words are comparing it to the libraries listed below
Sorting:
- Similarity search on Wikipedia using gensim in Python.☆60Updated 6 years ago
- Polyglot skipgram embeddings, and their many health benefits☆12Updated 5 years ago
- a Deep Learning based Speller☆27Updated 6 years ago
- An Implementation of ERNIE For Language Understanding (including Pre-training models and Fine-tuning tools)☆27Updated 5 years ago
- A web application tagging and retrieval of arguments in text☆29Updated 2 years ago
- Deep Semantic Code Search aims to explore a joint embedding space for code and description vectors and then use it for a code search appl…☆65Updated 10 months ago
- ☆21Updated 9 years ago
- TETRE: a Toolkit for Exploring Text for Relation Extraction☆75Updated 7 years ago
- Relatively simple text classification powered by spaCy☆41Updated 9 years ago
- An open relation extraction system☆46Updated 3 years ago
- Neural Network for Automatic Negation Detection☆20Updated 8 years ago
- Automatic labeling for topic model☆57Updated 9 years ago
- 🤹♀️ Query spaCy's linguistic annotations using GraphQL☆86Updated 6 years ago
- Representing research papers as vectors / latent representations.☆198Updated 4 years ago
- Text processing library for sentiment analysis and related tasks☆27Updated 6 years ago
- 🔤 Calculate average word embeddings (word2vec) from documents for transfer learning☆54Updated last year
- Natural Language Data Augmentation Tool for Conversational Systems☆115Updated 2 years ago
- An evaluation of word-embeddings for classification☆32Updated 6 years ago
- An Apache Lucene TokenFilter that uses a word2vec vectors for term expansion.☆24Updated 11 years ago
- Wikipedia-based Explicit Semantic Analysis, as described by Gabrilovich and Markovitch☆35Updated 5 years ago
- Server/Client around Spacy to load spacy only once☆46Updated 7 years ago
- Polyglot is a language identifier for detecting text documents containing text written in more than one language, and for identifying the…☆33Updated 8 years ago
- Twitter named entity extraction for WNUT 2016 http://noisy-text.github.io/2016/ner-shared-task.html☆138Updated 2 years ago
- A spell checker built from GloVe word vectors☆81Updated 7 years ago
- Word Embeddings for Information Retrieval☆225Updated last year
- allennlp + streamlit demo☆22Updated 5 years ago
- Python library for advanced text mining☆69Updated 5 years ago
- Getting recommendations from natural language☆123Updated 5 years ago
- Labeled examples from wiki dumps in Python☆67Updated 8 years ago
- Train word embeddings with Gensim and vizualize them with TensorBoard☆34Updated 6 years ago