vackosar / fasttext-vector-norms-and-oov-words
This blog post visualize vector norms of FastText embedding and evaluates use of FastText word vector norm multiplied with number of word n-grams for detecting non-english OOV words.
☆19Updated last year
Related projects ⓘ
Alternatives and complementary repositories for fasttext-vector-norms-and-oov-words
- Similarity search on Wikipedia using gensim in Python.☆61Updated 5 years ago
- a Deep Learning based Speller☆27Updated 5 years ago
- Deep Semantic Code Search aims to explore a joint embedding space for code and description vectors and then use it for a code search appl…☆65Updated 3 months ago
- Polyglot skipgram embeddings, and their many health benefits☆11Updated 4 years ago
- Natural Language Data Augmentation Tool for Conversational Systems☆116Updated last year
- A spell checker built from GloVe word vectors☆81Updated 6 years ago
- A way to do annotations for NER. TALEN: Tool for Annotation of Low-resource ENtities☆112Updated 2 years ago
- TETRE: a Toolkit for Exploring Text for Relation Extraction☆76Updated 7 years ago
- A web application tagging and retrieval of arguments in text☆30Updated last year
- Automatic labeling for topic model☆57Updated 9 years ago
- Query-Document Relevance☆42Updated 9 years ago
- Fast supervised sentence boundary detection using the averaged perceptron☆90Updated 5 years ago
- A Multilingual Latent Dirichlet Allocation (LDA) Pipeline with Stop Words Removal, n-gram features, and Inverse Stemming, in Python.☆82Updated 4 months ago
- Misspelling Oblivious Word Embeddings☆202Updated 5 years ago
- An open relation extraction system☆46Updated 2 years ago
- An Implementation of ERNIE For Language Understanding (including Pre-training models and Fine-tuning tools)☆27Updated 5 years ago
- ☆21Updated 8 years ago
- ☆54Updated 9 years ago
- Incremental learning of word embeddings with context informativeness.☆95Updated last year
- allennlp + streamlit demo☆22Updated 5 years ago
- Implementation of GloVe in Keras☆45Updated last year
- ☆123Updated last year
- A visualisation tool for Spacy using Hierplane.☆65Updated last year
- Making sense embedding out of word embeddings using graph-based word sense induction☆212Updated 3 years ago
- Interactive Model Iteration with Weak Supervision and Pre-Trained Embeddings☆76Updated 2 years ago
- 🔤 Calculate average word embeddings (word2vec) from documents for transfer learning☆54Updated 6 months ago
- Semantic search using Transformers and others☆110Updated 4 years ago
- Word Embeddings for Information Retrieval☆226Updated last year
- Example using Polyaxon to experiment with pre-training spaCy☆65Updated 3 years ago
- Relatively simple text classification powered by spaCy☆42Updated 9 years ago