vackosar / fasttext-vector-norms-and-oov-words
This blog post visualize vector norms of FastText embedding and evaluates use of FastText word vector norm multiplied with number of word n-grams for detecting non-english OOV words.
☆19Updated last year
Alternatives and similar repositories for fasttext-vector-norms-and-oov-words:
Users that are interested in fasttext-vector-norms-and-oov-words are comparing it to the libraries listed below
- An Implementation of ERNIE For Language Understanding (including Pre-training models and Fine-tuning tools)☆27Updated 5 years ago
- Similarity search on Wikipedia using gensim in Python.☆60Updated 6 years ago
- Deep Semantic Code Search aims to explore a joint embedding space for code and description vectors and then use it for a code search appl…☆65Updated 9 months ago
- Implementation of GloVe in Keras☆45Updated 2 years ago
- A spell checker built from GloVe word vectors☆81Updated 6 years ago
- An open relation extraction system☆46Updated 3 years ago
- A collection of simple tutorials for using Fonduer☆99Updated 4 years ago
- Build a deep learning model for predicting the named entities from text.☆56Updated 6 years ago
- a Deep Learning based Speller☆27Updated 6 years ago
- A web application tagging and retrieval of arguments in text☆28Updated last year
- Word Embeddings for Information Retrieval☆225Updated last year
- TETRE: a Toolkit for Exploring Text for Relation Extraction☆75Updated 7 years ago
- GloVe word vector embedding experiments (similar to Word2Vec)☆67Updated last year
- Neural Network for Automatic Negation Detection☆20Updated 8 years ago
- ALMa (Active Learning Manager) Keeps track of labeled and unlabeled data for active learning☆41Updated 4 years ago
- Polyglot skipgram embeddings, and their many health benefits☆12Updated 5 years ago
- A way to do annotations for NER. TALEN: Tool for Annotation of Low-resource ENtities☆114Updated 2 years ago
- DKPro WSD: A Java framework for word sense disambiguation☆20Updated 2 years ago
- Twitter named entity extraction for WNUT 2016 http://noisy-text.github.io/2016/ner-shared-task.html☆138Updated 2 years ago
- Text processing library for sentiment analysis and related tasks☆27Updated 6 years ago
- 🤹♀️ Query spaCy's linguistic annotations using GraphQL☆86Updated 6 years ago
- N3 - A Collection of Datasets for Named Entity Recognition and Disambiguation in the NLP Interchange Format☆70Updated 7 years ago
- 🏖 Easy training and deployment of seq2seq models.☆228Updated 4 years ago
- Python library for advanced text mining☆69Updated 5 years ago
- Datasets I have created for scientific summarization, and a trained BertSum model☆115Updated 5 years ago
- An example on how to train supervised classifiers for multi-label text classification using sklearn pipelines☆109Updated 6 years ago
- Natural language processing (NLP) newsletter right on GitHub☆60Updated 5 years ago
- Natural Language Generation for Gramex applications.☆24Updated 2 years ago
- Utilities for preprocessing text for deep learning with Keras☆180Updated 2 years ago
- Server/Client around Spacy to load spacy only once☆46Updated 7 years ago