vackosar / fasttext-vector-norms-and-oov-wordsLinks
This blog post visualize vector norms of FastText embedding and evaluates use of FastText word vector norm multiplied with number of word n-grams for detecting non-english OOV words.
☆19Updated 2 years ago
Alternatives and similar repositories for fasttext-vector-norms-and-oov-words
Users that are interested in fasttext-vector-norms-and-oov-words are comparing it to the libraries listed below
Sorting:
- Polyglot skipgram embeddings, and their many health benefits☆12Updated 6 years ago
- Deep Semantic Code Search aims to explore a joint embedding space for code and description vectors and then use it for a code search appl…☆66Updated last year
- TETRE: a Toolkit for Exploring Text for Relation Extraction☆75Updated 8 years ago
- Similarity search on Wikipedia using gensim in Python.☆60Updated 7 years ago
- Getting recommendations from natural language☆123Updated 5 years ago
- Natural Language Data Augmentation Tool for Conversational Systems☆115Updated 3 years ago
- Generating labels for topics automatically using neural embeddings☆185Updated 4 months ago
- Misspelling Oblivious Word Embeddings☆201Updated 6 years ago
- Automatic labeling for topic model☆57Updated 10 years ago
- Twitter named entity extraction for WNUT 2016 http://noisy-text.github.io/2016/ner-shared-task.html☆140Updated 3 years ago
- ☆123Updated 2 years ago
- Python library for advanced text mining☆69Updated 5 years ago
- Word Embeddings for Information Retrieval☆225Updated 2 years ago
- 🤹♀️ Query spaCy's linguistic annotations using GraphQL☆86Updated 7 years ago
- 🏖 Easy training and deployment of seq2seq models.☆228Updated 4 years ago
- GloVe word vector embedding experiments (similar to Word2Vec)☆67Updated 2 years ago
- Train a Word2Vec model or LSA model, and Implement Conceptual Search\Semantic Search in Solr\Lucene - Simon Hughes Dice.com, Dice Tech Jo…☆259Updated 6 years ago
- Organized Resources for Deep Learning in Natural Language Processing☆438Updated 5 years ago
- Named Entity Recognition data for Europeana Newspapers☆173Updated 2 years ago
- A way to do annotations for NER. TALEN: Tool for Annotation of Low-resource ENtities☆118Updated 6 months ago
- Tool for exploring Word Vector models☆181Updated 7 years ago
- Deep learning with text doesn't have to be scary.☆274Updated 3 years ago
- Meta-repository for the open-source version of the SUMMA Platform☆16Updated last year
- a contextual, biasable, word-or-sentence-or-paragraph extractive summarizer powered by the latest in text embeddings (Bert, Universal Sen…☆228Updated 3 years ago
- Inspired by http://nlp.stanford.edu/courses/cs224n/2015/reports/1.pdf☆57Updated 9 years ago
- The project contains tutorials and resources for the sequence to sql technieques.☆94Updated 2 years ago
- Automated Outlier Detection and Treatment Tool☆102Updated 3 years ago
- Representing research papers as vectors / latent representations.☆198Updated 4 years ago
- displaCy-ent.js: An open-source named entity visualiser for the modern web☆200Updated 7 years ago
- Set up the CTRL text-generating model on Google Compute Engine with just a few console commands.☆151Updated 6 years ago