vackosar / fasttext-vector-norms-and-oov-words
This blog post visualizes the vector norms of FastText embeddings and evaluates using the FastText word vector norm multiplied by the number of word n-grams to detect non-English OOV words.
☆19Updated 2 years ago
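The score named in the description (vector norm multiplied by the number of word n-grams) can be sketched roughly as below. This is only an illustration under stated assumptions, not the repository's code: the helper names `count_char_ngrams` and `norm_times_ngrams` are hypothetical, the n-gram range of 3 to 6 characters is assumed from FastText's usual subword defaults, and the vector is a toy list rather than a real FastText embedding.

```python
import math
from typing import Sequence


def count_char_ngrams(word: str, minn: int = 3, maxn: int = 6) -> int:
    """Count character n-grams of the word wrapped in '<' and '>'.

    Assumption: the minn/maxn defaults mirror FastText's common
    subword settings; the repository may use other values.
    """
    wrapped = f"<{word}>"
    total = 0
    for n in range(minn, maxn + 1):
        # A string of length L has L - n + 1 substrings of length n.
        total += max(len(wrapped) - n + 1, 0)
    return total


def norm_times_ngrams(vector: Sequence[float], word: str) -> float:
    """Hypothetical score: Euclidean norm of the vector times the n-gram count."""
    norm = math.sqrt(sum(x * x for x in vector))
    return norm * count_char_ngrams(word)


# Toy vector standing in for a FastText embedding of "cat".
score = norm_times_ngrams([3.0, 4.0], "cat")
print(score)
```

With a real model, the vector would come from the embedding lookup for the word, and any threshold separating English from non-English OOV words would have to be chosen empirically.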
Alternatives and similar repositories for fasttext-vector-norms-and-oov-words
Users that are interested in fasttext-vector-norms-and-oov-words are comparing it to the libraries listed below
- Deep Semantic Code Search aims to explore a joint embedding space for code and description vectors and then use it for a code search appl…☆65Updated last year
- TETRE: a Toolkit for Exploring Text for Relation Extraction☆75Updated 8 years ago
- Natural Language Data Augmentation Tool for Conversational Systems☆115Updated 2 years ago
- Polyglot skipgram embeddings, and their many health benefits☆12Updated 5 years ago
- Similarity search on Wikipedia using gensim in Python.☆60Updated 6 years ago
- a contextual, biasable, word-or-sentence-or-paragraph extractive summarizer powered by the latest in text embeddings (Bert, Universal Sen…☆230Updated 2 years ago
- Set up the CTRL text-generating model on Google Compute Engine with just a few console commands.☆149Updated 5 years ago
- 🤹♀️ Query spaCy's linguistic annotations using GraphQL☆86Updated 7 years ago
- Intuitive Annotation Tool for Information Extraction / Named Entity Recognition using localturk / Amazon Mechanical Turk☆264Updated 6 years ago
- Twitter named entity extraction for WNUT 2016 http://noisy-text.github.io/2016/ner-shared-task.html☆140Updated 3 years ago
- Graph NLU is a natural language understanding tool that leverages the power of graph databases☆86Updated 7 years ago
- A collection of simple tutorials for using Fonduer☆100Updated 4 years ago
- Getting recommendations from natural language☆123Updated 5 years ago
- Inspired by http://nlp.stanford.edu/courses/cs224n/2015/reports/1.pdf☆57Updated 9 years ago
- Textpipe: clean and extract metadata from text☆302Updated 4 years ago
- displaCy-ent.js: An open-source named entity visualiser for the modern web☆198Updated 7 years ago
- Agent-based modelling for resource allocation in viral crises to investigate resource allocation and policy interventions with respect to…☆63Updated 5 years ago
- An Implementation of ERNIE For Language Understanding (including Pre-training models and Fine-tuning tools)☆27Updated 6 years ago
- Detect common phrases in large amounts of text using a data-driven approach. Size of discovered phrases can be arbitrary. Can be used in …☆128Updated 6 years ago
- Misspelling Oblivious Word Embeddings☆201Updated 6 years ago
- LanguageCrunch NLP server docker image☆285Updated 2 years ago
- A model to transform English into Cypher queries, based off the CLEVR-graph dataset☆75Updated 7 years ago
- Word Embeddings for Information Retrieval☆225Updated last year
- Generating labels for topics automatically using neural embeddings☆185Updated 5 months ago
- Python library for advanced text mining☆69Updated 5 years ago
- Lightning Fast Language Prediction 🚀☆167Updated last week
- A guide on extracting entities from raw text in order to conduct social network analysis.☆21Updated 8 years ago
- A way to do annotations for NER. TALEN: Tool for Annotation of Low-resource ENtities☆118Updated last month
- Train a Word2Vec model or LSA model, and Implement Conceptual Search\Semantic Search in Solr\Lucene - Simon Hughes Dice.com, Dice Tech Jo…☆257Updated 6 years ago
- 🏖 Easy training and deployment of seq2seq models.☆227Updated 4 years ago