vackosar / fasttext-vector-norms-and-oov-wordsLinks
This blog post visualize vector norms of FastText embedding and evaluates use of FastText word vector norm multiplied with number of word n-grams for detecting non-english OOV words.
☆19Updated 2 years ago
Alternatives and similar repositories for fasttext-vector-norms-and-oov-words
Users that are interested in fasttext-vector-norms-and-oov-words are comparing it to the libraries listed below
Sorting:
- Deep Semantic Code Search aims to explore a joint embedding space for code and description vectors and then use it for a code search appl…☆66Updated last year
- Polyglot skipgram embeddings, and their many health benefits☆12Updated 5 years ago
- 🤹♀️ Query spaCy's linguistic annotations using GraphQL☆86Updated 7 years ago
- Similarity search on Wikipedia using gensim in Python.☆60Updated 6 years ago
- Natural Language Generation for Gramex applications.☆25Updated 3 years ago
- Natural Language Data Augmentation Tool for Conversational Systems☆115Updated 2 years ago
- TETRE: a Toolkit for Exploring Text for Relation Extraction☆75Updated 8 years ago
- Getting recommendations from natural language☆123Updated 5 years ago
- Named Entity Recognition data for Europeana Newspapers☆173Updated 2 years ago
- 🏖 Easy training and deployment of seq2seq models.☆227Updated 4 years ago
- A guide on extracting entities from raw text in order to conduct social network analysis.☆21Updated 8 years ago
- a contextual, biasable, word-or-sentence-or-paragraph extractive summarizer powered by the latest in text embeddings (Bert, Universal Sen…☆230Updated 2 years ago
- Interactive Model Iteration with Weak Supervision and Pre-Trained Embeddings☆77Updated 3 years ago
- ☆123Updated 2 years ago
- Representing research papers as vectors / latent representations.☆198Updated 4 years ago
- Utilities for preprocessing text for deep learning with Keras☆180Updated 2 years ago
- Word Embeddings for Information Retrieval☆225Updated 2 years ago
- Misspelling Oblivious Word Embeddings☆201Updated 6 years ago
- A way to do annotations for NER. TALEN: Tool for Annotation of Low-resource ENtities☆118Updated 3 months ago
- Unsupervised Language Model Pre-training for French☆247Updated 2 years ago
- Detect common phrases in large amounts of text using a data-driven approach. Size of discovered phrases can be arbitrary. Can be used in …☆129Updated 6 years ago
- Twitter named entity extraction for WNUT 2016 http://noisy-text.github.io/2016/ner-shared-task.html☆140Updated 3 years ago
- A word2vec negative sampling implementation with correct CBOW update.☆260Updated 3 years ago
- A model to transform english into Cypher queries, based off the CLEVR-graph dataset☆75Updated 7 years ago
- Deep learning with text doesn't have to be scary.☆275Updated 2 years ago
- 🔤 Calculate average word embeddings (word2vec) from documents for transfer learning☆54Updated last year
- Tool for exploring Word Vector models☆180Updated 7 years ago
- Python library for Natural Language Preprocessing (NLPre)☆191Updated 2 years ago
- Python library for advanced text mining☆69Updated 5 years ago
- displaCy-ent.js: An open-source named entity visualiser for the modern web☆198Updated 7 years ago