emsi / wordvectorsLinks
How to train Word2Vec for your language.
☆11Updated 7 years ago
Alternatives and similar repositories for wordvectors
Users that are interested in wordvectors are comparing it to the libraries listed below
Sorting:
- ☆30Updated 2 years ago
- HuSpaCy: industrial-strength Hungarian natural language processing☆171Updated last month
- Spacy pipeline object for extracting values that correspond to a named entity (e.g., birth dates, account numbers, laboratory results)☆55Updated 3 years ago
- Building a text classifier with extremely small datasets☆44Updated 5 years ago
- Docker images for production NLP usage including deep learning☆35Updated 6 years ago
- PYthon Automated Term Extraction☆315Updated 2 years ago
- Information extraction from English and German texts based on predicate logic☆391Updated 3 years ago
- This repository contains an easy and intuitive approach to few-shot classification using sentence-transformers or spaCy models, or zero-s…☆219Updated 7 months ago
- N-gram Extraction Approaches (bigrams, trigrams)☆43Updated 6 years ago
- RoBERTa models for Polish☆88Updated 3 years ago
- Polish RoBERTA model trained on Polish literature, Wikipedia, and Oscar. The major assumption is that quality text will give a good mode…☆35Updated 4 years ago
- A simple component to display annotated text in Streamlit apps.☆551Updated 7 months ago
- ☆50Updated 3 years ago
- Evaluation of Sentence Representations in Polish☆22Updated 2 years ago
- 🍳 Recipes for the Prodigy, our fully scriptable annotation tool☆498Updated last year
- Multilingual Rapid Automatic Keyword Extraction (RAKE) for Python☆272Updated 2 years ago
- This repo is the home of Romanian Transformers.☆105Updated 2 years ago
- Extraction of the journalistic five W and one H questions (5W1H) from news articles: who did what, when, where, why, and how?☆525Updated 10 months ago
- LexRank algorithm for text summarization☆230Updated last year
- Character-based word embeddings model based on RNN for handling real world texts☆174Updated last year
- Segment a HTML document into structural data☆12Updated 6 years ago
- A spaCy wrapper for DBpedia Spotlight☆110Updated 2 years ago
- UmBERTo: an Italian Language Model trained with Whole Word Masking.☆107Updated 2 years ago
- Set of vectorizers that extract keyphrases with part-of-speech patterns from a collection of text documents and convert them into a docum…☆265Updated 9 months ago
- Named Entity Recognition (NER) Annotation tool for SpaCy. Generates Traning Data as a JSON which can be readily used.☆585Updated 6 months ago
- Clustering sentence embeddings to extract message intent☆175Updated 3 years ago
- ☆30Updated 2 years ago
- ✔️Contextual word checker for better suggestions (not actively maintained)☆417Updated 7 months ago
- 📗 Score text readability using a number of formulas: Flesch-Kincaid Grade Level, Gunning Fog, ARI, Dale Chall, SMOG, and more☆386Updated 11 months ago
- SpikeX - SpaCy Pipes for Knowledge Extraction☆399Updated 4 years ago