emsi / wordvectorsLinks
How to train Word2Vec for your language.
☆11Updated 7 years ago
Alternatives and similar repositories for wordvectors
Users that are interested in wordvectors are comparing it to the libraries listed below
Sorting:
- ☆30Updated 2 years ago
- A curated list of resources dedicated to Natural Language Processing (NLP) in polish. Models, tools, datasets.☆300Updated 3 years ago
- Polish RoBERTA model trained on Polish literature, Wikipedia, and Oscar. The major assumption is that quality text will give a good mode…☆35Updated 4 years ago
- Polish morphological tagger.☆43Updated 2 years ago
- Evaluation of Sentence Representations in Polish☆22Updated 2 years ago
- RoBERTa models for Polish☆87Updated 3 years ago
- Polish BERT☆70Updated 4 years ago
- Popular stopwords for general languages - very usefull for building dictionaries, searchers or text indexes☆45Updated 11 years ago
- This repository contains an easy and intuitive approach to few-shot NER using most similar expansion over spaCy embeddings. Now with enti…☆245Updated 2 years ago
- Pre-trained models and language resources for Natural Language Processing in Polish☆342Updated last year
- HerBERT is a BERT-based Language Model trained on Polish Corpora using only MLM objective with dynamic masking of whole words.☆67Updated 3 years ago
- ☆50Updated 2 years ago
- Package that returns a company embedding given a company name☆46Updated 5 years ago
- Romanian WordNet (Data + API for Python)☆52Updated 4 years ago
- HuSpaCy: industrial-strength Hungarian natural language processing☆169Updated 8 months ago
- spaCy pipeline object for negating concepts in text☆281Updated last week
- Algorithms to categorize products and do named entity recognition on words in product descriptions☆246Updated last year
- A Python library for calculating a large variety of metrics from text☆340Updated 6 months ago
- Building a text classifier with extremely small datasets☆44Updated 5 years ago
- Python library for Natural Language Preprocessing (NLPre)☆191Updated last year
- NLP French language model implementing ULMFiT☆87Updated 6 years ago
- Shared BERT model for 4 languages of Bulgarian, Czech, Polish and Russian. Slavic NER model.☆76Updated 3 years ago
- Resources for doing NLP in Polish☆47Updated 5 years ago
- 🏖TagEditor - Annotation tool for spaCy☆192Updated 2 years ago
- Live survey of off-the-shelf language identification tools for python☆26Updated 3 years ago
- Full named-entity (i.e., not tag/token) evaluation metrics based on SemEval’13☆182Updated 2 weeks ago
- E3C is a freely available multilingual corpus (Italian, English, French, Spanish, and Basque) of semantically annotated clinical narrativ…☆25Updated last year
- Character-based word embeddings model based on RNN for handling real world texts☆173Updated last year
- ☆30Updated 2 years ago
- A spaCy wrapper for DBpedia Spotlight☆110Updated 2 years ago