emsi / wordvectorsLinks
How to train Word2Vec for your language.
☆10Updated 8 years ago
Alternatives and similar repositories for wordvectors
Users that are interested in wordvectors are comparing it to the libraries listed below
Sorting:
- ☆30Updated 3 years ago
- Popular stopwords for general languages - very usefull for building dictionaries, searchers or text indexes☆44Updated 12 years ago
- A curated list of resources dedicated to Natural Language Processing (NLP) in polish. Models, tools, datasets.☆306Updated 4 years ago
- RoBERTa models for Polish☆89Updated 3 years ago
- Pre-trained models and language resources for Natural Language Processing in Polish☆364Updated last year
- Text to sentence splitter using heuristic algorithm by Philipp Koehn and Josh Schroeder.☆256Updated 3 years ago
- Evaluation of Sentence Representations in Polish☆23Updated 3 years ago
- 🤘Lemmy is a lemmatizer for Danish 🇩🇰 and Swedish 🇸🇪☆79Updated 4 years ago
- Polish morphological tagger.☆43Updated 2 years ago
- Polish RoBERTA model trained on Polish literature, Wikipedia, and Oscar. The major assumption is that quality text will give a good mode…☆35Updated 4 years ago
- Multilingual Rapid Automatic Keyword Extraction (RAKE) for Python☆272Updated 2 years ago
- ☆30Updated 3 years ago
- Spanish word embeddings computed with different methods and from different corpora☆364Updated 6 years ago
- Docker images for production NLP usage including deep learning☆35Updated 7 years ago
- Intelligently expand and create contractions in text leveraging grammar checking and Word Mover's Distance.☆79Updated 4 years ago
- CLASSLA Fork of the Official Stanford NLP Python Library for Many Human Languages☆46Updated 8 months ago
- BERT model trained from scratch on Finnish☆95Updated 4 years ago
- Romanian Named Entity Corpus (RONEC) version 2.0☆65Updated 3 years ago
- BERTje is a Dutch pre-trained BERT model developed at the University of Groningen. (EMNLP Findings 2020) "What’s so special about BERT’s …☆141Updated 2 years ago
- NLP French language model implementing ULMFiT☆87Updated 6 years ago
- Resources for doing NLP in Polish☆48Updated 6 years ago
- ☆50Updated 3 years ago
- UmBERTo: an Italian Language Model trained with Whole Word Masking.☆110Updated 3 years ago
- spaCy REST API, wrapped in a Docker container.☆267Updated 2 years ago
- Happy Transformer makes it easy to fine-tune and perform inference with NLP Transformer models.☆540Updated 3 weeks ago
- Transformer language model (GPT-2) with sentencepiece tokenizer☆164Updated 4 years ago
- HerBERT is a BERT-based Language Model trained on Polish Corpora using only MLM objective with dynamic masking of whole words.☆70Updated 3 years ago
- Jupyter Notebooks with Deep Learning Tutorials☆206Updated 6 years ago
- PYthon Automated Term Extraction☆318Updated 2 years ago
- Extraction of the journalistic five W and one H questions (5W1H) from news articles: who did what, when, where, why, and how?☆528Updated last year