ccoreilly / spacy-catalaLinks
Spacy NLP Model for the Catalan language
☆16Updated 4 years ago
Alternatives and similar repositories for spacy-catala
Users that are interested in spacy-catala are comparing it to the libraries listed below
Sorting:
- 🤖 Deep Catalan: Bring closer the Catalan Language to Deep Learning using ULMFit.☆12Updated 4 years ago
- Catalan bert model☆12Updated 4 years ago
- Deepspeech ASR Model for the Catalan Language☆17Updated 4 years ago
- This repository contains Neural Machine Translation tools built at Softcatalà☆43Updated 4 months ago
- Wav2Vec 2.0 catalan training scripts and models☆12Updated 4 years ago
- BETO - Spanish version of the BERT model☆497Updated last year
- Public domain corpus of Catalan text☆17Updated 3 years ago
- Official source for Catalan Language Models and resources made within Aina project.☆24Updated last year
- Unannotated Spanish 3 Billion Words Corpora☆102Updated 2 years ago
- Compound splitter for German language ("Komposita-Zerlegung") based on large dictionary combined with highly efficient multi-pattern stri…☆29Updated 3 years ago
- ALBETO and DistilBETO are versions of ALBERT and DistilBERT pre-trained exclusively on Spanish corpora.☆37Updated 2 years ago
- Tooling for producing Italian model (public release available) for DeepSpeech and text corpus☆94Updated 3 years ago
- A data set and model for german sentiment classification.☆67Updated last month
- German word embeddings computed from a corpus of parliamentary transcripts (2017-2019)☆14Updated 5 years ago
- Text-to-Speech conversor for Basque and Spanish. It includes linguistic processing and built voices for the languages aforementioned. Its…☆17Updated last year
- ☆140Updated 4 years ago
- Catalan ALBERT (A Lite BERT for self-supervised learning of language representations)☆14Updated 5 years ago
- BERTje is a Dutch pre-trained BERT model developed at the University of Groningen. (EMNLP Findings 2020) "What’s so special about BERT’s …☆138Updated 2 years ago
- Free Dutch voice dataset☆12Updated 4 years ago
- 🤘Lemmy is a lemmatizer for Danish 🇩🇰 and Swedish 🇸🇪☆77Updated 3 years ago
- ParlaMint: Comparable Parliamentary Corpora☆62Updated this week
- finite-state toolkit, EM and Bayesian (Gibbs sampling) training for FST and context-free derivation forests☆14Updated 8 years ago
- Code related to the Dutch instance and user groups of the KALDI speech recognition toolkit☆66Updated last year
- CogNet: a large-scale, high-quality cognate database for 338 languages, 1.07M words, and 8.1 million cognates☆49Updated 2 years ago
- An NLP pipeline for Hebrew☆38Updated last month
- A Python multilingual toolkit for Sentiment Analysis and Social NLP tasks☆598Updated last year
- Pre-production releases for Spacy in Catalan☆14Updated 3 years ago
- A french sequence to sequence pretrained model☆62Updated 2 years ago
- Coqui STT (🐸STT) based forced alignment tool☆13Updated 3 years ago
- A curated list of resources dedicated to Natural Language Processing (NLP) in polish. Models, tools, datasets.☆301Updated 3 years ago