dccuchile / lightweight-spanish-language-modelsLinks
ALBETO and DistilBETO are versions of ALBERT and DistilBERT pre-trained exclusively on Spanish corpora.
☆40Updated 2 years ago
Alternatives and similar repositories for lightweight-spanish-language-models
Users that are interested in lightweight-spanish-language-models are comparing it to the libraries listed below
Sorting:
- ☆43Updated 8 months ago
- Unannotated Spanish 3 Billion Words Corpora☆104Updated 3 years ago
- BETO - Spanish version of the BERT model☆499Updated 2 years ago
- Spanish word embeddings computed with different methods and from different corpora☆364Updated 6 years ago
- A Python multilingual toolkit for Sentiment Analysis and Social NLP tasks☆640Updated last year
- WEFE: The Word Embeddings Fairness Evaluation Framework. WEFE is a framework that standardizes the bias measurement and mitigation in Wor…☆182Updated last month
- Official source for spanish Language Models and resources made @ BSC-TEMU within the "Plan de las Tecnologías del Lenguaje" (Plan-TL).☆262Updated 2 years ago
- Natural Language Processing☆265Updated 6 months ago
- A french sequence to sequence pretrained model☆62Updated 3 years ago
- Sentiment analysis and emotion classification for Italian using BERT (fine-tuning). Published at the WASSA workshop (EACL2021).☆27Updated last year
- A Python library for calculating a large variety of metrics from text☆358Updated last year
- How good is BERT ? Comparing BERT to other state-of-the-art approaches on a French sentiment analysis dataset☆157Updated 2 years ago
- ☆23Updated 4 years ago
- The robust European language model benchmark.☆152Updated this week
- Full named-entity (i.e., not tag/token) evaluation metrics based on SemEval’13☆201Updated 4 months ago
- Augmenty is an augmentation library based on spaCy for augmenting texts.☆156Updated last year
- A pre-trained language model for social media text in Spanish☆35Updated 2 years ago
- This repository contains an easy and intuitive approach to few-shot classification using sentence-transformers or spaCy models, or zero-s…☆220Updated last year
- A module to compute textual lexical richness (aka lexical diversity).☆112Updated 2 years ago
- DaCy: The State of the Art Danish NLP pipeline using SpaCy☆99Updated last year
- ☆181Updated last year
- Ready to use Spanish Word2Vec embeddings created from >18B chars and >3B words☆45Updated 6 years ago
- Set of vectorizers that extract keyphrases with part-of-speech patterns from a collection of text documents and convert them into a docum…☆266Updated last year
- A Scandinavian Benchmark for sentence embeddings☆44Updated last month
- SpanMarker for Named Entity Recognition☆464Updated last year
- Portuguese translation of the GLUE benchmark and Scitail dataset☆32Updated 3 years ago
- Simple multilingual lemmatizer for Python, especially useful for speed and efficiency☆182Updated 7 months ago
- This is a simple Python package for calculating a variety of lexical diversity indices☆82Updated 2 years ago
- Curated list of open-access/open-source/off-the-shelf resources and tools developed with a particular focus on German☆514Updated last year
- The Greek NLP toolkit for Python. Supports NER/DP/POS Tagging/Greeklish-to-Greek Transliteration. Visit the web demo here: https://huggin…☆81Updated 6 months ago