somosnlp / corpus-es
Lista de corpus de PLN en español ✨ #Somos600M: Ayuda a desarrollar IA inclusiva que entienda las diferentes variedades de nuestras lenguas ✨ English-speaking contributors welcome!
☆18Updated 10 months ago
Related projects ⓘ
Alternatives and complementary repositories for corpus-es
- Official source for spanish Language Models and resources made @ BSC-TEMU within the "Plan de las Tecnologías del Lenguaje" (Plan-TL).☆253Updated last year
- A pre-trained language model for social media text in Spanish☆34Updated last year
- Curso práctico: NLP de cero a cien 🤗☆183Updated 7 months ago
- Spanish word embeddings computed with different methods and from different corpora☆356Updated 5 years ago
- Unannotated Spanish 3 Billion Words Corpora☆92Updated 2 years ago
- Official source for Spanish pretrained biomedical and clinical language models and resources made @ BSC-TEMU within the "Plan de las Tecn…☆26Updated 2 years ago
- BETO - Spanish version of the BERT model☆492Updated last year
- A Python multilingual toolkit for Sentiment Analysis and Social NLP tasks☆559Updated 4 months ago
- Spanish Billion Word Corpus and Embeddings☆45Updated last year
- Ready to use Spanish Word2Vec embeddings created from >18B chars and >3B words☆41Updated 5 years ago
- WEFE: The Word Embeddings Fairness Evaluation Framework. WEFE is a framework that standardizes the bias measurement and mitigation in Wor…☆175Updated 5 months ago
- ☆32Updated last year
- Natural Language Processing☆230Updated 4 months ago
- Ejercicios para aprender a hacer NLP impulsado por las librerías de Hugging Face.☆24Updated 2 years ago
- Página web de Somos NLP 🤗 ¡Publica en nuestro blog!☆22Updated this week
- Educational materials for universities☆335Updated last year
- Specialization of BERT architecture both for the Spanish language and the Twitter domain☆13Updated 4 years ago
- spanlp: nlp applied for spanish vulgarity. A fast, robust Python library to check for profanity or offensive language in Spanish strings.…☆36Updated 5 months ago
- Spanish rule-based lemmatization for spaCy☆37Updated 2 years ago
- ☆61Updated last year
- Resources for GLUE benchmark in Spanish☆15Updated 3 years ago
- A Python package for automatically training and comparing language models.☆50Updated 6 months ago
- ☆23Updated 3 years ago
- Augmenty is an augmentation library based on spaCy for augmenting texts.☆151Updated 5 months ago
- This repository contains an easy and intuitive approach to use SetFit in combination with spaCy.☆72Updated last year
- List of research and engineering of NLP for American Native/Indigenous Languages.☆87Updated 3 years ago
- [LREC 2022] An off-the-shelf pre-trained Tweet NLP Toolkit (NER, tokenization, lemmatization, POS tagging, dependency parsing) + Tweeban…☆102Updated 9 months ago
- A monolingual and cross-lingual meta-embedding generation and evaluation framework☆80Updated 2 years ago
- En este proyecto de GitHhub podrás encontrar parte del material que utilizo para impartir las clases de Procesamiento de Lenguaje Natura…☆22Updated last year
- Langchain 101 en Español☆73Updated 11 months ago