somosnlp / corpus-esLinks
Lista de corpus de PLN en español ✨ #Somos600M: Ayuda a desarrollar IA inclusiva que entienda las diferentes variedades de nuestras lenguas ✨ English-speaking contributors welcome!
☆22Updated last year
Alternatives and similar repositories for corpus-es
Users that are interested in corpus-es are comparing it to the libraries listed below
Sorting:
- Official source for spanish Language Models and resources made @ BSC-TEMU within the "Plan de las Tecnologías del Lenguaje" (Plan-TL).☆261Updated 2 years ago
- A pre-trained language model for social media text in Spanish☆35Updated 2 years ago
- Unannotated Spanish 3 Billion Words Corpora☆105Updated 3 years ago
- ☆42Updated 6 months ago
- Curso práctico: NLP de cero a cien 🤗☆188Updated last year
- Topic modeling helpers using managed language models from Cohere. Name text clusters using large GPT models.☆222Updated 2 years ago
- Course for Interpreting ML Models☆52Updated 2 years ago
- WEFE: The Word Embeddings Fairness Evaluation Framework. WEFE is a framework that standardizes the bias measurement and mitigation in Wor…☆182Updated 3 months ago
- Spanish word embeddings computed with different methods and from different corpora☆362Updated 6 years ago
- BETO - Spanish version of the BERT model☆499Updated 2 years ago
- A monolingual and cross-lingual meta-embedding generation and evaluation framework☆79Updated 3 years ago
- UmBERTo: an Italian Language Model trained with Whole Word Masking.☆110Updated 2 years ago
- this is where we share notebooks/projects used in your youtube channel☆149Updated 4 years ago
- Educational materials for universities☆375Updated 2 years ago
- This repository contains an easy and intuitive approach to use SetFit in combination with spaCy.☆80Updated 2 years ago
- ALBETO and DistilBETO are versions of ALBERT and DistilBERT pre-trained exclusively on Spanish corpora.☆38Updated 2 years ago
- ☆24Updated 2 years ago
- SpanMarker for Named Entity Recognition☆463Updated 10 months ago
- TimeLMs: Diachronic Language Models from Twitter☆111Updated last year
- Explainable Zero-Shot Topic Extraction☆63Updated last year
- Fine-tuning a Hugging Face BERT model for the United Nations Named Entity Recognition task.☆35Updated 4 years ago
- ✨ Bootstrap annotation with zero- & few-shot learning via OpenAI GPT-3☆322Updated 2 years ago
- An assignment for CMU CS11-711 Advanced NLP, building NLP systems from scratch☆171Updated 2 years ago
- Collection of NLP model explanations and accompanying analysis tools☆144Updated 2 years ago
- Spanish data from the AnCora corpus.☆31Updated last week
- A Python multilingual toolkit for Sentiment Analysis and Social NLP tasks☆635Updated last year
- A Python library aimed at dissecting and augmenting NER training data.☆59Updated 2 years ago
- This repository contains an easy and intuitive approach to few-shot NER using most similar expansion over spaCy embeddings. Now with enti…☆244Updated 2 years ago
- Creating class-based TF-IDF matrices☆90Updated 3 years ago
- Repo for the Belebele dataset, a massively multilingual reading comprehension dataset.☆336Updated 11 months ago