somosnlp / corpus-es
List of Spanish NLP corpora ✨ #Somos600M: Help develop inclusive AI that understands the different varieties of our languages ✨ English-speaking contributors welcome!
☆23Updated last year
Alternatives and similar repositories for corpus-es
Users who are interested in corpus-es are comparing it to the libraries listed below
- Official source for Spanish Language Models and resources made @ BSC-TEMU within the "Plan de las Tecnologías del Lenguaje" (Plan-TL).☆262Updated 2 years ago
- Course for Interpreting ML Models☆52Updated 2 years ago
- A monolingual and cross-lingual meta-embedding generation and evaluation framework☆79Updated 3 years ago
- Educational materials for universities☆376Updated 2 years ago
- Unannotated Spanish 3 Billion Words Corpora☆105Updated 3 years ago
- Hands-on course: NLP from zero to one hundred 🤗☆188Updated last year
- A pre-trained language model for social media text in Spanish☆35Updated 2 years ago
- Topic modeling helpers using managed language models from Cohere. Name text clusters using large GPT models.☆222Updated 3 years ago
- A Python library aimed at dissecting and augmenting NER training data.☆59Updated 2 years ago
- Materials for workshops on the Hugging Face ecosystem☆150Updated 2 years ago
- Seahorse is a dataset for multilingual, multi-faceted summarization evaluation. It consists of 96K summaries with human ratings along 6 q…☆89Updated last year
- A Python package for benchmarking interpretability techniques on Transformers.☆214Updated last year
- ☆43Updated 7 months ago
- Official source for Spanish pretrained biomedical and clinical language models and resources made @ BSC-TEMU within the "Plan de las Tecn…☆26Updated 3 years ago
- ☆24Updated 2 years ago
- SpanMarker for Named Entity Recognition☆462Updated 11 months ago
- ☆84Updated 2 years ago
- [EMNLP 2023 Demo] fabricator - annotating and generating datasets with large language models.☆111Updated last year
- WEFE: The Word Embeddings Fairness Evaluation Framework. WEFE is a framework that standardizes the bias measurement and mitigation in Wor…☆182Updated 3 weeks ago
- A lightweight Python library for constructing, processing, and visualizing constituent trees.☆68Updated 3 weeks ago
- A library to synthesize text datasets using Large Language Models (LLM)☆152Updated 2 years ago
- Collection of NLP model explanations and accompanying analysis tools☆144Updated 2 years ago
- BETO - Spanish version of the BERT model☆500Updated 2 years ago
- Set of vectorizers that extract keyphrases with part-of-speech patterns from a collection of text documents and convert them into a docum…☆266Updated last year
- ✨ Bootstrap annotation with zero- & few-shot learning via OpenAI GPT-3☆323Updated 2 years ago
- A guide book on data science for busy and equally lazy Data Scientists 😄☆135Updated last month
- ☆40Updated 3 years ago
- An assignment for CMU CS11-711 Advanced NLP, building NLP systems from scratch☆171Updated 3 years ago
- A spaCy custom component that extracts and normalizes temporal expressions☆56Updated 2 years ago
- Spanish Billion Word Corpus and Embeddings☆51Updated 3 years ago