somosnlp / corpus-esLinks
Lista de corpus de PLN en español ✨ #Somos600M: Ayuda a desarrollar IA inclusiva que entienda las diferentes variedades de nuestras lenguas ✨ English-speaking contributors welcome!
☆21Updated last year
Alternatives and similar repositories for corpus-es
Users that are interested in corpus-es are comparing it to the libraries listed below
Sorting:
- Official source for spanish Language Models and resources made @ BSC-TEMU within the "Plan de las Tecnologías del Lenguaje" (Plan-TL).☆259Updated 2 years ago
- Course for Interpreting ML Models☆52Updated 2 years ago
- A monolingual and cross-lingual meta-embedding generation and evaluation framework☆80Updated 3 years ago
- A pre-trained language model for social media text in Spanish☆35Updated 2 years ago
- A Python library aimed at dissecting and augmenting NER training data.☆58Updated 2 years ago
- Unannotated Spanish 3 Billion Words Corpora☆105Updated 2 years ago
- An assignment for CMU CS11-711 Advanced NLP, building NLP systems from scratch☆171Updated 2 years ago
- Materials for workshops on the Hugging Face ecosystem☆149Updated 2 years ago
- Web UI & Backend for Data Annotations in Aya☆28Updated last year
- ☆41Updated 5 months ago
- Topic modeling helpers using managed language models from Cohere. Name text clusters using large GPT models.☆223Updated 2 years ago
- A python package for benchmarking interpretability techniques on Transformers.☆213Updated last year
- Efficiently find the best-suited language model (LM) for your NLP task☆127Updated 2 months ago
- Explainable Zero-Shot Topic Extraction☆63Updated last year
- SpanMarker for Named Entity Recognition☆453Updated 9 months ago
- Collection of NLP model explanations and accompanying analysis tools☆144Updated 2 years ago
- WEFE: The Word Embeddings Fairness Evaluation Framework. WEFE is a framework that standardizes the bias measurement and mitigation in Wor…☆181Updated 2 months ago
- [EMNLP 2023 Demo] fabricator - annotating and generating datasets with large language models.☆109Updated last year
- Curso práctico: NLP de cero a cien 🤗☆189Updated last year
- Quote extraction for modular journalism (JournalismAI collab 2021)☆230Updated 3 years ago
- Clustering sentence embeddings to extract message intent☆175Updated 3 years ago
- A multi-lingual approach to AllenNLP CoReference Resolution along with a wrapper for spaCy.☆106Updated last year
- Creating class-based TF-IDF matrices☆89Updated 2 years ago
- Educational materials for universities☆374Updated 2 years ago
- A guide book on data science for busy and equally lazy Data Scientists 😄☆133Updated 2 weeks ago
- Official source for Spanish pretrained biomedical and clinical language models and resources made @ BSC-TEMU within the "Plan de las Tecn…☆26Updated 2 years ago
- Dataset containing scroll interactions of 598 partcipants reading advanced and elementary texts from the OneStopEnglish corpus☆16Updated 3 years ago
- A Streamlit application to visualize sentence embeddings☆18Updated 2 years ago
- This repository contains an easy and intuitive approach to few-shot NER using most similar expansion over spaCy embeddings. Now with enti…☆244Updated 2 years ago
- 💫 SpaCy wrapper for ConceptNet 💫☆95Updated 2 years ago