somosnlp / corpus-es
List of Spanish NLP corpora ✨ #Somos600M: Help develop inclusive AI that understands the different varieties of our languages ✨ English-speaking contributors welcome!
☆22 Updated last year
Alternatives and similar repositories for corpus-es
Users who are interested in corpus-es are comparing it to the repositories listed below
- Official source for Spanish language models and resources made @ BSC-TEMU within the "Plan de las Tecnologías del Lenguaje" (Plan-TL). ☆260 Updated 2 years ago
- Unannotated Spanish 3 Billion Words Corpora ☆102 Updated 2 years ago
- A monolingual and cross-lingual meta-embedding generation and evaluation framework ☆80 Updated 3 years ago
- Clustering sentence embeddings to extract message intent ☆175 Updated 3 years ago
- A pre-trained language model for social media text in Spanish ☆35 Updated 2 years ago
- A Python library aimed at dissecting and augmenting NER training data. ☆58 Updated 2 years ago
- ☆40 Updated 3 months ago
- WEFE: The Word Embeddings Fairness Evaluation Framework. WEFE is a framework that standardizes the bias measurement and mitigation in Wor… ☆180 Updated last week
- Hands-on course: NLP from zero to a hundred 🤗 ☆190 Updated last year
- SpanMarker for Named Entity Recognition ☆444 Updated 7 months ago
- Topic modeling helpers using managed language models from Cohere. Name text clusters using large GPT models. ☆223 Updated 2 years ago
- A Python package for benchmarking interpretability techniques on Transformers. ☆214 Updated 10 months ago
- Creating class-based TF-IDF matrices ☆88 Updated 2 years ago
- ✨ Bootstrap annotation with zero- & few-shot learning via OpenAI GPT-3 ☆323 Updated 2 years ago
- 💫 spaCy wrapper for ConceptNet 💫 ☆94 Updated last year
- Spanish word embeddings computed with different methods and from different corpora ☆360 Updated 5 years ago
- Enterprise Scale NLP with Hugging Face & SageMaker Workshop series ☆240 Updated 2 years ago
- BETO - Spanish version of the BERT model ☆497 Updated last year
- A Python multilingual toolkit for Sentiment Analysis and Social NLP tasks ☆604 Updated last year
- Full named-entity (i.e., not tag/token) evaluation metrics based on SemEval’13 ☆183 Updated 3 weeks ago
- A Streamlit application to visualize sentence embeddings ☆19 Updated 2 years ago
- Spanish Billion Word Corpus and Embeddings ☆48 Updated 2 years ago
- [EMNLP 2023 Demo] fabricator - annotating and generating datasets with large language models. ☆108 Updated last year
- Official source for Spanish pretrained biomedical and clinical language models and resources made @ BSC-TEMU within the "Plan de las Tecn… ☆26 Updated 2 years ago
- Educational materials for universities ☆369 Updated last year
- Web UI & Backend for Data Annotations in Aya ☆28 Updated last year
- Healthsea is a spaCy pipeline for analyzing user reviews of supplementary products for their effects on health. ☆92 Updated 3 years ago
- Collection of NLP model explanations and accompanying analysis tools ☆144 Updated 2 years ago
- This is where we share notebooks/projects used in our YouTube channel ☆148 Updated 4 years ago
- Mining Legal Arguments in Court Decisions - Data and software ☆68 Updated 2 years ago