somosnlp / corpus-esLinks
Lista de corpus de PLN en español ✨ #Somos600M: Ayuda a desarrollar IA inclusiva que entienda las diferentes variedades de nuestras lenguas ✨ English-speaking contributors welcome!
☆21Updated last year
Alternatives and similar repositories for corpus-es
Users that are interested in corpus-es are comparing it to the libraries listed below
Sorting:
- Official source for spanish Language Models and resources made @ BSC-TEMU within the "Plan de las Tecnologías del Lenguaje" (Plan-TL).☆260Updated 2 years ago
- Course for Interpreting ML Models☆52Updated 2 years ago
- ☆23Updated 2 years ago
- Unannotated Spanish 3 Billion Words Corpora☆105Updated 3 years ago
- A monolingual and cross-lingual meta-embedding generation and evaluation framework☆80Updated 3 years ago
- ☆41Updated 6 months ago
- Topic modeling helpers using managed language models from Cohere. Name text clusters using large GPT models.☆223Updated 2 years ago
- Curso práctico: NLP de cero a cien 🤗☆188Updated last year
- Spanish word embeddings computed with different methods and from different corpora☆361Updated 6 years ago
- Clustering sentence embeddings to extract message intent☆174Updated 4 years ago
- WEFE: The Word Embeddings Fairness Evaluation Framework. WEFE is a framework that standardizes the bias measurement and mitigation in Wor…☆182Updated 3 months ago
- Seahorse is a dataset for multilingual, multi-faceted summarization evaluation. It consists of 96K summaries with human ratings along 6 q…☆89Updated last year
- Some notebooks for NLP☆207Updated last year
- this is where we share notebooks/projects used in your youtube channel☆149Updated 4 years ago
- Camoscio: An Italian instruction-tuned language model based on LLaMA☆127Updated last year
- MAFAND-MT☆59Updated last year
- Enterprise Scale NLP with Hugging Face & SageMaker Workshop series☆240Updated 2 years ago
- A Python library aimed at dissecting and augmenting NER training data.☆59Updated 2 years ago
- Efficiently find the best-suited language model (LM) for your NLP task☆127Updated 3 months ago
- BETO - Spanish version of the BERT model☆499Updated 2 years ago
- Benchmarks for Evaluating Spanish Language Models☆11Updated 2 years ago
- Scripts to convert datasets from various sources to Hugging Face Datasets.☆57Updated 3 years ago
- Repository for XLM-T, a framework for evaluating multilingual language models on Twitter data☆158Updated 2 years ago
- An assignment for CMU CS11-711 Advanced NLP, building NLP systems from scratch☆171Updated 2 years ago
- Collection of NLP model explanations and accompanying analysis tools☆144Updated 2 years ago
- ☆169Updated last year
- Generate synthetic labeled data for extremely low-resource languages using bilingual lexicons.☆18Updated last year
- A library to synthesize text datasets using Large Language Models (LLM)☆151Updated 2 years ago
- Portuguese translation of the GLUE benchmark and Scitail dataset☆32Updated 3 years ago
- ☆25Updated 2 years ago