Official source for Spanish language models and resources made at BSC-TEMU within the "Plan de las Tecnologías del Lenguaje" (Plan-TL).
☆262 · Jul 27, 2023 · Updated 2 years ago
Alternatives and similar repositories for lm-spanish
Users who are interested in lm-spanish are comparing it to the repositories listed below.
- BETO - Spanish version of the BERT model ☆500 · Oct 21, 2023 · Updated 2 years ago
- Explore RTVE's Telediario news broadcasts since 2014 ☆36 · Jul 26, 2025 · Updated 7 months ago
- Spanish word embeddings computed with different methods and from different corpora ☆364 · Oct 9, 2019 · Updated 6 years ago
- Unannotated Spanish 3 Billion Words Corpora ☆104 · Oct 20, 2022 · Updated 3 years ago
- Spanish Billion Word Corpus and Embeddings ☆52 · Dec 16, 2022 · Updated 3 years ago
- Anonymization pipeline for ingesting data from outside of BSC that contains GDPR-protected data. ☆17 · Nov 10, 2023 · Updated 2 years ago
- Plotting tutorial for maps of Spain with ggplot2 ☆55 · Mar 26, 2024 · Updated last year
- Curated list of Linguistic Resources for doing NLP & CL on Spanish ☆348 · Jan 9, 2024 · Updated 2 years ago
- Introduction to data science and machine learning ☆10 · Nov 2, 2017 · Updated 8 years ago
- This project aims to study the Image Colorization problem and implement a Convolutional Neural Network that is able to colorize black and… ☆11 · Feb 25, 2021 · Updated 5 years ago
- Spanish rule-based lemmatization for spaCy ☆40 · Apr 19, 2022 · Updated 3 years ago
- Combining encoder-based language models ☆11 · Nov 11, 2021 · Updated 4 years ago
- Data for the Medium post on COVID-19 ☆11 · Mar 24, 2020 · Updated 5 years ago
- OpenSource platform for downloading and querying Spanish Official Cadaster Registry (Catastro) ☆14 · May 22, 2023 · Updated 2 years ago
- Scansion tool for Spanish texts ☆12 · Dec 19, 2023 · Updated 2 years ago
- KnowMAN: Weakly Supervised Multinomial Adversarial Networks ☆12 · Nov 9, 2021 · Updated 4 years ago
- Project developed with Apache Spark and Kafka that works with different public streaming data APIs such as SkyScanner, GeoDB Cities, and … ☆11 · Mar 3, 2021 · Updated 5 years ago
- Specialization of BERT architecture both for the Spanish language and the Twitter domain ☆13 · Nov 6, 2020 · Updated 5 years ago
- Identifying relevant concepts from the OMOP CDM vocabularies ☆18 · Jan 19, 2026 · Updated last month
- Example Bots built with the Xatkit framework ☆11 · Aug 24, 2023 · Updated 2 years ago
- BioELECTRA ☆50 · Oct 27, 2021 · Updated 4 years ago
- As good as new. How to successfully recycle English GPT-2 to make models for other languages (ACL Findings 2021) ☆48 · Aug 2, 2021 · Updated 4 years ago
- Pretraining scripts for BART transformer model ☆12 · May 15, 2023 · Updated 2 years ago
- Dashboards showing intrinsic metadata for the OMOP-CDM databases in the EHDEN data network ☆14 · Feb 12, 2026 · Updated 3 weeks ago
- This repo contains the draft, images, and code for the Medium blog post on altair themes. ☆12 · Oct 8, 2018 · Updated 7 years ago
- Gene Set Enrichment Analysis Made Awesome ☆12 · May 11, 2018 · Updated 7 years ago
- Data source for DATADISTA's investigative and data journalism reports and projects ☆328 · Nov 5, 2024 · Updated last year
- Neural Search System on Arxiv AI/ML Papers ☆54 · Aug 4, 2021 · Updated 4 years ago
- ☆33 · Mar 1, 2023 · Updated 3 years ago
- ☆13 · Nov 26, 2024 · Updated last year
- BERT Probe: A python package for probing attention based robustness to character and word based adversarial evaluation. Also, with recipe … ☆18 · Jun 24, 2022 · Updated 3 years ago
- ☆16 · Dec 14, 2022 · Updated 3 years ago
- Analysis of various health-related datasets ☆13 · Mar 18, 2021 · Updated 4 years ago
- Repo for my ODSC West 2017 Talk: "Livecoding Madness: Let's Build a Deep Learning Library" ☆13 · Nov 4, 2017 · Updated 8 years ago
- Music and Artificial Intelligence ☆23 · Feb 17, 2019 · Updated 7 years ago
- Active Learning for Text Classification in Python ☆639 · Feb 1, 2026 · Updated last month
- Minimal code to train ELMo models in recent versions of TensorFlow ☆14 · Apr 30, 2023 · Updated 2 years ago
- Code and supplementary material for the HealthINF conference paper ☆13 · Jan 19, 2021 · Updated 5 years ago
- Charlson Comorbidity Index Regression using Clinical Notes ☆10 · Jul 26, 2018 · Updated 7 years ago