aitoralmeida / spanish_word2vec
Ready-to-use Spanish Word2Vec embeddings created from >18B chars and >3B words
☆41Updated 5 years ago
Related projects
Alternatives and complementary repositories for spanish_word2vec
- Spanish Billion Word Corpus and Embeddings☆45Updated last year
- Spanish word embeddings computed with different methods and from different corpora☆356Updated 5 years ago
- Unannotated Spanish 3 Billion Words Corpora☆92Updated 2 years ago
- Official source for Spanish language models and resources made @ BSC-TEMU within the "Plan de las Tecnologías del Lenguaje" (Plan-TL).☆253Updated last year
- WEFE: The Word Embeddings Fairness Evaluation Framework. WEFE is a framework that standardizes the bias measurement and mitigation in Wor…☆175Updated 5 months ago
- ☆61Updated last year
- BETO - Spanish version of the BERT model☆492Updated last year
- Material for the workshop "Neural-network-based vector representations of words" at Starsconf 2018☆23Updated 6 years ago
- A sentiment analysis classifier in Spanish☆121Updated 9 months ago
- Code for the CUP Elements on text analysis in Python for social scientists☆135Updated 2 years ago
- ☆39Updated 3 years ago
- Scrape news articles and analyze them using NLP to quantify the gender gap in Canadian mainstream media☆39Updated 6 months ago
- Dataframe Integration with spaCy.☆102Updated 3 years ago
- Exercises for learning to do NLP powered by the Hugging Face libraries.☆24Updated 2 years ago
- Fine-tuning a Hugging Face BERT model for the United Nations Named Entity Recognition task.☆31Updated 3 years ago
- Project files related to topic modeling of NYT articles regarding mental health☆17Updated 6 years ago
- Hands-on course: NLP from zero to a hundred 🤗☆183Updated 7 months ago
- Notebooks configured to be run with Binder, usually found on my blog.☆41Updated last year
- Spanish rule-based lemmatization for spaCy☆37Updated 2 years ago
- A pipeline for NLP projects using SkLearn☆24Updated 6 years ago
- Code for the paper "Content Analysis of Textbooks via Natural Language Processing".☆56Updated last year
- Specialization of BERT architecture both for the Spanish language and the Twitter domain☆13Updated 4 years ago
- 🤘 Lemmy is a lemmatizer for Danish 🇩🇰 and Swedish 🇸🇪☆75Updated 3 years ago
- Interpretable data visualizations for understanding how texts differ at the word level☆273Updated 4 months ago
- List of Spanish NLP corpora ✨ #Somos600M: Help develop inclusive AI that understands the different varieties of our langu…☆18Updated 10 months ago
- Clean personally identifiable information from dirty dirty text using spaCy.☆40Updated last year
- Bag of, not words, but tricks!☆68Updated last year
- Text analysis with networks.☆284Updated 6 months ago
- AlBERTo, the first Italian BERT model for Twitter language understanding☆71Updated 4 years ago
- Open datasets for sentiment analysis based on tweets in English/Spanish/French/German/Italian☆72Updated last year