pablodms / spacy-spanish-lemmatizer
Spanish rule-based lemmatization for spaCy
☆38Updated 2 years ago
Alternatives and similar repositories for spacy-spanish-lemmatizer:
Users that are interested in spacy-spanish-lemmatizer are comparing it to the libraries listed below
- Unannotated Spanish 3 Billion Words Corpora☆94Updated 2 years ago
- Spanish Billion Word Corpus and Embeddings☆46Updated 2 years ago
- A pre-trained language model for social media text in Spanish☆34Updated last year
- Spanish word embeddings computed with different methods and from different corpora☆354Updated 5 years ago
- ☆68Updated 2 years ago
- spaCy match and replace, maintaining conjugation☆35Updated 2 years ago
- German sentiment scores with SentiWS as extension for spaCy☆36Updated 2 years ago
- Information extraction from English and German texts based on predicate logic☆135Updated last year
- BETO - Spanish version of the BERT model☆491Updated last year
- The project proposes a framework to apply topic models on a text-corpus and eventually topic labels on the generated topics.☆35Updated 8 months ago
- Official source for spanish Language Models and resources made @ BSC-TEMU within the "Plan de las Tecnologías del Lenguaje" (Plan-TL).☆254Updated last year
- A spaCy wrapper of OpenTapioca for named entity linking on Wikidata☆93Updated last year
- Scansion tool for Spanish texts☆11Updated last year
- A monolingual and cross-lingual meta-embedding generation and evaluation framework☆80Updated 2 years ago
- 💥 Explosion Assets☆43Updated last year
- Tutorial for Topic Modelling using PySpark and Spark NLP☆16Updated 4 years ago
- Portuguese BERT and XLM-R models fine-tuned in semantic role labeling.☆22Updated 2 years ago
- A spaCy custom component that extracts and normalizes temporal expressions☆52Updated last year
- Topic Inference with Zeroshot models☆61Updated last year
- Regular spotlights of underrated NLP and Data Science GitHub repositories☆35Updated 4 years ago
- An easy-to-use library to extract indices from texts.☆29Updated 3 years ago
- Spanish data from the AnCora corpus.☆29Updated 2 months ago
- ☆12Updated 3 years ago
- Fuzzy matching and more functionality for spaCy.☆255Updated 6 months ago
- Clean personally identifiable information from dirty dirty text using spaCy.☆41Updated last year
- Generic Environment for Context-Aware Correction of Orthography☆22Updated 2 years ago
- A spaCy wrapper of Entity-Fishing (component) for named entity disambiguation and linking on Wikidata☆156Updated 2 years ago
- 📝Natural language processing (NLP) utils: word embeddings (Word2Vec, GloVe, FastText, ...) and preprocessing transformers, compatible wi…☆62Updated last year
- 🧪 Cutting-edge experimental spaCy components and features☆96Updated 8 months ago
- Recon NER, Debug and correct annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality …☆106Updated 10 months ago