pablodms / spacy-spanish-lemmatizer
Spanish rule-based lemmatization for spaCy
☆37Updated 2 years ago
Related projects: ⓘ
- Unannotated Spanish 3 Billion Words Corpora☆91Updated last year
- Ready to use Spanish Word2Vec embeddings created from >18B chars and >3B words☆38Updated 5 years ago
- A pre-trained language model for social media text in Spanish☆34Updated last year
- Spanish Billion Word Corpus and Embeddings☆45Updated last year
- Spanish data from the AnCora corpus.☆28Updated 3 months ago
- Spanish word embeddings computed with different methods and from different corpora☆353Updated 4 years ago
- BETO - Spanish version of the BERT model☆488Updated 10 months ago
- Official source for spanish Language Models and resources made @ BSC-TEMU within the "Plan de las Tecnologías del Lenguaje" (Plan-TL).☆249Updated last year
- Information extraction from English and German texts based on predicate logic☆133Updated last year
- ☆61Updated last year
- Curso práctico: NLP de cero a cien 🤗☆182Updated 5 months ago
- Clean personally identifiable information from dirty dirty text using spaCy.☆40Updated last year
- ☆65Updated 2 years ago
- Language detection using Spacy and Fasttext☆53Updated 9 months ago
- A spaCy wrapper of Entity-Fishing (component) for named entity disambiguation and linking on Wikidata☆151Updated last year
- Dataframe Integration with spaCy.☆100Updated 3 years ago
- spaCy match and replace, maintaining conjugation☆34Updated last year
- 📝Natural language processing (NLP) utils: word embeddings (Word2Vec, GloVe, FastText, ...) and preprocessing transformers, compatible wi…☆60Updated last year
- Material para el taller "Representaciones vectoriales de palabras basadas en redes neuronales" de la Starsconf 2018☆23Updated 5 years ago
- In the wild extraction of entities that are found using Flair and displayed using a very elegant front-end.☆69Updated last year
- Healthsea is a spaCy pipeline for analyzing user reviews of supplementary products for their effects on health.☆87Updated 2 years ago
- A Python module to convert natural language numerics into ints and floats.☆211Updated last year
- Recon NER, Debug and correct annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality …☆106Updated 6 months ago
- Explainable Zero-Shot Topic Extraction☆62Updated last month
- 💥 Use Hugging Face text and token classification pipelines directly in spaCy☆61Updated 6 months ago
- Fuzzy matching and more functionality for spaCy.☆249Updated 2 months ago
- ☄️ Parallel and distributed training with spaCy and Ray☆54Updated last year
- 🧬 A JupyterLab extension for annotating data with Prodigy☆188Updated last year
- A spaCy wrapper of OpenTapioca for named entity linking on Wikidata☆90Updated last year
- NeatText a simple NLP package for cleaning textual data and text preprocessing☆66Updated 9 months ago