crscardellino / sbwceLinks
Spanish Billion Word Corpus and Embeddings
☆51Updated 2 years ago
Alternatives and similar repositories for sbwce
Users that are interested in sbwce are comparing it to the libraries listed below
Sorting:
- Unannotated Spanish 3 Billion Words Corpora☆105Updated 3 years ago
- Spanish word embeddings computed with different methods and from different corpora☆363Updated 6 years ago
- Ready to use Spanish Word2Vec embeddings created from >18B chars and >3B words☆45Updated 6 years ago
- 💙 Emoji handling and meta data for spaCy with custom extension attributes☆182Updated 2 years ago
- spacy-wordnet creates annotations that easily allow the use of wordnet and wordnet domains by using the nltk wordnet interface☆261Updated 3 months ago
- open datasets for sentiment analysis based on tweets in English/Spanish/French/German/Italian☆75Updated 2 years ago
- Anonymization of legal cases (Fr) based on Flair embeddings☆87Updated 5 years ago
- A fully customisable language detection pipeline for spaCy☆93Updated 6 years ago
- spaCy + UDPipe☆163Updated 3 years ago
- 💫 Jupyter notebooks for spaCy examples and tutorials☆288Updated 6 years ago
- Dataframe Integration with spaCy.☆103Updated 4 years ago
- Google USE (Universal Sentence Encoder) for spaCy☆184Updated 2 years ago
- A Python module for interfacing with the Treetagger by Helmut Schmid.☆76Updated 6 months ago
- A neural parsing pipeline for segmentation, morphological tagging, dependency parsing and lemmatization with pre-trained models for more …☆115Updated last year
- NLP French language model implementing ULMFiT☆87Updated 6 years ago
- Material para el taller "Representaciones vectoriales de palabras basadas en redes neuronales" de la Starsconf 2018☆23Updated 7 years ago
- BETO - Spanish version of the BERT model☆500Updated 2 years ago
- Hunspell extension for spaCy 2.0.☆94Updated last year
- Spanish rule-based lemmatization for spaCy☆40Updated 3 years ago
- 📂 Additional lookup tables and data resources for spaCy☆113Updated 6 months ago
- ✔️Contextual word checker for better suggestions (not actively maintained)☆418Updated 10 months ago
- List of research and engineering of NLP for American Native/Indigenous Languages.☆92Updated 5 years ago
- ☆63Updated 3 years ago
- 🤘Lemmy is a lemmatizer for Danish 🇩🇰 and Swedish 🇸🇪☆79Updated 4 years ago
- Curated list of Linguistic Resources for doing NLP & CL on Spanish☆347Updated last year
- Intelligently expand and create contractions in text leveraging grammar checking and Word Mover's Distance.☆79Updated 3 years ago
- 🚀GUI for training spaCy models☆55Updated 4 years ago
- Pipeline component for spaCy (and other spaCy-wrapped parsers such as spacy-stanza and spacy-udpipe) that adds CoNLL-U properties to a Do…☆81Updated last year
- Character-based word embeddings model based on RNN for handling real world texts☆174Updated 2 years ago
- Here are the notebooks used during the spacy youtube series.☆103Updated 4 years ago