projecte-aina / spacy
Pre-production releases for spaCy in Catalan
☆14 Updated 3 years ago
Alternatives and similar repositories for spacy:
Users who are interested in spacy are comparing it to the libraries listed below.
- An example of how to use spaCy for extremely large files without running into memory issues ☆36 Updated 2 years ago
- Catalan ALBERT (A Lite BERT for self-supervised learning of language representations) ☆14 Updated 4 years ago
- spaCy match and replace, maintaining conjugation ☆35 Updated 2 years ago
- Finds linguistic patterns effortlessly ☆36 Updated last year
- Code and models for our CLEF-HIPE (Named Entity Processing on Historical Newspapers) submissions ☆19 Updated 2 years ago
- ☆23 Updated 2 years ago
- List of corpora annotated for coreference for different languages ☆17 Updated 8 months ago
- ☆19 Updated 3 years ago
- German lemmatization with IWNLP as extension for spaCy ☆24 Updated last year
- Anonymization pipeline for ingesting data from outside of BSC that contains GDPR-protected data. ☆14 Updated last year
- Topic modelling with spaCy, Gensim and Textacy ☆19 Updated 7 years ago
- ☆64 Updated 2 years ago
- 🌸 Train floret vectors ☆18 Updated last year
- Training a model without a dataset for natural language inference (NLI) ☆25 Updated 4 years ago
- 💥 Use Hugging Face text and token classification pipelines directly in spaCy ☆63 Updated last year
- A spaCy custom component that extracts and normalizes temporal expressions ☆54 Updated 2 years ago
- TopicScan: Visualization and validation interface for NMF Topic Modeling ☆23 Updated 4 years ago
- A PyPI package for easy text annotation in a Jupyter Notebook. ☆28 Updated 3 years ago
- MoodCat😼 classifies the mood of English sentences. ☆14 Updated 2 years ago
- BERT and ELECTRA models trained on Europeana Newspapers ☆38 Updated 3 years ago
- Labeled segmentation for the document structure of printed books ☆13 Updated 7 years ago
- Scansion tool for Spanish texts ☆12 Updated last year
- SpaCyEx allows the creation of spaCy Matcher patterns with RegEx like syntax. ☆59 Updated 11 months ago
- Tokenization across languages. Useful as preprocessing for subword tokenization. ☆22 Updated 2 years ago
- Use spaCy for NLP and output to the FoLiA XML format. ☆12 Updated last year
- Bagpipes spaCy is a collection of custom spaCy pipeline components designed to enhance text processing capabilities. ☆17 Updated 8 months ago
- Preprocessing and analysis for training SNOMED-CT concept embeddings from CORD-19 corpus ☆14 Updated last year
- Code for "CyberWallE at SemEval-2020 Task 11: An Analysis of Feature Engineering for Ensemble Models for Propaganda Detection" (V. Blasch… ☆9 Updated 4 years ago
- ☆17 Updated last year
- Arabic News Stance Corpus ☆10 Updated 4 years ago