clarinsi / classla
CLASSLA Fork of the Official Stanford NLP Python Library for Many Human Languages
☆41Updated last week
Alternatives and similar repositories for classla
Users that are interested in classla are comparing it to the libraries listed below
Sorting:
- A Python library for calculating a large variety of metrics from text☆337Updated 4 months ago
- Fuzzy matching and more functionality for spaCy.☆256Updated 10 months ago
- This repository contains an easy and intuitive approach to few-shot classification using sentence-transformers or spaCy models, or zero-s…☆214Updated 3 months ago
- Simple multilingual lemmatizer for Python, especially useful for speed and efficiency☆156Updated this week
- Text tokenization and sentence segmentation (segtok v2)☆202Updated 3 years ago
- A Dutch RoBERTa-based language model☆203Updated last year
- spaCy + UDPipe☆161Updated 3 years ago
- spaCy pipeline object for negating concepts in text☆279Updated 11 months ago
- Google USE (Universal Sentence Encoder) for spaCy☆184Updated 2 years ago
- Framework for fine-tuning pretrained transformers for Named-Entity Recognition (NER) tasks☆159Updated 2 years ago
- Ten Thousand German News Articles Dataset for Topic Classification☆84Updated 2 years ago
- PYthon Automated Term Extraction☆311Updated 2 years ago
- This repository contains an easy and intuitive approach to few-shot NER using most similar expansion over spaCy embeddings. Now with enti…☆245Updated last year
- DBMDZ BERT, DistilBERT, ELECTRA, GPT-2 and ConvBERT models☆157Updated 2 years ago
- BERTje is a Dutch pre-trained BERT model developed at the University of Groningen. (EMNLP Findings 2020) "What’s so special about BERT’s …☆138Updated 2 years ago
- SpikeX - SpaCy Pipes for Knowledge Extraction☆398Updated 3 years ago
- Coreference resolution for English, French, German and Polish, optimised for limited training data and easily extensible for further lang…☆122Updated last year
- A spaCy wrapper of Entity-Fishing (component) for named entity disambiguation and linking on Wikidata☆161Updated 2 years ago
- 🤘Lemmy is a lemmatizer for Danish 🇩🇰 and Swedish 🇸🇪☆77Updated 3 years ago
- Text to sentence splitter using heuristic algorithm by Philipp Koehn and Josh Schroeder.☆246Updated 2 years ago
- spacy-wordnet creates annotations that easily allow the use of wordnet and wordnet domains by using the nltk wordnet interface☆255Updated 8 months ago
- A Dataset of German Legal Documents for Named Entity Recognition☆168Updated 2 years ago
- ✔️Contextual word checker for better suggestions (not actively maintained)☆413Updated 3 months ago
- Full named-entity (i.e., not tag/token) evaluation metrics based on SemEval’13☆179Updated last month
- E3C is a freely available multilingual corpus (Italian, English, French, Spanish, and Basque) of semantically annotated clinical narrativ…☆25Updated last year
- Implementation of the ClausIE information extraction system for python+spacy☆222Updated 2 years ago
- Coreference resolution for English, French, German and Polish, optimised for limited training data and easily extensible for further lang…☆195Updated 2 years ago
- UIMA CAS processing library written in Python☆88Updated last month
- Spacy NER annotator using ipywidgets☆121Updated last year
- Anonymization of legal cases (Fr) based on Flair embeddings☆88Updated 4 years ago