clarinsi / classla
CLASSLA Fork of the Official Stanford NLP Python Library for Many Human Languages
☆38Updated this week
Related projects ⓘ
Alternatives and complementary repositories for classla
- 🍳 Recipes for the Prodigy, our fully scriptable annotation tool☆480Updated 3 months ago
- Fuzzy matching and more functionality for spaCy.☆252Updated 4 months ago
- ✔️Contextual word checker for better suggestions (not actively maintained)☆409Updated last month
- SpikeX - SpaCy Pipes for Knowledge Extraction☆398Updated 3 years ago
- PYthon Automated Term Extraction☆305Updated last year
- Information extraction from English and German texts based on predicate logic☆389Updated 2 years ago
- 🏖TagEditor - Annotation tool for spaCy☆187Updated 2 years ago
- Language, Knowledge, Cognition☆585Updated 3 weeks ago
- A Python library for calculating a large variety of metrics from text☆315Updated last month
- spacy-wordnet creates annotations that easily allow the use of wordnet and wordnet domains by using the nltk wordnet interface☆249Updated 2 months ago
- DBMDZ BERT, DistilBERT, ELECTRA, GPT-2 and ConvBERT models☆155Updated last year
- Text tokenization and sentence segmentation (segtok v2)☆203Updated 2 years ago
- Text analysis with networks.☆285Updated 6 months ago
- Fixes contractions such as `you're` to `you are`☆312Updated 2 years ago
- skweak: A software toolkit for weak supervision applied to NLP tasks☆920Updated 2 months ago
- Implementation of the ClausIE information extraction system for python+spacy☆220Updated 2 years ago
- LexRank algorithm for text summarization☆229Updated 7 months ago
- Coreference resolution for English, French, German and Polish, optimised for limited training data and easily extensible for further lang…☆192Updated last year
- 🌸 fastText + Bloom embeddings for compact, full-coverage vectors with spaCy☆287Updated last year
- Full named-entity (i.e., not tag/token) evaluation metrics based on SemEval’13☆161Updated 2 weeks ago
- Ten Thousand German News Articles Dataset for Topic Classification☆84Updated 2 years ago
- Coreference resolution for English, French, German and Polish, optimised for limited training data and easily extensible for further lang…☆118Updated 6 months ago
- This repository contains an easy and intuitive approach to few-shot NER using most similar expansion over spaCy embeddings. Now with enti…☆242Updated last year
- Toolkit to help understand "what lies" in word embeddings. Also benchmarking!☆469Updated last year
- Google USE (Universal Sentence Encoder) for spaCy☆177Updated last year
- Natural Language Processing Pipeline - Sentence Splitting, Tokenization, Lemmatization, Part-of-speech Tagging and Dependency Parsing☆555Updated 2 weeks ago
- Anonymization of legal cases (Fr) based on Flair embeddings☆87Updated 3 years ago
- Simple multilingual lemmatizer for Python, especially useful for speed and efficiency☆144Updated this week
- Romanian Named Entity Corpus (RONEC) version 2.0☆60Updated 2 years ago
- A spaCy wrapper of Entity-Fishing (component) for named entity disambiguation and linking on Wikidata☆153Updated 2 years ago