adbar / simplemma
Simple multilingual lemmatizer for Python, especially useful for speed and efficiency
☆154Updated 4 months ago
Alternatives and similar repositories for simplemma:
Users that are interested in simplemma are comparing it to the libraries listed below
- A spaCy wrapper of Entity-Fishing (component) for named entity disambiguation and linking on Wikidata☆158Updated 2 years ago
- A tokenizer and sentence splitter for German and English web and social media texts.☆139Updated 3 months ago
- Text tokenization and sentence segmentation (segtok v2)☆202Updated 3 years ago
- 🧪 Cutting-edge experimental spaCy components and features☆97Updated 10 months ago
- A neural parsing pipeline for segmentation, morphological tagging, dependency parsing and lemmatization with pre-trained models for more …☆112Updated 10 months ago
- Recon NER, Debug and correct annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality …☆106Updated last year
- Text to sentence splitter using heuristic algorithm by Philipp Koehn and Josh Schroeder.☆241Updated 2 years ago
- Fuzzy matching and more functionality for spaCy.☆256Updated 8 months ago
- A python module for English lemmatization and inflection.☆265Updated last year
- Fast and robust date extraction from web pages, with Python or on the command-line☆124Updated 2 months ago
- spaCy + UDPipe☆161Updated 2 years ago
- ✔️Contextual word checker for better suggestions (not actively maintained)☆413Updated last month
- Tools for shrinking fastText models (in gensim format)☆178Updated 10 months ago
- Coreference resolution for English, French, German and Polish, optimised for limited training data and easily extensible for further lang…☆121Updated 10 months ago
- A spaCy wrapper of OpenTapioca for named entity linking on Wikidata☆94Updated last year
- Coreference resolution for English, French, German and Polish, optimised for limited training data and easily extensible for further lang…☆193Updated 2 years ago
- A modern, interlingual wordnet interface for Python☆235Updated 3 weeks ago
- UIMA CAS processing library written in Python☆87Updated this week
- Information extraction from English and German texts based on predicate logic☆135Updated last year
- spaCy-wrap is a wrapper library for spaCy for including fine-tuned transformers from Huggingface in your spaCy pipeline allowing you to i…☆46Updated 11 months ago
- 📂 Additional lookup tables and data resources for spaCy☆105Updated last month
- Sentence transformers models for SpaCy☆107Updated 2 years ago
- Mini-library for producing graph visualizations from embedding models☆28Updated 4 years ago
- In the wild extraction of entities that are found using Flair and displayed using a very elegant front-end.☆71Updated 2 years ago
- Augmenty is an augmentation library based on spaCy for augmenting texts.☆151Updated 9 months ago
- Parse and convert numbers written in French, English, Spanish, Portuguese, German and Catalan into their digit representation.☆105Updated last month
- A Python library for calculating a large variety of metrics from text☆331Updated 3 months ago
- 💥 Use Hugging Face text and token classification pipelines directly in spaCy☆63Updated last year
- Abydos NLP/IR library for Python☆185Updated 2 years ago
- 80x faster and 95% accurate language identification with Fasttext☆150Updated last year