nkthiebaut / zeugma
πNatural language processing (NLP) utils: word embeddings (Word2Vec, GloVe, FastText, ...) and preprocessing transformers, compatible with scikit-learn Pipelines. π
β63Updated last year
Alternatives and similar repositories for zeugma:
Users that are interested in zeugma are comparing it to the libraries listed below
- Pipeline component for spaCy (and other spaCy-wrapped parsers such as spacy-stanza and spacy-udpipe) that adds CoNLL-U properties to a Doβ¦β80Updated 9 months ago
- Exploring the simple sentence similarity measurements using word embeddingsβ101Updated 7 months ago
- spaCy + UDPipeβ161Updated 2 years ago
- Python library for Natural Language Preprocessing (NLPre)β190Updated last year
- Anonymization of legal cases (Fr) based on Flair embeddingsβ88Updated 4 years ago
- Running Prodigy for a team of annotatorsβ53Updated 4 years ago
- Intelligently expand and create contractions in text leveraging grammar checking and Word Mover's Distance.β77Updated 3 years ago
- A visualisation tool for Spacy using Hierplane.β65Updated 2 years ago
- Recon NER, Debug and correct annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality β¦β106Updated last year
- Language detection extension for spaCy 2.0+β112Updated 6 years ago
- Template for AC297r projectsβ32Updated 5 years ago
- spaCy pipeline object for negating concepts in textβ279Updated 9 months ago
- Text tokenization and sentence segmentation (segtok v2)β202Updated 3 years ago
- PYthon Automated Term Extractionβ311Updated 2 years ago
- Sentence transformers models for SpaCyβ107Updated 2 years ago
- π Emoji handling and meta data for spaCy with custom extension attributesβ181Updated last year
- Regular spotlights of underrated NLP and Data Science GitHub repositoriesβ35Updated 4 years ago
- π€ Calculate average word embeddings (word2vec) from documents for transfer learningβ54Updated 10 months ago
- Language Models for Zalando's flair libraryβ61Updated 5 years ago
- Dataframe Integration with spaCy.β103Updated 4 years ago
- Implementation of GloVe in Kerasβ45Updated 2 years ago
- An implementation of a full named-entity evaluation metrics based on SemEval'13 Task 9 - not at tag/token level but considering all the tβ¦β220Updated 9 months ago
- Inter-annotator agreement for Doccanoβ27Updated 4 years ago
- Examples for aligning, padding and batching sequence labeling data (NER) for use with pre-trained transformer modelsβ65Updated 2 years ago
- spaCy pipeline component for generating spaCy KnowledgeBase Alias Candidates for Entity Linkingβ85Updated 2 years ago
- A monolingual and cross-lingual meta-embedding generation and evaluation frameworkβ80Updated 2 years ago
- Framework for fine-tuning pretrained transformers for Named-Entity Recognition (NER) tasksβ158Updated 2 years ago
- Named Entity Recognition based on dictionariesβ242Updated 6 years ago
- β72Updated 6 years ago
- π Additional lookup tables and data resources for spaCyβ105Updated 2 months ago