nkthiebaut / zeugmaLinks
πNatural language processing (NLP) utils: word embeddings (Word2Vec, GloVe, FastText, ...) and preprocessing transformers, compatible with scikit-learn Pipelines. π
β63Updated 2 years ago
Alternatives and similar repositories for zeugma
Users that are interested in zeugma are comparing it to the libraries listed below
Sorting:
- spaCy pipeline object for negating concepts in textβ281Updated 2 months ago
- Framework for fine-tuning pretrained transformers for Named-Entity Recognition (NER) tasksβ159Updated 2 years ago
- Sentence transformers models for SpaCyβ107Updated 2 years ago
- Google USE (Universal Sentence Encoder) for spaCyβ184Updated 2 years ago
- PYthon Automated Term Extractionβ315Updated 2 years ago
- Character-based word embeddings model based on RNN for handling real worldΒ textsβ173Updated last year
- Text tokenization and sentence segmentation (segtok v2)β205Updated 3 years ago
- Python library for Natural Language Preprocessing (NLPre)β191Updated 2 years ago
- Anonymization of legal cases (Fr) based on Flair embeddingsβ87Updated 4 years ago
- All the goto functions you need to handle NLP use-cases, integrated in NLPretextβ140Updated 5 months ago
- Language Models for Zalando's flair libraryβ61Updated 5 years ago
- Intelligently expand and create contractions in text leveraging grammar checking and Word Mover's Distance.β77Updated 3 years ago
- Examples for aligning, padding and batching sequence labeling data (NER) for use with pre-trained transformer modelsβ65Updated 2 years ago
- Creating class-based TF-IDF matricesβ89Updated 2 years ago
- spaCy + UDPipeβ163Updated 3 years ago
- Fixes contractions such as `you're` to `you are`β317Updated 2 years ago
- Fuzzy matching and more functionality for spaCy.β256Updated last year
- Word Embeddings for Information Retrievalβ225Updated last year
- A fully customisable language detection pipeline for spaCyβ93Updated 6 years ago
- Recon NER, Debug and correct annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality β¦β106Updated last year
- Use ML-Annotate to label data for machine learning purposesβ111Updated 5 years ago
- π Easy training and deployment of seq2seq models.β227Updated 4 years ago
- SImple SenTence EmbeddeRβ74Updated 2 years ago
- π Emoji handling and meta data for spaCy with custom extension attributesβ181Updated 2 years ago
- Pipeline component for spaCy (and other spaCy-wrapped parsers such as spacy-stanza and spacy-udpipe) that adds CoNLL-U properties to a Doβ¦β81Updated last year
- Applying BERT to named entity recognition in English and Russian.β162Updated 2 years ago
- shabeelkandi / Handling-Out-of-Vocabulary-Words-in-Natural-Language-Processing-using-Language-Modellingβ69Updated 5 years ago
- An implementation of a full named-entity evaluation metrics based on SemEval'13 Task 9 - not at tag/token level but considering all the tβ¦β221Updated last year
- Steam review texting embedding analysisβ142Updated 2 years ago
- π€ Calculate average word embeddings (word2vec) from documents for transfer learningβ54Updated last year