nkthiebaut / zeugma
Natural language processing (NLP) utils: word embeddings (Word2Vec, GloVe, FastText, ...) and preprocessing transformers, compatible with scikit-learn Pipelines.
☆63 · Updated 2 years ago
Alternatives and similar repositories for zeugma
Users interested in zeugma are comparing it to the libraries listed below.
- Sentence transformers models for SpaCy ☆109 · Updated 2 years ago
- spaCy pipeline object for negating concepts in text ☆282 · Updated 7 months ago
- Framework for fine-tuning pretrained transformers for Named-Entity Recognition (NER) tasks ☆159 · Updated 2 years ago
- Google USE (Universal Sentence Encoder) for spaCy ☆184 · Updated 2 years ago
- Recon NER, Debug and correct annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality … ☆105 · Updated last year
- spaCy + UDPipe ☆165 · Updated 3 years ago
- Python library for Natural Language Preprocessing (NLPre) ☆192 · Updated 2 years ago
- PYthon Automated Term Extraction ☆318 · Updated 2 years ago
- Character-based word embeddings model based on RNN for handling real world texts ☆174 · Updated 2 years ago
- Word Embeddings for Information Retrieval ☆225 · Updated 2 years ago
- Anonymization of legal cases (Fr) based on Flair embeddings ☆87 · Updated 5 years ago
- An implementation of a full named-entity evaluation metrics based on SemEval'13 Task 9 - not at tag/token level but considering all the t… ☆222 · Updated last year
- SImple SenTence EmbeddeR ☆74 · Updated 2 years ago
- Pipeline component for spaCy (and other spaCy-wrapped parsers such as spacy-stanza and spacy-udpipe) that adds CoNLL-U properties to a Do… ☆81 · Updated last year
- Use ML-Annotate to label data for machine learning purposes ☆110 · Updated 5 years ago
- Inter-annotator agreement for Doccano ☆28 · Updated 5 years ago
- Fuzzy matching and more functionality for spaCy. ☆259 · Updated last year
- Creating class-based TF-IDF matrices ☆91 · Updated 3 years ago
- A fully customisable language detection pipeline for spaCy ☆93 · Updated 6 years ago
- Examples for aligning, padding and batching sequence labeling data (NER) for use with pre-trained transformer models ☆64 · Updated 3 years ago
- Intelligently expand and create contractions in text leveraging grammar checking and Word Mover's Distance. ☆79 · Updated 4 years ago
- Text tokenization and sentence segmentation (segtok v2) ☆208 · Updated 3 years ago
- Calculate average word embeddings (word2vec) from documents for transfer learning ☆54 · Updated last year
- ☆73 · Updated 7 years ago
- All the goto functions you need to handle NLP use-cases, integrated in NLPretext ☆142 · Updated 9 months ago
- A monolingual and cross-lingual meta-embedding generation and evaluation framework ☆79 · Updated 3 years ago
- Custom Natural Language Processing with big and small models 🌲🌱 ☆66 · Updated 4 years ago
- Semantic search using Transformers and others ☆110 · Updated 5 years ago
- NLP French language model implementing ULMFiT ☆87 · Updated 6 years ago
- 🧪 Cutting-edge experimental spaCy components and features ☆105 · Updated last year