yannvgn / laserembeddings
LASER multilingual sentence embeddings as a pip package
☆223Updated last year
Alternatives and similar repositories for laserembeddings:
Users that are interested in laserembeddings are comparing it to the libraries listed below
- Text tokenization and sentence segmentation (segtok v2)☆202Updated 3 years ago
- A single model that parses Universal Dependencies across 75 languages. Given a sentence, jointly predicts part-of-speech tags, morphology…☆223Updated 2 years ago
- Google USE (Universal Sentence Encoder) for spaCy☆184Updated 2 years ago
- A simple client for doccano API.☆85Updated 11 months ago
- xfspell — the Transformer Spell Checker☆190Updated 4 years ago
- Easier Automatic Sentence Simplification Evaluation☆160Updated last year
- A python true casing utility that restores case information for texts☆88Updated 2 years ago
- Obtain Word Alignments using Pretrained Language Models (e.g., mBERT)☆360Updated last year
- Fuzzy matching and more functionality for spaCy.☆256Updated 10 months ago
- An implementation of a full named-entity evaluation metrics based on SemEval'13 Task 9 - not at tag/token level but considering all the t…☆220Updated 10 months ago
- OpusFilter - Parallel corpus processing toolkit☆104Updated last month
- spaCy + UDPipe☆161Updated 3 years ago
- Making BERT stretchy. Semantic Elasticsearch with Sentence Transformers☆160Updated 4 years ago
- Document ranking via sentence modeling using BERT☆144Updated 2 years ago
- Implementation of unsupervised smoothed inverse frequency (Best Paper, Repl4NLP @ ACL 2018)☆77Updated 6 years ago
- Bicleaner is a parallel corpus classifier/cleaner that aims at detecting noisy sentence pairs in a parallel corpus.☆157Updated 10 months ago
- Sentence transformers models for SpaCy☆107Updated 2 years ago
- Simple State-of-the-Art BERT-Based Sentence Classification with Keras / TensorFlow 2. Built with HuggingFace's Transformers.☆200Updated 11 months ago
- Builds wordpiece(subword) vocabulary compatible for Google Research's BERT☆229Updated 4 years ago
- Recon NER, Debug and correct annotated Named Entity Recognition (NER) data for inconsistencies and get insights on improving the quality …☆106Updated last year
- Language model fine-tuning on NER with an easy interface and cross-domain evaluation. "T-NER: An All-Round Python Library for Transformer…☆387Updated last year
- ☆72Updated 6 years ago
- Augmenty is an augmentation library based on spaCy for augmenting texts.☆153Updated 11 months ago
- Rank-based Unsupervised Keyword Extraction via Metavertex Aggregation☆99Updated 5 months ago
- A Corpus for Multilingual Document Classification in Eight Languages.☆151Updated 2 years ago
- Preprocessing Library for Natural Language Processing☆161Updated 2 years ago
- SImple SenTence EmbeddeR☆74Updated 2 years ago
- Neural network models for joint POS tagging and dependency parsing (CoNLL 2017-2018)☆158Updated 5 years ago
- Automatic extraction of edited sentences from text edition histories.☆83Updated 3 years ago
- Interpretable Evaluation for (Almost) All NLP Tasks☆195Updated 2 years ago