mhagiwara / xfspellLinks
xfspell — the Transformer Spell Checker
☆190Updated 5 years ago
Alternatives and similar repositories for xfspell
Users that are interested in xfspell are comparing it to the libraries listed below
Sorting:
- Accelerated NLP pipelines for fast inference on CPU. Built with Transformers and ONNX runtime.☆127Updated 4 years ago
- LASER multilingual sentence embeddings as a pip package☆224Updated last year
- Summarization, translation, sentiment-analysis, text-generation and more at blazing speed using a T5 version implemented in ONNX.☆255Updated 2 years ago
- Fast and accurate spell correction library☆81Updated 3 years ago
- Create interactive textual heat maps for Jupiter notebooks☆196Updated last year
- Many Natural Language Processing tasks rely on sentence boundary detection (SBD). Although amazing libraries like spacy provide state of …☆61Updated 4 years ago
- Automatic extraction of edited sentences from text edition histories.☆83Updated 3 years ago
- 📃Language Model based sentences scoring library☆309Updated 3 years ago
- Examples for aligning, padding and batching sequence labeling data (NER) for use with pre-trained transformer models☆65Updated 2 years ago
- A sentence segmenter that actually works!☆305Updated 4 years ago
- This repository contains the code for "Generating Datasets with Pretrained Language Models".☆188Updated 3 years ago
- Build a dialog dataset from online books in many languages☆76Updated 2 years ago
- A collection of task-specific NLU datasets☆151Updated 3 years ago
- Interpretable Evaluation for (Almost) All NLP Tasks☆195Updated 2 years ago
- XtremeDistil framework for distilling/compressing massive multilingual neural network models to tiny and efficient models for AI at scale☆155Updated last year
- ☆72Updated 7 years ago
- A python true casing utility that restores case information for texts☆89Updated 2 years ago
- This dataset contains synthetic training data for grammatical error correction. The corpus is generated by corrupting clean sentences fro…☆160Updated 10 months ago
- Segment documents into coherent parts using word embeddings.☆149Updated 3 years ago
- Use Language Model (LM) for Grammar Error Correction (GEC), without the use of annotated data.☆84Updated 5 years ago
- Question-answers, collected from Google☆129Updated 4 years ago
- This repository contains datasets and code for the paper "HINT3: Raising the bar for Intent Detection in the Wild" accepted at EMNLP-2020…☆33Updated 4 years ago
- Semantic search using Transformers and others☆110Updated 4 years ago
- New dataset☆306Updated 3 years ago
- Load What You Need: Smaller Multilingual Transformers for Pytorch and TensorFlow 2.0.☆105Updated 3 years ago
- LM Pretraining with PyTorch/TPU☆135Updated 5 years ago
- ☆103Updated 4 years ago
- Code for the Shortformer model, from the ACL 2021 paper by Ofir Press, Noah A. Smith and Mike Lewis.☆147Updated 4 years ago
- Code for obtaining the Curation Corpus abstractive text summarisation dataset☆128Updated 4 years ago
- On the Stability of Fine-tuning BERT: Misconceptions, Explanations, and Strong Baselines☆136Updated last year