mhagiwara / xfspell
xfspell — the Transformer Spell Checker
☆190Updated 5 years ago
Alternatives and similar repositories for xfspell
Users who are interested in xfspell are comparing it to the libraries listed below.
- Accelerated NLP pipelines for fast inference on CPU. Built with Transformers and ONNX runtime.☆127Updated 4 years ago
- LASER multilingual sentence embeddings as a pip package☆224Updated last year
- A python true casing utility that restores case information for texts☆89Updated 2 years ago
- A sentence segmenter that actually works!☆306Updated 4 years ago
- Automatic extraction of edited sentences from text edition histories.☆83Updated 3 years ago
- Summarization, translation, sentiment-analysis, text-generation and more at blazing speed using a T5 version implemented in ONNX.☆254Updated 2 years ago
- This dataset contains synthetic training data for grammatical error correction. The corpus is generated by corrupting clean sentences fro…☆160Updated 9 months ago
- An asynchronous concurrent pipeline for classifying Common Crawl based on fastText's pipeline.☆86Updated 4 years ago
- This repository contains datasets and code for the paper "HINT3: Raising the bar for Intent Detection in the Wild" accepted at EMNLP-2020…☆33Updated 4 years ago
- Build a dialog dataset from online books in many languages☆75Updated 2 years ago
- Code to reproduce the experiments from the paper.☆101Updated last year
- Create interactive textual heat maps for Jupyter notebooks☆196Updated last year
- Fast and accurate spell correction library☆81Updated 3 years ago
- Use Language Model (LM) for Grammar Error Correction (GEC), without the use of annotated data.☆84Updated 5 years ago
- 📃Language Model based sentences scoring library☆308Updated 3 years ago
- The Dakshina dataset is a collection of text in both Latin and native scripts for 12 South Asian languages. For each language, the datase…☆195Updated 5 years ago
- MILES is a multilingual text simplifier inspired by LSBert - A BERT-based lexical simplification approach proposed in 2018. Unlike LSBert…☆49Updated 4 years ago
- A single model that parses Universal Dependencies across 75 languages. Given a sentence, jointly predicts part-of-speech tags, morphology…☆223Updated 2 years ago
- Question-answers, collected from Google☆129Updated 3 years ago
- ☆72Updated 7 years ago
- XtremeDistil framework for distilling/compressing massive multilingual neural network models to tiny and efficient models for AI at scale☆155Updated last year
- Python-based implementation of the Translate-Align-Retrieve method to automatically translate the SQuAD Dataset to Spanish.☆59Updated 2 years ago
- Bicleaner is a parallel corpus classifier/cleaner that aims at detecting noisy sentence pairs in a parallel corpus.☆158Updated last year
- Interpretable Evaluation for (Almost) All NLP Tasks☆195Updated 2 years ago
- Code for the Shortformer model, from the ACL 2021 paper by Ofir Press, Noah A. Smith and Mike Lewis.☆147Updated 3 years ago
- A collection of task-specific NLU datasets☆149Updated 3 years ago
- Text and Punctuation correction with Deep Learning☆128Updated 5 years ago
- Semantic search using Transformers and others☆110Updated 4 years ago
- [EMNLP 2021] LM-Critic: Language Models for Unsupervised Grammatical Error Correction☆119Updated 3 years ago
- This repository contains the code for "Generating Datasets with Pretrained Language Models".☆188Updated 3 years ago