ShailChoksi / text2digitsLinks
Converts text such as "twenty three" to number/digit "23" in any sentence
β67Updated 2 years ago
Alternatives and similar repositories for text2digits
Users that are interested in text2digits are comparing it to the libraries listed below
Sorting:
- πLanguage Model based sentences scoring libraryβ309Updated 3 years ago
- A sentence segmenter that actually works!β305Updated 5 years ago
- xfspell β the Transformer Spell Checkerβ189Updated 5 years ago
- Utilities for Processing the Switchboard Dialogue Act Corpusβ72Updated 4 years ago
- Punctuation restoration and spell correction experiments.β252Updated 4 years ago
- Automatic extraction of edited sentences from text edition histories.β83Updated 3 years ago
- Copora for evaluating NLU Services/Platforms such as Dialogflow, LUIS, Watson, Rasa etc.β114Updated 3 years ago
- Obtain Word Alignments using Pretrained Language Models (e.g., mBERT)β381Updated last year
- Corpora for evaluating NLU services (like API.ai, RASA, Microsoft LUIS, ...)β147Updated 6 years ago
- Repository that accompanies "An Evaluation Dataset for Intent Classification and Out-of-Scope Prediction" (EMNLP 2019)β213Updated 4 years ago
- Language independent truecaser in Python.β160Updated 4 years ago
- A collection of task-specific NLU datasetsβ159Updated 3 years ago
- A python true casing utility that restores case information for textsβ89Updated 2 years ago
- LASER multilingual sentence embeddings as a pip packageβ225Updated 2 years ago
- A tool that locates, downloads, and extracts machine translation corporaβ159Updated last month
- This tool helps automatic generation of grammatically valid synthetic Code-mixed data by utilizing linguistic theories such as Equivalencβ¦β56Updated last year
- One million English sentences, each split into two sentences that together preserve the original meaning, extracted from Wikipedia edits.β123Updated 6 years ago
- Switchboard Dialog Act Corpus with Penn Treebank linksβ146Updated 4 years ago
- Extracts Transcript and Summary (Abstractive and Extractive) from the AMI Meeting Corpusβ57Updated 5 years ago
- A language model-based approach to Grammatical Error Correction for English that uses minimal annotated data.β48Updated 6 years ago
- A neural word aligner based on multilingual BERTβ358Updated 3 years ago
- Punctuation Restoration using Transformer Models for High-and Low-Resource Languagesβ223Updated last year
- Use Language Model (LM) for Grammar Error Correction (GEC), without the use of annotated data.β85Updated 6 years ago
- Efficient Low-Memory Alignerβ146Updated 9 months ago
- Repository for SLURP paperβ106Updated 3 years ago
- A library for preparing data for machine translation research (monolingual preprocessing, bitext mining, etc.) built by the FAIR NLLB teβ¦β283Updated 3 weeks ago
- Bicleaner is a parallel corpus classifier/cleaner that aims at detecting noisy sentence pairs in a parallel corpus.β160Updated last year
- Text and Punctuation correction with Deep Learningβ128Updated 5 years ago
- β49Updated last year
- Intelligently expand and create contractions in text leveraging grammar checking and Word Mover's Distance.β79Updated 3 years ago