ShailChoksi / text2digitsLinks
Converts text such as "twenty three" to number/digit "23" in any sentence
β67Updated 3 years ago
Alternatives and similar repositories for text2digits
Users that are interested in text2digits are comparing it to the libraries listed below
Sorting:
- πLanguage Model based sentences scoring libraryβ309Updated 4 years ago
- xfspell β the Transformer Spell Checkerβ189Updated 5 years ago
- Byte Pair Encoding for Python!β232Updated 3 years ago
- A python true casing utility that restores case information for textsβ88Updated 3 years ago
- A sentence segmenter that actually works!β304Updated 5 years ago
- Training an n-gram based Language Model using KenLM toolkit for Deep Speech 2β115Updated 6 years ago
- Convert number words (eg. twenty one) to numeric digits (21)β180Updated 2 years ago
- Punctuation restoration and spell correction experiments.β252Updated 4 years ago
- Language independent truecaser in Python.β160Updated 4 years ago
- Obtain Word Alignments using Pretrained Language Models (e.g., mBERT)β386Updated 2 years ago
- πAn easy-to-use package to restore punctuation of the text.β119Updated 2 years ago
- A tool that locates, downloads, and extracts machine translation corporaβ162Updated 4 months ago
- Repository for SLURP paperβ108Updated 3 years ago
- Text and Punctuation correction with Deep Learningβ128Updated 5 years ago
- Punctuation Restoration using Transformer Models for High-and Low-Resource Languagesβ227Updated last year
- Utilities for Processing the Switchboard Dialogue Act Corpusβ73Updated 5 years ago
- MT Evaluation in Many Languages via Zero-Shot Paraphrasingβ102Updated last year
- Easier Automatic Sentence Simplification Evaluationβ166Updated 2 years ago
- A library for preparing data for machine translation research (monolingual preprocessing, bitext mining, etc.) built by the FAIR NLLB teβ¦β295Updated this week
- Universal Romanizer that can convert any unicode script to roman (latin) scriptβ237Updated last year
- A language model-based approach to Grammatical Error Correction for English that uses minimal annotated data.β48Updated 7 years ago
- A guide to building language technology in new languages.β59Updated 4 years ago
- This tool helps automatic generation of grammatically valid synthetic Code-mixed data by utilizing linguistic theories such as Equivalencβ¦β58Updated last year
- This dataset contains synthetic training data for grammatical error correction. The corpus is generated by corrupting clean sentences froβ¦β162Updated last year
- β94Updated last year
- Python library & examples for Masked Language Model Scoring (ACL 2020)β347Updated 3 years ago
- LASER multilingual sentence embeddings as a pip packageβ224Updated 2 years ago
- Extracts Transcript and Summary (Abstractive and Extractive) from the AMI Meeting Corpusβ57Updated 6 years ago
- Improved Sentence Alignment in Linear Time and Spaceβ188Updated 2 years ago
- Corpora for evaluating NLU services (like API.ai, RASA, Microsoft LUIS, ...)β147Updated 6 years ago