ShailChoksi / text2digits
Converts text such as "twenty three" to number/digit "23" in any sentence
☆67 Updated 2 years ago
Alternatives and similar repositories for text2digits
Users who are interested in text2digits are comparing it to the libraries listed below
- Language Model based sentences scoring library ☆309 Updated 3 years ago
- xfspell - the Transformer Spell Checker ☆190 Updated 5 years ago
- Punctuation restoration and spell correction experiments. ☆251 Updated 4 years ago
- Automatic extraction of edited sentences from text edition histories. ☆83 Updated 3 years ago
- A python true casing utility that restores case information for texts ☆89 Updated 2 years ago
- A sentence segmenter that actually works! ☆306 Updated 4 years ago
- Training an n-gram based Language Model using KenLM toolkit for Deep Speech 2 ☆114 Updated 6 years ago
- A tool that locates, downloads, and extracts machine translation corpora ☆156 Updated 2 months ago
- Python library & examples for Masked Language Model Scoring (ACL 2020) ☆344 Updated 2 years ago
- Obtain Word Alignments using Pretrained Language Models (e.g., mBERT) ☆373 Updated last year
- ☆74 Updated 4 months ago
- OpusFilter - Parallel corpus processing toolkit ☆109 Updated this week
- Universal Romanizer that can convert any unicode script to roman (latin) script ☆214 Updated last year
- Bicleaner is a parallel corpus classifier/cleaner that aims at detecting noisy sentence pairs in a parallel corpus. ☆158 Updated last year
- LASER multilingual sentence embeddings as a pip package ☆224 Updated 2 years ago
- Easier Automatic Sentence Simplification Evaluation ☆161 Updated last year
- Easy-to-use word-to-word translations for 3,564 language pairs. ☆366 Updated 4 years ago
- Convert number words (eg. twenty one) to numeric digits (21) ☆177 Updated last year
- A collection of task-specific NLU datasets ☆151 Updated 3 years ago
- Code to reproduce the experiments from the paper. ☆101 Updated last year
- ☆49 Updated last year
- Corpora for evaluating NLU services (like API.ai, RASA, Microsoft LUIS, ...) ☆146 Updated 5 years ago
- Language independent truecaser in Python. ☆159 Updated 3 years ago
- A library for preparing data for machine translation research (monolingual preprocessing, bitext mining, etc.) built by the FAIR NLLB te… ☆282 Updated 6 months ago
- Build a dialog dataset from online books in many languages ☆76 Updated 2 years ago
- Punctuation Restoration using Transformer Models for High-and Low-Resource Languages ☆219 Updated last year
- Byte Pair Encoding for Python! ☆230 Updated 2 years ago
- A language model-based approach to Grammatical Error Correction for English that uses minimal annotated data. ☆48 Updated 6 years ago
- MT Evaluation in Many Languages via Zero-Shot Paraphrasing ☆101 Updated last year
- Extracts Transcript and Summary (Abstractive and Extractive) from the AMI Meeting Corpus ☆55 Updated 5 years ago