ShailChoksi / text2digits
Converts text such as "twenty three" to number/digit "23" in any sentence
★67 · Updated 3 years ago
Alternatives and similar repositories for text2digits
Users interested in text2digits are comparing it to the libraries listed below.
- Language Model based sentences scoring library ★308 · Updated 3 years ago
- Punctuation restoration and spell correction experiments. ★252 · Updated 4 years ago
- A python true casing utility that restores case information for texts ★88 · Updated 3 years ago
- A sentence segmenter that actually works! ★304 · Updated 5 years ago
- xfspell: the Transformer Spell Checker ★189 · Updated 5 years ago
- Obtain Word Alignments using Pretrained Language Models (e.g., mBERT) ★385 · Updated 2 years ago
- Segment documents into coherent parts using word embeddings. ★149 · Updated 3 years ago
- Text and Punctuation correction with Deep Learning ★128 · Updated 5 years ago
- Switchboard Dialog Act Corpus with Penn Treebank links ★146 · Updated 4 years ago
- A tool that locates, downloads, and extracts machine translation corpora ★159 · Updated 3 months ago
- A curated list of research papers and resources on code-switching ★329 · Updated last year
- Utilities for Processing the Switchboard Dialogue Act Corpus ★72 · Updated 4 years ago
- Automatic extraction of edited sentences from text edition histories. ★83 · Updated 3 years ago
- Repository that accompanies "An Evaluation Dataset for Intent Classification and Out-of-Scope Prediction" (EMNLP 2019) ★216 · Updated 4 years ago
- Python library & examples for Masked Language Model Scoring (ACL 2020) ★347 · Updated 3 years ago
- A python module for English lemmatization and inflection. ★274 · Updated 2 years ago
- Language independent truecaser in Python. ★159 · Updated 4 years ago
- A language model-based approach to Grammatical Error Correction for English that uses minimal annotated data. ★48 · Updated 6 years ago
- Punctuation Restoration using Transformer Models for High- and Low-Resource Languages ★225 · Updated last year
- A collection of task-specific NLU datasets ★160 · Updated 3 years ago
- ERRor ANnotation Toolkit: Automatically extract and classify grammatical errors in parallel original and corrected sentences. ★456 · Updated last year
- (yet another not really) awesome topic/text segmentation list ★108 · Updated 7 years ago
- Training an n-gram based Language Model using KenLM toolkit for Deep Speech 2 ★115 · Updated 6 years ago
- Convert number words (e.g., twenty one) to numeric digits (21) ★180 · Updated 2 years ago
- Scripts and links to recreate the ELI5 dataset. ★326 · Updated 4 years ago
- Corpora for evaluating NLU Services/Platforms such as Dialogflow, LUIS, Watson, Rasa etc. ★114 · Updated 3 years ago
- A library for preparing data for machine translation research (monolingual preprocessing, bitext mining, etc.) built by the FAIR NLLB te… ★287 · Updated 2 months ago
- MT Evaluation in Many Languages via Zero-Shot Paraphrasing ★102 · Updated last year
- Corpora for evaluating NLU services (like API.ai, RASA, Microsoft LUIS, ...) ★147 · Updated 6 years ago
- A neural word aligner based on multilingual BERT ★362 · Updated 3 years ago