ShailChoksi / text2digits
Converts text such as "twenty three" to number/digit "23" in any sentence
☆67Updated 2 years ago
Alternatives and similar repositories for text2digits:
Users that are interested in text2digits are comparing it to the libraries listed below
- xfspell — the Transformer Spell Checker☆188Updated 4 years ago
- Language independent truecaser in Python.☆160Updated 3 years ago
- Convert number words (eg. twenty one) to numeric digits (21)☆175Updated last year
- A python true casing utility that restores case information for texts☆88Updated 2 years ago
- Automatic extraction of edited sentences from text edition histories.☆82Updated 3 years ago
- Bicleaner is a parallel corpus classifier/cleaner that aims at detecting noisy sentence pairs in a parallel corpus.☆154Updated 8 months ago
- Punctuation restoration and spell correction experiments.☆251Updated 4 years ago
- Use Language Model (LM) for Grammar Error Correction (GEC), without the use of annotated data.☆83Updated 5 years ago
- 📃Language Model based sentences scoring library☆307Updated 3 years ago
- Training an n-gram based Language Model using KenLM toolkit for Deep Speech 2☆114Updated 5 years ago
- A tool that locates, downloads, and extracts machine translation corpora☆150Updated this week
- 📝An easy-to-use package to restore punctuation of the text.☆113Updated last year
- Python wrapper for wit.ai's Duckling Clojure library☆131Updated 3 years ago
- Morfessor is a tool for unsupervised and semi-supervised morphological segmentation☆191Updated 4 years ago
- Large corpus of uncompressed and compressed sentences from news articles.☆123Updated 7 years ago
- A sentence segmenter that actually works!☆304Updated 4 years ago
- MT Evaluation in Many Languages via Zero-Shot Paraphrasing☆101Updated 7 months ago
- ☆45Updated 7 months ago
- Pre-trained models and code and data to train and use models from "Pushing the Limits of Paraphrastic Sentence Embeddings with Millions o…☆101Updated last year
- Switchboard Dialog Act Corpus with Penn Treebank links☆144Updated 4 years ago
- Misspelling Oblivious Word Embeddings☆203Updated 5 years ago
- Build a dialog dataset from online books in many languages☆72Updated 2 years ago
- ☆72Updated 6 years ago
- Examples for aligning, padding and batching sequence labeling data (NER) for use with pre-trained transformer models☆65Updated 2 years ago
- Morfessor EM+Prune☆10Updated 4 years ago
- Obtain Word Alignments using Pretrained Language Models (e.g., mBERT)☆357Updated last year
- Byte Pair Encoding for Python!☆227Updated 2 years ago
- Code and data used in named entity transliteration experiments☆57Updated 6 years ago
- Stanford's Alexa Prize socialbot☆133Updated last year
- Efficient Low-Memory Aligner☆142Updated last month