ShailChoksi / text2digits
Converts text such as "twenty three" to number/digit "23" in any sentence
β67Updated 2 years ago
Alternatives and similar repositories for text2digits
Users that are interested in text2digits are comparing it to the libraries listed below
Sorting:
- Language independent truecaser in Python.β160Updated 3 years ago
- πLanguage Model based sentences scoring libraryβ308Updated 3 years ago
- A python true casing utility that restores case information for textsβ88Updated 2 years ago
- A program to choose transfer languages for cross-lingual learningβ72Updated last year
- A tool that locates, downloads, and extracts machine translation corporaβ154Updated 2 weeks ago
- A CoNLL-U parser that takes a CoNLL-U formatted string and turns it into a nested python dictionary.β316Updated 2 months ago
- Code to reproduce the experiments from the paper.β101Updated last year
- Utilities for Processing the Switchboard Dialogue Act Corpusβ69Updated 4 years ago
- β92Updated last year
- Training an n-gram based Language Model using KenLM toolkit for Deep Speech 2β114Updated 5 years ago
- Compound splitter for Germanβ105Updated 5 years ago
- Efficient Low-Memory Alignerβ143Updated 4 months ago
- Demonstration of the results in "Text Normalization using Memory Augmented Neural Networks", Authors: Subhojeet Pramanik, Aman Hussainβ60Updated 5 years ago
- MT Evaluation in Many Languages via Zero-Shot Paraphrasingβ101Updated 9 months ago
- Team Kakao&Brain's Grammatical Error Correction System for the ACL 2019 BEA Shared Taskβ92Updated 5 years ago
- Automatic extraction of edited sentences from text edition histories.β83Updated 3 years ago
- β42Updated 3 years ago
- Convert number words (eg. twenty one) to numeric digits (21)β176Updated last year
- Complimentary code for our paper Automatic punctuation restoration with BERT modelsβ49Updated last year
- General-Purpose Neural Networks for Sentence Boundary Detectionβ73Updated 2 years ago
- Guidelines.β97Updated 9 months ago
- A guide to building language technology in new languages.β58Updated 3 years ago
- A simple library for querying the URIEL typological database.β90Updated last year
- One million English sentences, each split into two sentences that together preserve the original meaning, extracted from Wikipedia edits.β123Updated 5 years ago
- Morfessor EM+Pruneβ10Updated 4 years ago
- Text and Punctuation correction with Deep Learningβ128Updated 5 years ago
- β47Updated 9 months ago
- Bicleaner is a parallel corpus classifier/cleaner that aims at detecting noisy sentence pairs in a parallel corpus.β157Updated 10 months ago
- Use Language Model (LM) for Grammar Error Correction (GEC), without the use of annotated data.β83Updated 5 years ago
- spacy-wordnet creates annotations that easily allow the use of wordnet and wordnet domains by using the nltk wordnet interfaceβ255Updated 8 months ago