allo-media / text2numLinks
Parse and convert numbers written in French, English, Spanish, Portuguese, German and Catalan into their digit representation.
☆109Updated 5 months ago
Alternatives and similar repositories for text2num
Users that are interested in text2num are comparing it to the libraries listed below
Sorting:
- A module for normalising text.☆173Updated 4 years ago
- Simple multilingual lemmatizer for Python, especially useful for speed and efficiency☆177Updated 4 months ago
- 📝An easy-to-use package to restore punctuation of the text.☆119Updated 2 years ago
- Punctuation restoration and spell correction experiments.☆251Updated 4 years ago
- Convert number words (eg. twenty one) to numeric digits (21)☆180Updated 2 years ago
- A model that predicts the punctuation of English, Italian, French and German texts.☆80Updated 2 years ago
- Punctuation Restoration using Transformer Models for High-and Low-Resource Languages☆223Updated last year
- Text to sentence splitter using heuristic algorithm by Philipp Koehn and Josh Schroeder.☆255Updated 2 years ago
- Abydos NLP/IR library for Python☆191Updated 2 years ago
- A python package for deep multilingual punctuation prediction.☆133Updated last year
- Massively multilingual pronunciation mining☆354Updated 2 months ago
- ✔️Contextual word checker for better suggestions (not actively maintained)☆416Updated 8 months ago
- Universal Romanizer that can convert any unicode script to roman (latin) script☆226Updated last year
- A Python 3 phonetics library.☆134Updated 5 years ago
- A merged version of multiple open-source German speech datasets.☆33Updated last year
- Model for recasing and repunctuating ASR transcripts☆141Updated last year
- Rust-based Python wrapper for duckling library in Haskell☆25Updated 4 years ago
- Python package and data files for manipulating phonological segments (phones, phonemes) in terms of universal phonological features.☆281Updated last week
- Complimentary code for our paper Automatic punctuation restoration with BERT models☆50Updated last year
- OpusCleaner is a web interface that helps you select, clean and schedule your data for training machine translation models.☆52Updated 3 weeks ago
- ☆49Updated last year
- Support tools for punctuation and boundary detection for ASR output.☆56Updated 2 years ago
- A python true casing utility that restores case information for texts☆89Updated 2 years ago
- Linguistic processing for Common Voice☆57Updated last year
- A Python toolkit converting pronunciation in enwiktionary xml dump to cmudict format☆33Updated 6 years ago
- ☆174Updated 7 months ago
- Python3 bindings for the Compact Language Detector v3 (CLD3)☆154Updated 2 years ago
- Text and Punctuation correction with Deep Learning☆128Updated 5 years ago
- Faster, modernized fork of the language identification tool langid.py☆59Updated 11 months ago
- A spaCy custom component that extracts and normalizes temporal expressions☆55Updated 2 years ago