allo-media / text2num
Parse and convert numbers written in French, English, Spanish, Portuguese, German and Catalan into their digit representation.
☆108 · Updated 4 months ago
Alternatives and similar repositories for text2num
Users who are interested in text2num are comparing it to the libraries listed below:
- An easy-to-use package to restore punctuation of the text. ☆118 · Updated 2 years ago
- Simple multilingual lemmatizer for Python, especially useful for speed and efficiency ☆175 · Updated 4 months ago
- A module for normalising text. ☆173 · Updated 3 years ago
- Convert number words (e.g. twenty one) to numeric digits (21) ☆180 · Updated 2 years ago
- Model for recasing and repunctuating ASR transcripts ☆139 · Updated last year
- Contextual word checker for better suggestions (not actively maintained) ☆417 · Updated 8 months ago
- A model that predicts the punctuation of English, Italian, French and German texts. ☆80 · Updated 2 years ago
- Punctuation restoration and spell correction experiments. ☆251 · Updated 4 years ago
- Punctuation Restoration using Transformer Models for High-and Low-Resource Languages ☆221 · Updated last year
- Massively multilingual pronunciation mining ☆351 · Updated last month
- Abydos NLP/IR library for Python ☆191 · Updated 2 years ago
- A merged version of multiple open-source German speech datasets. ☆33 · Updated last year
- Text to sentence splitter using heuristic algorithm by Philipp Koehn and Josh Schroeder. ☆254 · Updated 2 years ago
- A python package for deep multilingual punctuation prediction. ☆132 · Updated last year
- A versioned python wrapper package for cmudict (https://github.com/cmusphinx/cmudict). ☆65 · Updated last week
- A Python 3 phonetics library. ☆134 · Updated 5 years ago
- Python 3 library for managing, annotating, and converting natural language corpuses using popular formats (CoNLL, ELAN, Praat, CSV, … ☆20 · Updated last year
- MorphyNet: a Large Multilingual Database of Derivational and Inflectional Morphology (+morpheme segmentation) ☆48 · Updated 2 years ago
- Universal Romanizer that can convert any unicode script to roman (latin) script ☆225 · Updated last year
- ipapy is a Python module to work with International Phonetic Alphabet (IPA) strings ☆89 · Updated last year
- Support tools for punctuation and boundary detection for ASR output. ☆56 · Updated 2 years ago
- Text tokenization and sentence segmentation (segtok v2) ☆206 · Updated 3 years ago
- OpusCleaner is a web interface that helps you select, clean and schedule your data for training machine translation models. ☆51 · Updated 2 months ago
- Linguistic processing for Common Voice ☆57 · Updated last year
- Rust-based Python wrapper for duckling library in Haskell ☆25 · Updated 4 years ago
- LASER multilingual sentence embeddings as a pip package ☆224 · Updated 2 years ago
- Source code for the Apple reproduction ☆32 · Updated 4 years ago
- In the wild extraction of entities that are found using Flair and displayed using a very elegant front-end. ☆71 · Updated 2 years ago
- Multilingual syllable annotation pipeline component for spacy ☆39 · Updated 2 years ago
- Grapheme-to-Phoneme transductions that preserve input and output indices, and support cross-lingual g2p! ☆176 · Updated last week