allo-media / text2num
Parse and convert numbers written in French, English, Spanish, Portuguese, German and Catalan into their digit representation.
☆112 Updated 2 weeks ago
Alternatives and similar repositories for text2num
Users who are interested in text2num compare it to the libraries listed below.
- Simple multilingual lemmatizer for Python, especially useful for speed and efficiency ☆182 Updated 7 months ago
- A module for normalising text. ☆172 Updated 4 years ago
- 📝 An easy-to-use package to restore punctuation of the text. ☆119 Updated 2 years ago
- Punctuation Restoration using Transformer Models for High- and Low-Resource Languages ☆227 Updated last year
- A model that predicts the punctuation of English, Italian, French and German texts. ☆83 Updated 2 years ago
- Convert number words (e.g. twenty one) to numeric digits (21) ☆180 Updated 2 years ago
- Massively multilingual pronunciation mining ☆360 Updated 2 weeks ago
- A Python 3 phonetics library. ☆137 Updated 5 years ago
- ✔️ Contextual word checker for better suggestions (not actively maintained) ☆418 Updated 11 months ago
- Punctuation restoration and spell correction experiments. ☆252 Updated 4 years ago
- A merged version of multiple open-source German speech datasets. ☆34 Updated last year
- Abydos NLP/IR library for Python ☆194 Updated 3 years ago
- A Python toolkit converting pronunciation in enwiktionary XML dump to cmudict format ☆33 Updated 6 years ago
- Text to sentence splitter using heuristic algorithm by Philipp Koehn and Josh Schroeder. ☆256 Updated 3 years ago
- A Python package for deep multilingual punctuation prediction. ☆154 Updated last year
- Universal Romanizer that can convert any Unicode script to Roman (Latin) script ☆237 Updated last year
- Cython wrapper on Hunspell Dictionary ☆66 Updated last year
- Model for recasing and repunctuating ASR transcripts ☆143 Updated last year
- This is a neural spelling checker ☆69 Updated 3 years ago
- Support tools for punctuation and boundary detection for ASR output. ☆55 Updated 3 years ago
- A Python true casing utility that restores case information for texts ☆88 Updated 3 years ago
- ipapy is a Python module to work with International Phonetic Alphabet (IPA) strings ☆90 Updated last year
- Complementary code for our paper Automatic punctuation restoration with BERT models ☆50 Updated 2 years ago
- A bidirectional recurrent neural network model with attention mechanism for restoring missing punctuation in unsegmented text ☆34 Updated 5 years ago
- Linguistic processing for Common Voice ☆58 Updated 2 years ago
- Unicode Standard tokenization routines and orthography profile segmentation ☆38 Updated 11 months ago
- Targeted language identifier, based on FastText and Hunspell. ☆38 Updated 4 months ago
- A tokenizer and sentence splitter for German and English web and social media texts. ☆150 Updated last year
- ☆177 Updated 9 months ago
- Text tokenization and sentence segmentation (segtok v2) ☆208 Updated 3 years ago