allo-media / text2num
Parse and convert numbers written in French, English, Spanish, Portuguese, German and Catalan into their digit representation.
β106Updated 2 months ago
Alternatives and similar repositories for text2num:
Users that are interested in text2num are comparing it to the libraries listed below
- A module for normalising text.β173Updated 3 years ago
- πAn easy-to-use package to restore punctuation of the text.β114Updated 2 years ago
- Support tools for punctuation and boundary detection for ASR output.β57Updated 2 years ago
- Punctuation Restoration using Transformer Models for High-and Low-Resource Languagesβ211Updated 8 months ago
- Massively multilingual pronunciation miningβ335Updated last week
- A python package for deep multilingual punctuation prediction.β120Updated 7 months ago
- Punctuation restoration and spell correction experiments.β252Updated 4 years ago
- A model that predicts the punctuation of English, Italian, French and German texts.β80Updated 2 years ago
- Rust-based Python wrapper for duckling library in Haskellβ25Updated 4 years ago
- Model for recasing and repunctuating ASR transcriptsβ133Updated last year
- SegEval Segmentation Evaluation Packageβ56Updated last year
- In the wild extraction of entities that are found using Flair and displayed using a very elegant front-end.β71Updated 2 years ago
- Complimentary code for our paper Automatic punctuation restoration with BERT modelsβ49Updated last year
- Hunspell extension for spaCy 2.0.β94Updated 8 months ago
- βοΈContextual word checker for better suggestions (not actively maintained)β414Updated 2 months ago
- Automatic extraction of edited sentences from text edition histories.β83Updated 3 years ago
- Compound splitter for German language ("Komposita-Zerlegung") based on large dictionary combined with highly efficient multi-pattern striβ¦β24Updated 2 years ago
- Grapheme-to-Phoneme transductions that preserve input and output indices, and support cross-lingual g2p!β158Updated last week
- python code for converting among IPA, ARPABET, XSAMPA, Callhome, DISC, TIMIT, plus some lexical tones.β33Updated last year
- Augmenty is an augmentation library based on spaCy for augmenting texts.β153Updated 10 months ago
- β22Updated last year
- Open Source AI Benchmarking toolkit for benchmarking speech to text servicesβ55Updated 11 months ago
- Automatically constructing corpus for automatic speech recognition from YouTube videosβ154Updated 5 years ago
- Abydos NLP/IR library for Pythonβ185Updated 2 years ago
- Cython wrapper on Hunspell Dictionaryβ66Updated 9 months ago
- Python package and data files for manipulating phonological segments (phones, phonemes) in terms of universal phonological features.β250Updated 8 months ago
- A spaCy custom component that extracts and normalizes temporal expressionsβ54Updated 2 years ago
- Simple multilingual lemmatizer for Python, especially useful for speed and efficiencyβ154Updated 4 months ago
- now you can even use apertium from pythonβ31Updated last year
- Faster, modernized fork of the language identification tool langid.pyβ55Updated 4 months ago