allo-media / text2num
Parse and convert numbers written in French, English, Spanish, Portuguese, German and Catalan into their digit representation.
☆103Updated 2 weeks ago
Related projects ⓘ
Alternatives and complementary repositories for text2num
- Simple multilingual lemmatizer for Python, especially useful for speed and efficiency☆145Updated this week
- A module for normalising text.☆173Updated 3 years ago
- A model that predicts the punctuation of English, Italian, French and German texts.☆74Updated last year
- Python Finite-State Toolkit☆45Updated 2 weeks ago
- Linguistic processing for Common Voice☆52Updated 10 months ago
- Massively multilingual pronunciation mining☆321Updated this week
- A Python 3 phonetics library.☆124Updated 4 years ago
- Compound splitter for German language ("Komposita-Zerlegung") based on large dictionary combined with highly efficient multi-pattern stri…☆22Updated 2 years ago
- Polish morphological tagger.☆43Updated last year
- 📝An easy-to-use package to restore punctuation of the text.☆108Updated last year
- Support tools for punctuation and boundary detection for ASR output.☆57Updated last year
- Universal Romanizer that can convert any unicode script to roman (latin) script☆154Updated 3 months ago
- Model for recasing and repunctuating ASR transcripts☆129Updated 7 months ago
- A python package for deep multilingual punctuation prediction.☆99Updated 3 months ago
- Python module for syllabifying English ARPABET transcriptions☆64Updated 5 years ago
- Morfessor is a tool for unsupervised and semi-supervised morphological segmentation☆185Updated 4 years ago
- Abydos NLP/IR library for Python☆183Updated 2 years ago
- MorphyNet: a Large Multilingual Database of Derivational and Inflectional Morphology (+morpheme segmentation)☆36Updated last year
- In the wild extraction of entities that are found using Flair and displayed using a very elegant front-end.☆69Updated last year
- OpusCleaner is a web interface that helps you select, clean and schedule your data for training machine translation models.☆48Updated 2 months ago
- SIGMORPHON 2022 Shared Task on Morpheme Segmentation☆24Updated last year
- ✔️Contextual word checker for better suggestions (not actively maintained)☆409Updated last month
- Bicleaner fork that uses neural networks☆38Updated 3 months ago
- ipapy is a Python module to work with International Phonetic Alphabet (IPA) strings☆81Updated 6 months ago
- Efficient Trie-based regex unions for blacklist/whitelist filtering and one-pass mapping-based string replacing☆67Updated 3 weeks ago
- Text to sentence splitter using heuristic algorithm by Philipp Koehn and Josh Schroeder.☆230Updated 2 years ago
- spaCy + UDPipe☆161Updated 2 years ago
- This is a neural spell checker☆60Updated last year
- Convert number words (eg. twenty one) to numeric digits (21)☆168Updated last year
- Python package and data files for manipulating phonological segments (phones, phonemes) in terms of universal phonological features.☆221Updated 3 months ago