alvenirai / punctfix
☆24Updated last year
Alternatives and similar repositories for punctfix
Users who are interested in punctfix are comparing it to the libraries listed below.
- Parse and convert numbers written in French, English, Spanish, Portuguese, German and Catalan into their digit representation.☆112Updated last month
- ☆357Updated last year
- 📝An easy-to-use package to restore punctuation of the text.☆119Updated 2 years ago
- Small repo describing how to use Hugging Face's Wav2Vec2 with PyCTCDecode☆111Updated 3 years ago
- A merged version of multiple open-source German speech datasets.☆34Updated last year
- A PyTorch Lightning Callback for pushing models to the Hugging Face Hub 🤗⚡️☆35Updated 3 years ago
- This is a neural spelling checker☆69Updated 3 years ago
- Triton backend for https://github.com/OpenNMT/CTranslate2☆35Updated 2 years ago
- A model that predicts the punctuation of English, Italian, French and German texts.☆83Updated 2 years ago
- A Python package for deep multilingual punctuation prediction.☆156Updated last year
- Execute arbitrary SQL queries on 🤗 Datasets☆32Updated 2 years ago
- A Python package for finding words that sound like other words. Useful for entity resolution and poetry, among other things.☆15Updated 3 years ago
- ☆50Updated 3 years ago
- A collection of scripts to preprocess ASR datasets and finetune language-specific Wav2Vec2 XLSR models☆30Updated 4 years ago
- Universal Romanizer that can convert any Unicode script to Roman (Latin) script☆237Updated last year
- DaCy: The State of the Art Danish NLP pipeline using SpaCy☆100Updated last year
- Augmenty is an augmentation library based on spaCy for augmenting texts.☆156Updated last year
- ☆323Updated last year
- Various speech datasets made available to the public☆130Updated last year
- ☆40Updated 4 years ago
- Text utilities, including beam search decoding, tokenizing, and more, built for use in Flashlight.☆78Updated last month
- A PyPI package for fast word/character error rate (WER/CER) calculation☆71Updated 2 years ago
- Segment an audio file and obtain utterance alignments. (Python package)☆345Updated last year
- Grapheme-to-Phoneme transductions that preserve input and output indices, and support cross-lingual g2p!☆190Updated 2 weeks ago
- Final training script from the Hugging Face Whisper fine-tuning event, for getting the best results from a fine-tuned model.☆12Updated 3 years ago
- Confection: the sweetest config system for Python☆193Updated this week
- ☆45Updated 3 years ago
- Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding☆75Updated 4 years ago
- An easy way to fine-tune Wav2Vec 2.0 for low-resource languages.☆80Updated 2 years ago
- Linguistic processing for Common Voice☆58Updated 2 years ago