alvenirai / punctfix
☆24 Updated last year
Alternatives and similar repositories for punctfix
Users who are interested in punctfix are comparing it to the libraries listed below.
- An easy-to-use package to restore punctuation of the text. ☆119 Updated 2 years ago
- Small repo describing how to use Hugging Face's Wav2Vec2 with PyCTCDecode ☆111 Updated 3 years ago
- ☆359 Updated last year
- Parse and convert numbers written in French, English, Spanish, Portuguese, German and Catalan into their digit representation. ☆111 Updated 6 months ago
- This is a neural spelling checker ☆69 Updated 3 years ago
- A PyTorch Lightning Callback for pushing models to the Hugging Face Hub 🤗⚡️ ☆35 Updated 3 years ago
- ☆40 Updated 3 years ago
- Triton backend for https://github.com/OpenNMT/CTranslate2 ☆35 Updated 2 years ago
- ☆49 Updated 3 years ago
- ☆318 Updated last year
- Execute arbitrary SQL queries on 🤗 Datasets ☆32 Updated last year
- A merged version of multiple open-source German speech datasets. ☆33 Updated last year
- Universal Romanizer that can convert any unicode script to roman (latin) script ☆232 Updated last year
- A collection of scripts to preprocess ASR datasets and finetune language-specific Wav2Vec2 XLSR models ☆30 Updated 4 years ago
- A python package for deep multilingual punctuation prediction. ☆152 Updated last year
- Linguistic processing for Common Voice ☆58 Updated last year
- A model that predicts the punctuation of English, Italian, French and German texts. ☆83 Updated 2 years ago
- DaCy: The State of the Art Danish NLP pipeline using SpaCy ☆99 Updated 11 months ago
- Various speech datasets made available to the public ☆130 Updated last year
- Text utilities, including beam search decoding, tokenizing, and more, built for use in Flashlight. ☆78 Updated this week
- An easy way to fine-tune Wav2Vec 2.0 for low-resource languages. ☆80 Updated 2 years ago
- Zero-shot Audio Classification using Whisper ☆79 Updated 3 years ago
- Speakerbox: Fine-tune Audio Transformers for speaker identification. ☆59 Updated last year
- Repository for fine-tuning Transformers 🤗 based seq2seq speech models in JAX/Flax. ☆38 Updated 2 years ago
- Final training script from HuggingFace Whisper Fine tuning event - to get best results on finetuned model. ☆12 Updated 2 years ago
- ☆44 Updated 3 years ago
- 🔬 Language Identification with Support for More Than 2000 Labels -- EMNLP 2023 ☆176 Updated last month
- The Gridspace-Stanford Harper Valley speech dataset. Created in support of CS224S. ☆49 Updated 4 years ago
- ☆156 Updated last week
- ☆56 Updated 3 years ago