alvenirai / punctfixLinks
β23Updated last year
Alternatives and similar repositories for punctfix
Users that are interested in punctfix are comparing it to the libraries listed below
Sorting:
- β358Updated last year
- πAn easy-to-use package to restore punctuation of the text.β118Updated 2 years ago
- Small repo describing how to use Hugging Face's Wav2Vec2 with PyCTCDecodeβ111Updated 3 years ago
- A merged version of multiple open-source German speech datasets.β33Updated last year
- β39Updated 3 years ago
- Parse and convert numbers written in French, English, Spanish, Portuguese, German and Catalan into their digit representation.β109Updated 4 months ago
- A model that predicts the punctuation of English, Italian, French and German texts.β80Updated 2 years ago
- Reproducible experimental protocols for multimedia (audio, video, text) databaseβ107Updated last month
- Speakerbox: Fine-tune Audio Transformers for speaker identification.β59Updated 10 months ago
- DaCy: The State of the Art Danish NLP pipeline using SpaCyβ98Updated 9 months ago
- A PyTorch Lightning Callback for pushing models to the Hugging Face Hub π€β‘οΈβ35Updated 3 years ago
- Universal Romanizer that can convert any unicode script to roman (latin) scriptβ226Updated last year
- β310Updated last year
- β48Updated 2 years ago
- Triton backend for https://github.com/OpenNMT/CTranslate2β35Updated 2 years ago
- Linguistic processing for Common Voiceβ57Updated last year
- Text utilities, including beam search decoding, tokenizing, and more, built for use in Flashlight.β74Updated last month
- A PyPI package for fast word/character error rate (WER/CER) calculationβ72Updated 2 years ago
- Execute arbitrary SQL queries on π€ Datasetsβ32Updated last year
- β56Updated 2 years ago
- Various speech datasets made available to the publicβ131Updated 10 months ago
- π¬ Language Identification with Support for More Than 2000 Labels -- EMNLP 2023β162Updated 4 months ago
- This is a neural spelling checkerβ67Updated 2 years ago
- A spaCy custom component that extracts and normalizes temporal expressionsβ55Updated 2 years ago
- An easy way to fine-tune Wav2Vec 2.0 for low-resource languages.β82Updated 2 years ago
- A collection of scripts to preprocess ASR datasets and finetune language-specific Wav2Vec2 XLSR modelsβ31Updated 4 years ago
- Audio Preprocessing and finetuning of wav2vec2-large-xlsr model on AI4D Baamtu Datamation - Automatic Speech Recognition in WOLOF Data.β17Updated 3 years ago
- Grapheme-to-Phoneme transductions that preserve input and output indices, and support cross-lingual g2p!β176Updated this week
- The Gridspace-Stanford Harper Valley speech dataset. Created in support of CS224S.β49Updated 4 years ago
- Repository contains code to fine-tune WhisperASR modelβ23Updated 2 years ago