benob / recasepunc
Model for recasing and repunctuating ASR transcripts
β133Updated 10 months ago
Alternatives and similar repositories for recasepunc:
Users that are interested in recasepunc are comparing it to the libraries listed below
- A tokenizer, text cleaner, and phonemizer for many human languages.β303Updated 3 months ago
- Support tools for punctuation and boundary detection for ASR output.β57Updated 2 years ago
- πΈSTT integration examplesβ125Updated 2 years ago
- phoneme tokenizer and grapheme-to-phoneme model for 8k languagesβ154Updated last year
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of codeβ145Updated 9 months ago
- How to create your own model for voskβ70Updated 3 years ago
- Data and code for grapheme-to-phoneme transducers in lots of languagesβ132Updated 10 months ago
- A python package for deep multilingual punctuation prediction.β115Updated 6 months ago
- Segment an audio file and obtain utterance alignments. (Python package)β328Updated 9 months ago
- Some simple wrappers around kaldi-asr intended to make using kaldi's (online) decoders as convenient as possible.β170Updated 3 years ago
- A model that predicts the punctuation of English, Italian, French and German texts.β78Updated last year
- Python server for communicating with Kaldi from the browser using WebRTCβ69Updated last year
- Grapheme to phoneme conversion with deep learning.β376Updated last year
- β38Updated 3 years ago
- Spoken Language Identification on Common Voice and AudioSet using Deep Learningβ37Updated 2 years ago
- πAn easy-to-use package to restore punctuation of the text.β112Updated last year
- Open tools and data for cloudless automatic speech recognitionβ447Updated 3 years ago
- Small language toolkit for creation, interpolation and pruning of ARPA language modelsβ91Updated 2 years ago
- Few-shot Keyword Spotting in Any Language and Multilingual Spoken Word Corpusβ172Updated 2 months ago
- DeepSpeech based forced alignment toolβ237Updated 4 years ago
- β22Updated 3 years ago
- Reproducible experimental protocols for multimedia (audio, video, text) databaseβ96Updated last week
- Modified version of RusStress (https://github.com/MashaPo/russtress) β python package for placing stress in Russian text using RNN (BiLSTβ¦β31Updated 6 months ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisperβ107Updated 2 years ago
- Punctuation and casing restoration for the Russian Language (BERT-based)β20Updated 3 years ago
- openvino version of openai/whisperβ165Updated last year
- Punctuation Restoration using Transformer Models for High-and Low-Resource Languagesβ206Updated 6 months ago
- Automatically constructing corpus for automatic speech recognition from YouTube videosβ154Updated 5 years ago
- Simple Kaldi model server for chain (nnet3) models in online recognition mode directly from a local microphoneβ36Updated 3 years ago
- Linguistic processing for Common Voiceβ53Updated last year