benob / recasepunc
Model for recasing and repunctuating ASR transcripts
☆132 Updated 9 months ago
Alternatives and similar repositories for recasepunc:
Users who are interested in recasepunc are comparing it to the libraries listed below.
- A tokenizer, text cleaner, and phonemizer for many human languages. ☆295 Updated 2 months ago
- How to create your own model for vosk ☆65 Updated 3 years ago
- phoneme tokenizer and grapheme-to-phoneme model for 8k languages ☆151 Updated last year
- DeepSpeech based forced alignment tool ☆235 Updated 4 years ago
- Parse and convert numbers written in French, English, Spanish, Portuguese, German and Catalan into their digit representation. ☆104 Updated 2 months ago
- Grapheme to phoneme conversion with deep learning. ☆367 Updated last year
- VOSK Speech Recognition Toolkit ☆390 Updated 2 years ago
- Segment an audio file and obtain utterance alignments. (Python package) ☆325 Updated 8 months ago
- Data and code for grapheme-to-phoneme transducers in lots of languages ☆131 Updated 9 months ago
- This is a github repository of the abandonware Sequitur G2P by Bisani & Ney ☆157 Updated 6 months ago
- A python package for deep multilingual punctuation prediction. ☆111 Updated 4 months ago
- Support tools for punctuation and boundary detection for ASR output. ☆57 Updated 2 years ago
- Gecko - A Tool for Effective Annotation of Human Conversations ☆279 Updated last year
- Some simple wrappers around kaldi-asr intended to make using kaldi's (online) decoders as convenient as possible. ☆170 Updated 3 years ago
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code ☆144 Updated 8 months ago
- Python server for communicating with Kaldi from the browser using WebRTC ☆68 Updated last year
- ☆22 Updated 3 years ago
- 🐸STT integration examples ☆122 Updated 2 years ago
- Phonetisaurus G2P ☆457 Updated 7 months ago
- Open tools and data for cloudless automatic speech recognition ☆446 Updated 3 years ago
- Grapheme-to-Phoneme transductions that preserve input and output indices, and support cross-lingual g2p! ☆144 Updated this week
- Linguistic processing for Common Voice ☆52 Updated last year
- ☆38 Updated 3 years ago
- Grapheme To Phoneme ☆70 Updated 5 months ago
- A python library for working with praat, textgrids, time aligned audio transcripts, and audio files. It is primarily used for extracting … ☆320 Updated last year
- Few-shot Keyword Spotting in Any Language and Multilingual Spoken Word Corpus ☆170 Updated last month
- Read, write, and manipulate Praat TextGrid files with Python ☆126 Updated last year
- Python library for manipulating pronunciations using the International Phonetic Alphabet (IPA) ☆84 Updated last year
- Zero-shot multimodal punctuation insertion and truecasing using Whisper ☆107 Updated last year
- Speaker diarization python system based on binary key speaker modelling ☆61 Updated 3 years ago