k2-fsa / colab
Colab notebooks for Next-gen Kaldi
☆27Updated 3 weeks ago
Alternatives and similar repositories for colab:
Users that are interested in colab are comparing it to the libraries listed below
- ☆26Updated 3 months ago
- Decoders from Kaldi using OpenFst☆28Updated 3 months ago
- Python wrapper for kaldi's arpa2fst☆38Updated 5 months ago
- Python wrapper for OpenFST and its extensions from Kaldi. Also support reading/writing ark/scp files☆51Updated last week
- A simple command line tool to calculate WER for ASR.☆14Updated 6 months ago
- Some fast-ish algorithms for batch text search in moderate-sized collections, intended for data cleanup☆71Updated 8 months ago
- ☆25Updated 6 months ago
- Python Wrapper of Silero VAD☆51Updated 2 weeks ago
- faster inference☆28Updated 3 months ago
- (WIP)long form speech generatoins☆31Updated last month
- This repo related to the paper "A Framework for Phoneme-Level Pronunciation Assessment Using CTC" for INTERSPEECH2024☆20Updated 5 months ago
- Torchaudio Forced Aligner for Mixed Chinese (Mandarin or Cantonese) and English.☆11Updated 4 months ago
- ☆11Updated 3 years ago
- Chinese and English Bilinguish G2P☆21Updated last year
- Went online decode demo☆29Updated 4 years ago
- multilingual speech aligner☆74Updated last year
- Inference code for Audiodec-Valle-Wenetspeech4TTS☆49Updated 9 months ago
- Pronunciation-assisted Subword Modeling☆29Updated 5 years ago
- wav2vec2 audio classification for prosodic boundary detection and other tasks☆42Updated last year
- Kaldi-compatible online fbank extractor without external dependencies☆97Updated last week
- CTC decoder with hotwords for ASR.☆19Updated 3 weeks ago
- Just another FastSpeech 2 but cleaner code :)☆26Updated 10 months ago
- ☆39Updated 9 months ago
- noise reduction☆17Updated 10 months ago
- [ICASSP2023] Source code, model links and open test sets for paper SeACo-Paraformer.☆29Updated last year
- open-source Mandarian biased word dataset☆11Updated last year
- Prosodic Speech Segmentation with Transformers☆25Updated last year
- Utilizes ONNX Runtime for audio denoising.☆45Updated last week
- AutoPrep: An Automatic Preprocessing Framework for In-the-Wild Speech Data☆31Updated last year
- Grapheme-to-Phoneme for Mixed Chinese (Mandarin or Cantonese) and English.☆96Updated last month