NIST SPH File reader (e.g. for TEDLIUM Corpus)
☆26May 2, 2020Updated 5 years ago
Alternatives and similar repositories for sphfile
Users that are interested in sphfile are comparing it to the libraries listed below
Sorting:
- readers that enable reading kaldi ark in tensorflow☆17Mar 7, 2018Updated 7 years ago
- Self-contained Python package for OpenFst☆51Feb 1, 2023Updated 3 years ago
- Compute the most likely permutation of a lattice given an LM☆10Jan 3, 2013Updated 13 years ago
- ☆53Dec 18, 2020Updated 5 years ago
- Optimizing speaker verification and spoofing countermeasure systems together with REINFORCE☆13Mar 31, 2021Updated 4 years ago
- ☆18Jan 17, 2022Updated 4 years ago
- SWIG bindings for Kaldi I/O, built with Conda☆15Dec 15, 2024Updated last year
- A toolkit for reproducible evaluation, diagnostic, and error analysis of speaker diarization systems☆242Dec 16, 2025Updated 2 months ago
- ☆15May 8, 2021Updated 4 years ago
- 2nd place solution for ID R&D Voice Antispoofing Challenge☆15Aug 22, 2019Updated 6 years ago
- Crowdsourced and Automatic Speech Prominence Estimation☆25Apr 12, 2024Updated last year
- CDER (Conversational Diarization Error Rate) Scoring Tool☆22Sep 13, 2022Updated 3 years ago
- Wavenet pytorch implementation for text-to-speech☆18Jul 19, 2023Updated 2 years ago
- VoxSRC2022 workshop development kit☆19Jul 21, 2022Updated 3 years ago
- A python IO interface for data accessing in kaldi☆39Mar 18, 2021Updated 4 years ago
- Addressing the confounds of accompaniments in singer identification☆18Mar 24, 2020Updated 5 years ago
- speech engine training projects☆29Apr 19, 2021Updated 4 years ago
- Pytorch implementation of sparse_image_warp and an example of GoogleBrain's SpecAugment is given: A Simple Data Augmentation Method for A…☆24Aug 7, 2019Updated 6 years ago
- Evaluation script for VoxMovies dataset in PyTorch☆23Jan 12, 2024Updated 2 years ago
- ☆21Sep 24, 2018Updated 7 years ago
- Code for the method proposed in the paper:- ccc-wav2vec 2.0: Clustering aided Cross-Contrastive learning of Self-Supervised speech repres…☆23Mar 18, 2024Updated last year
- Meta-embeddings are a probabilistic generalization of embeddings in machine learning.☆23Nov 23, 2018Updated 7 years ago
- Prosodic Speech Segmentation with Transformers☆26Feb 25, 2024Updated 2 years ago
- ☆24Sep 25, 2018Updated 7 years ago
- Small language toolkit for creation, interpolation and pruning of ARPA language models☆92Aug 6, 2022Updated 3 years ago
- ☆109Jun 14, 2023Updated 2 years ago
- Audio Diarization Annotation tool☆30Nov 8, 2019Updated 6 years ago
- Calculation of MCD (dB) between two speech waveforms☆57Sep 26, 2020Updated 5 years ago
- Baseline Recipe for VoicePrivacy Challenge 2020: https://www.voiceprivacychallenge.org/vp2020/docs/VoicePrivacy_2020_Eval_Plan_v1_3.pdf☆64Jul 6, 2023Updated 2 years ago
- Official codes for the paper "Learning Hierarchical Discrete Linguistic Units from Visually-Grounded Speech"☆28Feb 22, 2022Updated 4 years ago
- Implementation of the Links Online Clustering algorithm: https://arxiv.org/abs/1801.10123☆30Oct 9, 2021Updated 4 years ago
- Python wrappers for Kaldi Levenshtein's distance and alignment code.☆68Jan 5, 2026Updated last month
- This repository contains the training, inference, evaluation code for SpeechLLM models and details about the model releases on huggingfac…☆130Jun 25, 2024Updated last year
- Official PyTorch implementation of the paper "Robust Training for Speaker Verification against Noisy Labels" in INTERSPEECH 2023.☆11Oct 23, 2023Updated 2 years ago
- Yet another speech toolkit based on Kaldi and PyTorch☆173Jul 1, 2020Updated 5 years ago
- Audio Captioning datasets for PyTorch.☆127Jul 18, 2025Updated 7 months ago
- Python runtime for WeTextProcessing (does not depend on Pynini)☆48Nov 28, 2025Updated 3 months ago
- Versatile Evaluation of Speech and Audio☆392Dec 9, 2025Updated 2 months ago
- it's ASR decoder and make graph project☆33May 26, 2022Updated 3 years ago