speechbrain/HyperPyYAML

Extensions to YAML syntax for better python interaction

☆80

Alternatives and similar repositories for HyperPyYAML

Users that are interested in HyperPyYAML are comparing it to the libraries listed below.

OSU-slatelab / mimic-enhance
Speech enhancement using mimic loss
☆16 · Oct 25, 2019 · Updated 6 years ago
chenpk00 / IS2024_stream_decoder_only_asr
☆16 · Mar 12, 2024 · Updated 2 years ago
facebookresearch / MMCSG
This repository contains the baseline system for CHiME-8 MMCSG challenge focusing on transcribing both sides of a conversation where one …
☆41 · Mar 13, 2024 · Updated 2 years ago
Thytu / SMIT
SMIT: A Simple Modality Integration Tool
☆15 · Mar 31, 2024 · Updated 2 years ago
zhaoyi2 / xvector-cnceleb
kaldi based x-vector trained on Cn-Celeb
☆13 · Sep 22, 2020 · Updated 5 years ago
grazder / samejs
Streaming Audio Models Examples in JS
☆20 · Mar 29, 2024 · Updated 2 years ago
Beilong-Tang / TSELM
Official Implementation of TSELM: Target speaker extraction using discrete tokens and language models
☆60 · Apr 14, 2025 · Updated last year
lmxue / NVV-SuperBench
NVV-SuperBench: Beyond Words, Beyond Quality—Benchmarking Nonverbal Vocalizations in Speech Generation (Interspeech 2026 long paper)
☆18 · Jun 21, 2026 · Updated last month
lumaku / ctc-segmentation
Segment an audio file and obtain utterance alignments. (Python package)
☆348 · May 15, 2024 · Updated 2 years ago
OSU-slatelab / LibriStutter
A recipe for disfluency detection on the LibriStutter dataset using SpeechBrain
☆11 · Mar 13, 2021 · Updated 5 years ago
AmirmohammadRostami / ASV-anti-spoofing-with-EABN
☆15 · Feb 25, 2023 · Updated 3 years ago
deepvk / istftnet
iSTFTNet : Fast and Lightweight Mel-spectrogram Vocoder Incorporating Inverse Short-time Fourier Transform
☆14 · Aug 25, 2023 · Updated 2 years ago
flashlight / sequence
Sequence algorithms for use in Flashlight.
☆14 · Jan 12, 2026 · Updated 6 months ago
tuanct1997 / Federated-Learning-ASR-based-on-wav2vec-2.0
☆18 · Mar 13, 2024 · Updated 2 years ago
martinmamql / relative_predictive_coding
Project page for paper Self-supervised Representation Learning with Relative Predictive Coding
☆19 · Jul 8, 2021 · Updated 5 years ago
utter-project / mHuBERT-147-scripts
Collection of scripts from mHuBERT-147.
☆35 · Nov 19, 2024 · Updated last year
danpovey / openfst
Dan's repository of OpenFst (manually created by downloading certain versions of OpenFst), created to track certain patches.
☆13 · Mar 8, 2016 · Updated 10 years ago
Chaanks / stklia
simple version of our torch kaldi toolkit, developed at the LIA by 2 apprentices. (@Chaanks & @vbrignatz)
☆10 · Oct 10, 2021 · Updated 4 years ago
speechbrain / benchmarks
This repository contains the SpeechBrain Benchmarks
☆140 · Feb 3, 2026 · Updated 5 months ago
desh2608 / gss
A simple package for Guided source separation (GSS)
☆134 · May 20, 2024 · Updated 2 years ago
upskyy / Automatic-Speech-Recognition-Models
End-to-End Korean Automatic Speech Recognition leveraging PyTorch and Hydra.
☆10 · Jan 21, 2022 · Updated 4 years ago
yinruiqing / diarization_with_neural_approach
☆14 · Aug 9, 2018 · Updated 7 years ago
chimechallenge / chime-utils
Scripts for data generation, scoring and data manifest preparation for CHiME-8 DASR task.
☆26 · Feb 25, 2025 · Updated last year
Knoema / knoema-python-driver
☆12 · Jan 16, 2025 · Updated last year
0nutation / SLMTokBench
SLMTokBench for paper "SpeechTokenizer: Unified Speech Tokenizer for Speech Large Language Models"
☆37 · Aug 29, 2023 · Updated 2 years ago
Adel-Moumen / fast_sligru
☆12 · Mar 24, 2024 · Updated 2 years ago
microsoft / NOTSOFAR1-Challenge
NOTSOFAR-1 Challenge: Distant Diarization and ASR
☆65 · Feb 12, 2025 · Updated last year
slp-rl / SpokenStoryCloze
A spoken version of the textual story cloze benchmark
☆22 · Aug 6, 2023 · Updated 2 years ago
common-voice / cv-dataset
Metadata and versioning details for the Common Voice dataset
☆173 · Jun 16, 2026 · Updated last month
wavlab-speech / shinjiwlab.github.io
☆18 · Jul 20, 2026 · Updated last week
Lallapallooza / fast-audiomentations
⚡ Blazing fast audio augmentation in Python, powered by GPU for high-efficiency processing in machine learning and audio analysis tasks.
☆38 · May 8, 2026 · Updated 2 months ago
FAST-ASR / MarkovModels.jl
Julia package for Hidden Markov Model
☆34 · Sep 11, 2023 · Updated 2 years ago
awslabs / speech-representations
Code for DeCoAR (ICASSP 2020) and BERTphone (Odyssey 2020)
☆104 · Nov 26, 2022 · Updated 3 years ago
kensho-technologies / pyctcdecode
A fast and lightweight python-based CTC beam search decoder for speech recognition.
☆469 · Jul 13, 2023 · Updated 3 years ago
MontrealCorpusTools / Montreal-Forced-Aligner
Command line utility for forced alignment using Kaldi
☆1,855 · Jul 11, 2026 · Updated 2 weeks ago
ludlows / PESQ
PESQ (Perceptual Evaluation of Speech Quality) Wrapper for Python Users (narrow band and wide band)
☆630 · Mar 18, 2026 · Updated 4 months ago
rollovd / LookSAM
This is unofficial repository for Towards Efficient and Scalable Sharpness-Aware Minimization.
☆37 · Apr 15, 2024 · Updated 2 years ago
luotianze666 / WaveFM
[NAACL 2025] WaveFM: A High-Fidelity and Efficient Vocoder Based on Flow Matching
☆133 · Apr 8, 2026 · Updated 3 months ago
brohrer / lodgepole
Image and video processing toolbox
☆10 · Jun 12, 2020 · Updated 6 years ago