nomonosound/fast-align-audio

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/nomonosound/fast-align-audio)

nomonosound / fast-align-audio

A fast python library for aligning similar audio snippets passed in as NumPy arrays

☆50

Alternatives and similar repositories for fast-align-audio

Users that are interested in fast-align-audio are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

nomonosound / log-wmse-audio-quality
View on GitHub
logWMSE, an audio quality metric with support for digital silence target. Useful for evaluating audio source separation systems, even whe…
☆39Jun 24, 2025Updated last year
nomonosound / numpy-minmax
View on GitHub
A fast function (SIMD-accelerated) for finding the minimum and maximum value in a NumPy array
☆15Jul 20, 2026Updated last week
kyegomez / Audio-xLSTMs
View on GitHub
Implementation of "Audio xLSTMs: Learning Self-supervised audio representations with xLSTMs" in PyTorch
☆20Updated this week
nomonosound / yulewalker
View on GitHub
IIR filter estimation of an arbitrary magnitude response using the modified Yule-Walker method
☆22Apr 17, 2024Updated 2 years ago
idiap / zff_vad
View on GitHub
Unsupervised Voice Activity Detection by Modeling Source and System Information using Zero Frequency Filtering
☆23Oct 19, 2023Updated 2 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
popcornell / OSDC
View on GitHub
☆18Jan 26, 2021Updated 5 years ago
kwatcharasupat / source-separation-landing
View on GitHub
Landing Page for All Things Source Separation
☆38Sep 12, 2025Updated 10 months ago
chymaera96 / NeuralSampleID
View on GitHub
An automatic sample identification (ASID) system using a contrastively trained GNN encoder.
☆17Sep 21, 2025Updated 10 months ago
MTG / PodcastMix-inference
View on GitHub
☆32Jan 6, 2022Updated 4 years ago
Pexeso / audio-fingerprinting-benchmark-toolkit
View on GitHub
☆21Dec 19, 2023Updated 2 years ago
XiaoyuBIE1994 / SDCodec
View on GitHub
(ICASSP 2025) Learning Source Disentanglement in Neural Audio Codec
☆48May 16, 2025Updated last year
helemanc / audio-based-lyrics-matching
View on GitHub
Official Implementation of the paper "Leveraging Whisper Embeddings for Audio-based Lyrics Matching"
☆17Apr 23, 2026Updated 3 months ago
asteroid-team / pytorch-pit
View on GitHub
Permutation invariant training in PyTorch
☆13Oct 2, 2020Updated 5 years ago
SpeechColab / PySpeechColab
View on GitHub
A library of speech gadgets.
☆15Oct 15, 2022Updated 3 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
benfmiller / audalign
View on GitHub
Package for aligning audio files through audio fingerprinting
☆148May 26, 2026Updated 2 months ago
desh2608 / diarizer
View on GitHub
Clustering-based methods for overlapping diarization
☆84Jan 12, 2024Updated 2 years ago
Sreyan88 / ReCLAP
View on GitHub
☆33Dec 23, 2025Updated 7 months ago
SarthakYadav / axlstm-official
View on GitHub
Official repository for the paper "Audio xLSTMs: Learning Self-supervised audio representations with xLSTMs"
☆21Sep 7, 2025Updated 10 months ago
jonashaag / RealRIRs
View on GitHub
Python loaders for many Real Room Impulse Response databases
☆97Sep 30, 2024Updated last year
desh2608 / gss
View on GitHub
A simple package for Guided source separation (GSS)
☆134May 20, 2024Updated 2 years ago
schufo / tisms
View on GitHub
This is the code of the ICASSP 2020 paper "Joint phoneme alignment and text-informed speech separation on highly corrupted speech"
☆16Apr 8, 2024Updated 2 years ago
deezer / musicFPaugment
View on GitHub
Code for reproducting the paper Music Augmentation and Denoising For Peak-Based Audio Fingerprinting
☆17Oct 31, 2023Updated 2 years ago
MTG / Podcastmix
View on GitHub
PodcastMix A dataset for separating music and speech in podcasts.
☆44Aug 20, 2024Updated last year
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
yongyizang / TrainingFreeMultiStepASR
View on GitHub
Official Repository for "Training-Free Multi-Step Audio Source Separation"
☆54May 26, 2025Updated last year
chymaera96 / GraFP
View on GitHub
Official repository for GraFPrint: an audio identification framework based on graph neural networks.
☆41Sep 18, 2025Updated 10 months ago
google-research / last
View on GitHub
A JAX library for building lattice-based speech transducer models
☆48Jul 2, 2026Updated 3 weeks ago
awni / future_speech
View on GitHub
The History of Speech Recognition to the Year 2030
☆13Aug 14, 2021Updated 4 years ago
fgnt / mms_msg
View on GitHub
Multipurpose Multi Speaker Mixture Signal Generator
☆46Feb 6, 2025Updated last year
TomohikoNakamura / asteroid_jaCappella
View on GitHub
☆14Jul 28, 2023Updated 3 years ago
Pliploop / SemiSupCon
View on GitHub
Semi-Supervised Contrastive Learning for music classification - towards HIL-representation learning.
☆17Jul 24, 2024Updated 2 years ago
WangHelin1997 / Fast-GeCo
View on GitHub
Source code and demo for INTERSPEECH 2024 paper: Noise-robust Speech Separation with Fast Generative Correction
☆50Nov 19, 2024Updated last year
Liu-Feng-deeplearning / CoverHunter
View on GitHub
Official PyTorch implementation of CoverHunter
☆43Nov 21, 2024Updated last year
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
egorsmkv / asr-corpus-creator
View on GitHub
This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.
☆27Feb 15, 2024Updated 2 years ago
vinusankars / ESOLA
View on GitHub
Epoch-synchronous overlap-add (ESOLA) for time-and pitch-scale modification of speech signals.
☆23Jul 24, 2020Updated 6 years ago
GATECH-EIC / S3-Router
View on GitHub
[NeurIPS 2022] "Losses Can Be Blessings: Routing Self-Supervised Speech Representations Towards Efficient Multilingual and Multitask Spee…
☆17Sep 19, 2023Updated 2 years ago
jeonchangbin49 / LimitAug
View on GitHub
☆23Aug 30, 2022Updated 3 years ago
NovaFrost / SHS100K
View on GitHub
metadata for SHS100K
☆24Dec 25, 2017Updated 8 years ago
danpovey / filtering
View on GitHub
Utilities for resampling and filtering audio data
☆47Jan 9, 2020Updated 6 years ago
groupmm / synctoolbox
View on GitHub
A Python toolbox with reference implementations for efficient, robust, and accurate music synchronization based on dynamic time warping (…
☆138May 28, 2026Updated 2 months ago