Fast and high-quality sample-rate conversion library for Python
☆105 Oct 12, 2025 Updated 4 months ago
Alternatives and similar repositories for python-soxr
Users who are interested in python-soxr are comparing it to the libraries listed below.
- Permutation invariant training in PyTorch ☆13 Oct 2, 2020 Updated 5 years ago
- A differentiable version of SPTK ☆193 Feb 3, 2026 Updated 3 weeks ago
- HiFi++: a Unified Framework for Neural Vocoding, Bandwidth Extension and Speech Enhancement ☆158 Jul 16, 2022 Updated 3 years ago
- A Python Library for Fundamental Frequency Estimation in Music Recordings ☆54 Jan 16, 2026 Updated last month
- High fidelity, lightweight, end-to-end, streaming, convolution-based neural audio codec ☆115 Jun 23, 2025 Updated 8 months ago
- Repository of published DNN speech separation recipes for a number of datasets ☆12 Jan 22, 2024 Updated 2 years ago
- ☆54 Mar 2, 2023 Updated 2 years ago
- Speech Resynthesis and Language Modeling ☆27 Jun 11, 2025 Updated 8 months ago
- Whisper Speech Quality Assessment (WhiSQA) ☆16 Oct 14, 2025 Updated 4 months ago
- Official code for paper: "Speaking Clearly: A Simplified Whisper-Based Codec for Low-Bitrate Speech Coding" ☆33 Jan 28, 2026 Updated last month
- PyTorch implementation of subband decomposition ☆92 Jul 26, 2022 Updated 3 years ago
- Fast and differentiable time-domain all-pole filter in PyTorch ☆68 Feb 5, 2026 Updated 3 weeks ago
- Python bindings for libsamplerate based on CFFI and NumPy ☆57 Dec 6, 2025 Updated 2 months ago
- Deep Neural Pitch Extractor for Voice Conversion and TTS Training ☆146 Aug 22, 2022 Updated 3 years ago
- Official implementation of the source-filter HiFiGAN vocoder ☆268 Jul 29, 2023 Updated 2 years ago
- Evaluation tool used in the BigVSAN paper ☆14 Mar 22, 2024 Updated last year
- Accompanying code for our paper "Optimizing Short-Time Fourier Transform Parameters via Gradient Descent" ☆34 Oct 30, 2020 Updated 5 years ago
- Latent Space Sound Design Tool based on the VAE of stable-audio-open ☆15 Aug 23, 2024 Updated last year
- Source code for the EMNLP 2025 paper “DM-Codec: Distilling Multimodal Representations for Speech Tokenization” ☆56 Jun 1, 2025 Updated 8 months ago
- The open source code for LLM-Codec ☆145 Aug 18, 2024 Updated last year
- Big Impulse Response Dataset ☆156 Oct 19, 2022 Updated 3 years ago
- Pitch Estimating Neural Networks (PENN) ☆271 Apr 2, 2025 Updated 10 months ago
- Ultra-low-bitrate Speech Codec for Speech Language Modeling Applications ☆87 Dec 20, 2024 Updated last year
- ☆62 Nov 6, 2023 Updated 2 years ago
- PyTorch implementation of the ICASSP-24 paper: "Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Superv… ☆38 Jan 6, 2024 Updated 2 years ago
- Train no-reference speech quality estimators with multiple datasets via learned, per-dataset alignments. ☆18 Aug 1, 2025 Updated 6 months ago
- Frechet Audio Distance evaluation in PyTorch ☆36 Jun 9, 2023 Updated 2 years ago
- Official implementation of "AEROMamba: An efficient architecture for audio super-resolution using generative adversarial networks and sta… ☆50 Nov 11, 2025 Updated 3 months ago
- PodcastMix: A dataset for separating music and speech in podcasts. ☆44 Aug 20, 2024 Updated last year
- Official implementation of the paper "Laughter Synthesis using Pseudo Phonetic Tokens with a Large-scale In-the-wild Laughter Corpus" acc… ☆77 Jul 16, 2023 Updated 2 years ago
- A lightweight library for Frechet Audio Distance calculation. ☆309 Feb 11, 2026 Updated 2 weeks ago
- Implementation of Emo-StarGAN ☆46 Dec 19, 2023 Updated 2 years ago
- Comparison of Python audio resampling implementations ☆54 Jun 30, 2021 Updated 4 years ago
- ☆21 Jul 15, 2024 Updated last year
- Official repository for NAST: Noise Aware Speech Tokenization for Speech Language Models (Interspeech 2024) https://arxiv.org/abs/2406.11… ☆46 Jul 2, 2024 Updated last year
- PyTorch implementation of BigVSAN ☆203 Dec 9, 2025 Updated 2 months ago
- Deep Performer: Score-to-audio music performance synthesis ☆44 Jun 26, 2023 Updated 2 years ago
- ☆49 Apr 1, 2025 Updated 10 months ago
- ☆13 Jun 2, 2022 Updated 3 years ago