mechanicalsea / spectra
Spectra extraction tutorials based on torch and torchaudio.
☆41Aug 8, 2023Updated 2 years ago
Alternatives and similar repositories for spectra
Users who are interested in spectra compare it to the libraries listed below
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.☆27Feb 15, 2024Updated last year
- code for paper "learning to fool the speaker recognition"☆10Jun 12, 2020Updated 5 years ago
- [ICLR 2022] "Audio Lottery: Speech Recognition Made Ultra-Lightweight, Noise-Robust, and Transferable", by Shaojin Ding, Tianlong Chen, Z…☆32Apr 8, 2022Updated 3 years ago
- Efficient Speech Processing Toolkit for Automatic Speaker Recognition☆17Feb 8, 2023Updated 3 years ago
- ☆17Apr 14, 2023Updated 2 years ago
- ☆12Jun 14, 2022Updated 3 years ago
- ☆17Dec 17, 2025Updated last month
- Converts TensorFlow checkpoints (with index, meta and data files) to PyTorch, HDF5 and JSON☆18Feb 26, 2021Updated 4 years ago
- The codebase for Data-driven general-purpose voice activity detection.☆93Aug 3, 2023Updated 2 years ago
- WavEncoder is a Python library for encoding audio signals, transforms for audio augmentation, and training audio classification models wi…☆92Jun 6, 2021Updated 4 years ago
- Composing General Audio Representation by Fusing Multilayer Features of a Pre-trained Model☆26Apr 26, 2023Updated 2 years ago
- Repository for Weak Label Learning for Audio Events - A closer look. Uses Audioset subset data provided for reproducibility.☆32Sep 13, 2023Updated 2 years ago
- Code for DeCoAR (ICASSP 2020) and BERTphone (Odyssey 2020)☆104Nov 26, 2022Updated 3 years ago
- ☆29May 4, 2020Updated 5 years ago
- 24-hour Automatic Speech Recognition☆27Jun 4, 2021Updated 4 years ago
- Unofficial PyTorch implementation of GLAMpoints: Greedily Learned Accurate Match points☆28Jun 22, 2022Updated 3 years ago
- Deep speaker embeddings in PyTorch, including x-vectors. Code used in this work: https://arxiv.org/abs/2007.16196☆320Nov 11, 2020Updated 5 years ago
- 1st place solution to the DCASE 2019 - Task 5 - Urban Sound Tagging☆30Mar 19, 2021Updated 4 years ago
- Python toolkit for likelihood-ratio calibration of binary classifiers☆27Feb 21, 2023Updated 2 years ago
- ☆32Jul 27, 2022Updated 3 years ago
- Compute useful transcriptions metrics (CER, WER, SER, ...)☆27Nov 20, 2014Updated 11 years ago
- PyTorch implementation of Extended U-Net for Speaker Verification in Noisy Environments☆28Jul 24, 2023Updated 2 years ago
- GPU accelerated implementation of i-vector extractor training using PyTorch. Requires Kaldi for feature extraction and UBM training. An e…☆63Oct 15, 2019Updated 6 years ago
- ☆76Oct 25, 2021Updated 4 years ago
- Python library for audio augmentation☆85Jul 6, 2023Updated 2 years ago
- CN-Celeb, a large-scale Chinese celebrities dataset published by Center for Speech and Language Technology (CSLT) at Tsinghua University.☆77Nov 9, 2019Updated 6 years ago
- Creates CMM script that can be directly executed on Kaggle from easy merge script☆13Jan 12, 2026Updated last month
- Source code for the paper "Memory-Efficient Fine-Tuning via Low-Rank Activation Compression"☆13Aug 1, 2025Updated 6 months ago
- ☆11Jan 28, 2022Updated 4 years ago
- ☆76Mar 18, 2022Updated 3 years ago
- Applications using the GTN library and code to reproduce experiments in "Differentiable Weighted Finite-State Transducers"☆83Jul 20, 2022Updated 3 years ago
- An audio classification system for learning with out-of-distribution data☆33Dec 8, 2022Updated 3 years ago
- One-shot TTS with Improved Unseen Speaker and Style Transfer☆37Mar 2, 2022Updated 3 years ago
- Gamma Agreement in Python☆45Mar 4, 2024Updated last year
- This project focuses on developing a machine learning model to classify various electrical fault types in a transmission line. The model …☆15Apr 9, 2024Updated last year
- ☆34Updated this week
- Phase Vocoder and Wavelet Transform Implementation for Pitch Shifting a sound signal☆11Jul 27, 2020Updated 5 years ago
- Using large language models to maintain AI_CHANGELOG.md☆14Jul 15, 2024Updated last year
- 1st place solution to the DCASE 2020 - Task 5 - Urban Sound Tagging with Spatiotemporal Context☆16Dec 8, 2022Updated 3 years ago