PyTorch port of Google Research's LEAF Audio paper
☆92May 19, 2021Updated 4 years ago
Alternatives and similar repositories for leaf-audio-pytorch
Users who are interested in leaf-audio-pytorch are comparing it to the libraries listed below.
- LEAF is a learnable alternative to audio features such as mel-filterbanks, that can be initialized as an approximation of mel-filterbanks…☆520Mar 1, 2022Updated 4 years ago
- PyTorch implementation of the LEAF audio frontend☆77Mar 29, 2023Updated 2 years ago
- Learnable STRF, from Riad et al. 2021 JASA☆13Aug 21, 2021Updated 4 years ago
- System that ranks 2nd in DCASE 2022 Challenge Task 5: Few-shot Bioacoustic Event Detection☆28Jul 6, 2022Updated 3 years ago
- Backpropagable PyTorch implementation of https://craffel.github.io/mir_eval/.☆35Jul 8, 2024Updated last year
- Code for the submitted 2021 DCASE Workshop paper: "Waveforms and Spectrograms: Enhancing Acoustic Scene Classification Using Multimodal F…☆16Aug 9, 2021Updated 4 years ago
- Code for the paper "Unsupervised Contrastive Learning of Sound Event Representations", ICASSP 2021.☆93Dec 22, 2022Updated 3 years ago
- ☆32Jan 6, 2022Updated 4 years ago
- Asteroid's filterbanks☆88Jan 12, 2025Updated last year
- A code library for training acoustic models☆27May 20, 2020Updated 5 years ago
- PyTorch implementation of time-domain filterbanks☆112Sep 16, 2021Updated 4 years ago
- A wrapper for Audeering's wav2vec-based dimensional speech emotion recognition☆21Aug 9, 2023Updated 2 years ago
- Code for the Paper Speech Recognition and Multi-Speaker Diarization of Long Conversations☆38Jun 12, 2023Updated 2 years ago
- Big Impulse Response Dataset☆156Oct 19, 2022Updated 3 years ago
- KABooks is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. Using a…☆12Mar 24, 2023Updated 2 years ago
- This repository was created for the NHN ASR hackathon competition.☆11Sep 20, 2023Updated 2 years ago
- ☆11Nov 5, 2021Updated 4 years ago
- A C++/Cython audio limiter for Python.☆25Feb 8, 2023Updated 3 years ago
- A library for speech data augmentation in time-domain☆683Aug 30, 2021Updated 4 years ago
- ☆69Feb 15, 2021Updated 5 years ago
- E2E TTS using Conditional Flow Matching (Experimental*)☆71Nov 10, 2023Updated 2 years ago
- Enable RNNLM lattice rescoring with PyTorch [kaldi]☆12Jun 5, 2020Updated 5 years ago
- This repo contains the baseline model recipes and pre-trained model for GramVanni Hindi ASR challenge☆15Mar 26, 2022Updated 3 years ago
- Wavelet phase harmonic scattering transform☆13Jul 5, 2022Updated 3 years ago
- ☆11Mar 22, 2023Updated 2 years ago
- Binaural impulse responses captured in real rooms.☆37Mar 9, 2016Updated 9 years ago
- Overlapped Speech detection in Multi-party Conversations☆22Feb 20, 2018Updated 8 years ago
- My hybrid TTS network that combines VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one☆26Aug 5, 2024Updated last year
- Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decoding☆75Oct 11, 2021Updated 4 years ago
- ☆508Jun 25, 2024Updated last year
- An extension of thu-spmi/CAT which contains a full-fledged implementation of CTC-CRF for Tensorflow.☆12Jul 5, 2021Updated 4 years ago
- ☆13Mar 11, 2025Updated 11 months ago
- Yin pitch estimator in PyTorch☆117Nov 7, 2022Updated 3 years ago
- PyTorch implementation of the paper: A Global-local Attention Framework for Weakly Labelled Audio Tagging.☆13Feb 6, 2021Updated 5 years ago
- Aty-TTS: Improving fairness for spoken language understanding in atypical speech with Text-to-Speech☆11May 14, 2025Updated 9 months ago
- Audio processing using a PyTorch 1D convolution network☆1,117Dec 7, 2025Updated 2 months ago
- This is the Python library for an unsupervised, fast method for robust voice activity detection (rVAD), as in the paper rVAD: An Unsuperv…☆151Jun 5, 2025Updated 8 months ago
- KATube is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. From a l…☆25Jul 27, 2024Updated last year
- Upsampling Artifacts in Neural Audio Synthesis – https://arxiv.org/abs/2010.14356☆82Feb 9, 2021Updated 5 years ago