edufonseca / shift_secLinks

Code for the paper "Improving Sound Event Classification by Increasing Shift Invariance in Convolutional Neural Networks".

☆13

Alternatives and similar repositories for shift_sec

Users that are interested in shift_sec are comparing it to the libraries listed below

Sorting:

haoheliu / DCASE_2022_Task_5
System that ranks 2nd in DCASE 2022 Challenge Task 5: Few-shot Bioacoustic Event Detection
☆28Updated 3 years ago
ljuvela / GELP
☆26Updated 4 years ago
tqbl / ood_audio
An audio classification system for learning with out-of-distribution data
☆33Updated 2 years ago
shincling / discreteSeparation
The demo for "Discretization and Re-synthesis: an alternative method to solve the Cocktail Party Problem".
☆12Updated 3 years ago
Sytronik / deep-griffinlim-iteration
PyTorch implementation for Deep Griffin-Lim Iteration paper(https://arxiv.org/abs/1903.03971)
☆39Updated 5 years ago
ttaoREtw / semi-tts
Semi-supervised Learning for Multi-speaker Text-to-speech Synthesis Using Discrete Speech Representation
☆39Updated 4 years ago
speechnovateur / languagecodec_tmp
Temporary anonymous version
☆22Updated last year
SpeechColab / PySpeechColab
A library of speech gadgets.
☆13Updated 2 years ago
fgnt / paderbox
Paderbox: A collection of utilities for audio / speech processing
☆38Updated 2 months ago
sarulab-speech / lightweight_spkr_anon
Lightweight speaker anonymization [IEEE SLT2021]
☆26Updated 3 years ago
VITA-Group / Audio-Lottery
[ICLR 2022] "Audio Lottery: Speech Recognition Made Ultra-Lightweight, Noise-Robust, and Transferable", by Shaojin Ding, Tianlong Chen, Z…
☆31Updated 3 years ago
BirdVox / PCEN-SNR
Audio activity detector based on per-channel energy normalization (PCEN)
☆30Updated 6 years ago
biboamy / AVASpeech_Music_Labels
☆18Updated 3 years ago
ws-choi / LASAFT-Net-v2
A PyTorch implementation: "LASAFT-Net-v2: Listen, Attend and Separate by Attentively aggregating Frequency Transformation"
☆33Updated 3 years ago
hhguo / WaveRNN
Based on https://github.com/fatchord/WaveRNN
☆24Updated 5 years ago
MTG / PodcastMix-inference
☆32Updated 3 years ago
asteroid-team / pytorch-pit
Permutation invariant training in PyTorch
☆13Updated 4 years ago
schufo / tisms
This is the code of the ICASSP 2020 paper "Joint phoneme alignment and text-informed speech separation on highly corrupted speech"
☆15Updated last year
sarulab-speech / multi-speaker-dgp
Official implementation of DGP-based multi-speaker speech synthesis with PyTorch
☆24Updated 4 years ago
patrickltobing / shallow-wavenet
☆18Updated 5 years ago
PanagiotisP / svs-multiband
Code for the paper "MULTI-BAND MASKING FOR WAVEFORM-BASED SINGING VOICE SEPARATION" that was accepted on EUSIPCO2022
☆15Updated 3 years ago
etzinis / biased_separation
Code for the paper: Unified Gradient Reweighting for Model Biasing with Applications to Source Separation
☆14Updated 4 years ago
cschaefer26 / StyleMelGAN
☆10Updated last year
rhoposit / icassp2021
☆15Updated 4 years ago
Aria-K-Alethia / speaking-rate-controllable-hifi-gan
☆16Updated 3 years ago
gpu-poor / gramvaani_hindi_asr
This repo contains the baseline model recipes and pre-trained model for GramVanni hindi ASR challenge
☆15Updated 3 years ago
haoxiangsnr / audioinfo
A small tool to calculate the distribution of audio durations in a directory
☆14Updated 2 years ago
vivsivaraman / sourcesepganprior
☆18Updated 4 years ago
SubramaniKrishna / STFTgrad
Accompanying code for our paper "Optimizing Short-Time Fourier Transform Parameters via Gradient Descent"
☆33Updated 4 years ago
robflynnyh / long-context-asr
Code for the paper: How Much Context Does My Attention-Based ASR System Need?
☆10Updated 2 months ago