fmqa / samplecnn-speech-detection
Speech/Music discrimination using SampleCNN
☆18Updated 4 months ago
Related projects ⓘ
Alternatives and complementary repositories for samplecnn-speech-detection
- Production first, nn-based on-device signal processing toolkit.☆64Updated last year
- Code for reproducing experiments in "Domain-Adversarial Voice Activity Detection"☆23Updated 4 years ago
- A pitch tracker inspired by David Talkin's RAPT (Robust Algorithm for Pitch Tracking) written in Python.☆47Updated 8 years ago
- ☆13Updated last year
- Linear Prediction Coefficients estimation from mel-spectrogram implemented in Python based on Levinson-Durbin algorithm.☆68Updated 3 years ago
- Python wrapper for OpenFST and its extensions from Kaldi. Also support reading/writing ark/scp files☆47Updated 4 months ago
- Convert WSJ sphere format to waveform and do data simulation.☆16Updated 4 years ago
- DSing ASR task: Resources and Baseline for an unaccompanied singing ASR.☆20Updated 3 years ago
- ☆19Updated last year
- [InterSpeech 2020] "Improving the Speaker Identity of Non-Parallel Many-to-Many VoiceConversion with Adversarial Speaker Recognition" by …☆39Updated last year
- Non-intrusive Objective Speech Quality Assessment (NISQA) Challenge in Online Conferencing Applications☆43Updated 2 years ago
- Multipurpose Multi Speaker Mixture Signal Generator☆43Updated last month
- Objective measures of speech quality SNR☆18Updated 5 years ago
- Pytorch implementation of subband decomposition☆89Updated 2 years ago
- multilingual speech aligner☆72Updated last year
- Reproduction of paper: Disentangling Correlated Speaker and Noise for Speech Synthesis via Data Augmentation and Adversarial Factorizatio…☆17Updated 5 years ago
- A Full Text-Dependent End to End Mispronunciation Detection and Diagnosis with Easy Data Augment Techniques☆57Updated 3 years ago
- Materials accompanying the paper "Phonological features for 0-shot multilingual speech synthesis"☆32Updated 4 years ago
- GlottDNN vocoder and tools for training DNN excitation models☆32Updated 3 years ago
- Audio LPC (linear prediction code) using mel spectorgram, compatible for LPCNet☆61Updated 3 years ago
- Clustering-based methods for overlapping diarization☆71Updated 10 months ago
- Fully-Convolutional Network for Pitch Estimation of Speech Signals☆55Updated last year
- End-to-End Keyword Spotting (E2E-KWS) using a character level LSTM☆39Updated 2 years ago
- Python package implementing the TD-PSOLA algorithm for speech processing☆42Updated 7 years ago
- The implementation of "Optimizing Shoulder to Shoulder: A Coordinated Sub-Band Fusion Model for Real-Time Full-Band Speech Enhancement"☆51Updated last year
- System that ranks 2nd in DCASE 2022 Challenge Task 5: Few-shot Bioacoustic Event Detection☆27Updated 2 years ago
- Y-vector: Multiscale Waveform Encoder for Speaker Embedding☆24Updated 4 months ago
- Yin pitch estimator in PyTorch☆115Updated 2 years ago