awsaf49 / audio_classification_models
TensorFlow Audio Classification Models
☆13Updated 2 years ago
Alternatives and similar repositories for audio_classification_models
Users that are interested in audio_classification_models are comparing it to the libraries listed below
- SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition☆91Updated 5 years ago
- PyTorch implementation of "Squeezeformer: An Efficient Transformer for Automatic Speech Recognition" (NeurIPS 2022)☆149Updated 3 years ago
- An easy way to fine-tune Wav2Vec 2.0 for low-resource languages.☆80Updated 2 years ago
- PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supp…☆48Updated 2 years ago
- [DEPRECATED] A knowledge distillation toolkit based on PyTorch and PyTorch Lightning.☆137Updated last year
- This project is about performing Speaker diarization for Hindi Language.☆58Updated 4 years ago
- Transformer implementation specialized in speech recognition tasks using PyTorch.☆65Updated 4 years ago
- Speaker identification/verification models for Machine Learning for Computer Vision class at UNIBO☆67Updated 3 years ago
- Wav2Vec for speech recognition, classification, and audio classification☆269Updated 3 years ago
- ☆67Updated 6 months ago
- FastAudio is a learnable audio frontend designed by team Magnum for the ASVspoof 2021 challenge☆45Updated 2 years ago
- Audio classification is a popular topic; here I implement several models using TensorFlow and Keras.☆24Updated 5 years ago
- This is the official repository of the papers "Parameter-Efficient Transfer Learning of Audio Spectrogram Transformers" and "Efficient Fi…☆38Updated last year
- A Mixed Sample Data Augmentation method for Training with Time-Frequency Domain Features☆10Updated 3 years ago
- [NeurIPS'22] Squeezeformer: An Efficient Transformer for Automatic Speech Recognition☆263Updated 2 years ago
- Neural HMMs are all you need (for high-quality attention-free TTS)☆163Updated 2 weeks ago
- ☆13Updated 2 years ago
- Implementation of the convolutional module from the Conformer paper, for use in Transformers☆429Updated 2 years ago
- This repository includes the code to reproduce our paper "Raw Differentiable Architecture Search for Speech Deepfake and Spoofing Detecti…☆11Updated 2 years ago
- A speaker gender classifier. MFC feature engineering and a pre-trained ResNet-50. GradCAM interpretation.☆27Updated 4 years ago
- Making More of Little Data: Improving Low-Resource Automatic Speech Recognition Using Data Augmentation☆17Updated 2 years ago
- Prompting Whisper for Audio-Visual Speech Recognition, Code-Switched Speech Recognition, and Zero-Shot Speech Translation☆151Updated last year
- Repository containing an experimentation platform for training and running inference on wav2vec2 models.☆87Updated 3 years ago
- Library of TensorFlow layers for audio data processing and data augmentation☆20Updated 3 years ago
- PyTorch implementation of "Deep Speech 2: End-to-End Speech Recognition in English and Mandarin" (ICML, 2016)☆26Updated 4 years ago
- Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023.☆92Updated 2 years ago
- Fine-tune Wav2Vec 2.0 for Speech Recognition☆145Updated 10 months ago
- Simple Python script to compute equal error rate (EER) for machine learning model evaluation.☆42Updated 5 years ago
- ☆49Updated 2 years ago
- An implementation of Speech Emotion Recognition, based on HuBERT model, training with PyTorch and HuggingFace framework, and fine-tuning …☆33Updated 3 years ago