dredwardhyde / speech-recognition-examples
☆26Updated this week
Related projects: ⓘ
- Compute useful transcriptions metrics (CER, WER, SER, ...)☆26Updated 9 years ago
- ☆74Updated 2 years ago
- Transcribing audio files using Hugging Face's implementation of Wav2Vec2 + "chain-linking" NLP tasks to combine speech-to-text with downs…☆31Updated 3 years ago
- Keras(Tensorflow) implementations of Automatic Speech Recognition☆22Updated 2 years ago
- This repository contains the code related to the paper 'DENet: a deep architecture for audio surveillance applications'.☆41Updated last year
- This repository contains the implementation of the paper: "Span Classification with Structured Information for Disfluency Detection in Sp…☆12Updated last year
- Estimating the Age, Height, and Gender of a speaker with their speech signal. https://arxiv.org/pdf/2110.13653.pdf☆57Updated 3 years ago
- Zero-shot Audio Classification using Whisper☆74Updated last year
- PyTorch implementation of "ContextNet: Improving Convolutional Neural Networks for Automatic Speech Recognition with Global Context" (INT…☆33Updated 2 years ago
- SpeechYOLO Interspeech 2019☆42Updated 2 years ago
- Similarity Learning applied to Speaker Verification and Semantic Textual Similarity☆12Updated 4 years ago
- Modular and extensible speech recognition library leveraging pytorch-lightning and hydra.☆45Updated 3 years ago
- Simple Python library, distributed via binary wheels with few direct dependencies, for easily using wav2vec 2.0 models for speech recogni…☆24Updated 3 years ago
- A mini, simple, and fast end-to-end automatic speech recognition toolkit.☆47Updated last year
- ASR project with pytorch-lightning☆20Updated 4 years ago
- ☆12Updated last year
- Machine learning experiment to perform gender classification from raw audio.☆23Updated 6 years ago
- A lightweight library to compute Diarization Error Rate (DER).☆59Updated last year
- ☆23Updated 5 years ago
- A streamlit application that lets you explore the effect of different audio augmentation techniques☆27Updated 2 years ago
- Implementation of "FastSpeech: Fast, Robust and Controllable Text to Speech"☆64Updated last year
- PyTorch implementation of automatic speech recognition models.☆38Updated 3 years ago
- An easy way to fine-tune Wav2Vec 2.0 for low-resource languages.☆81Updated last year
- The Additive Margin SincNet (AM-SincNet) is a new approach for speaker recognition problems which is based in the neural network architec…☆43Updated 11 months ago
- ☆56Updated last year
- Unsupervised Speech Decomposition via Triple Information Bottleneck☆14Updated 4 years ago
- Simple text to phonemes converter for multiple languages☆21Updated last year
- Audio processing using deep neural networks. Speaker identification using voice embeddings.☆12Updated last year
- The Additive Margin MobileNet1D is a new light weight deep learning model for Speaker Recognition which is based on the MobileNetV2 archi…☆29Updated 11 months ago