dredwardhyde / speech-recognition-examples

☆26

Related projects: ⓘ

jpuigcerver / xer
Compute useful transcriptions metrics (CER, WER, SER, ...)
☆26Updated 9 years ago
asappresearch / sew
☆74Updated 2 years ago
chuachinhon / wav2vec2_transformers
Transcribing audio files using Hugging Face's implementation of Wav2Vec2 + "chain-linking" NLP tasks to combine speech-to-text with downs…
☆31Updated 3 years ago
scionoftech / DeepAsr
Keras(Tensorflow) implementations of Automatic Speech Recognition
☆22Updated 2 years ago
MiviaLab / DENet
This repository contains the code related to the paper 'DENet: a deep architecture for audio surveillance applications'.
☆41Updated last year
Sreyan88 / Disfluency-Detection-with-Span-Classification
This repository contains the implementation of the paper: "Span Classification with Structured Information for Disfluency Detection in Sp…
☆12Updated last year
shangeth / SpeakerProfiling
Estimating the Age, Height, and Gender of a speaker with their speech signal. https://arxiv.org/pdf/2110.13653.pdf
☆57Updated 3 years ago
jumon / zac
Zero-shot Audio Classification using Whisper
☆74Updated last year
upskyy / ContextNet
PyTorch implementation of "ContextNet: Improving Convolutional Neural Networks for Automatic Speech Recognition with Global Context" (INT…
☆33Updated 2 years ago
MLSpeech / speech_yolo
SpeechYOLO Interspeech 2019
☆42Updated 2 years ago
juanmc2005 / SimilarityLearning
Similarity Learning applied to Speaker Verification and Semantic Textual Similarity
☆12Updated 4 years ago
sooftware / lightning-asr
Modular and extensible speech recognition library leveraging pytorch-lightning and hydra.
☆45Updated 3 years ago
daanzu / wav2vec2_stt_python
Simple Python library, distributed via binary wheels with few direct dependencies, for easily using wav2vec 2.0 models for speech recogni…
☆24Updated 3 years ago
vectominist / MiniASR
A mini, simple, and fast end-to-end automatic speech recognition toolkit.
☆47Updated last year
tongjinle123 / speech-transformer-pytorch_lightning
ASR project with pytorch-lightning
☆20Updated 4 years ago
tts-tutorial / ijcai2021
☆12Updated last year
oscarknagg / raw-audio-gender-classification
Machine learning experiment to perform gender classification from raw audio.
☆23Updated 6 years ago
wq2012 / SimpleDER
A lightweight library to compute Diarization Error Rate (DER).
☆59Updated last year
YashNita / sound-event-detection-winning-method
☆23Updated 5 years ago
phrasenmaeher / audio-transformation-visualization
A streamlit application that lets you explore the effect of different audio augmentation techniques
☆27Updated 2 years ago
Deepest-Project / Transformer-TTS
Implementation of "FastSpeech: Fast, Robust and Controllable Text to Speech"
☆64Updated last year
sooftware / End-to-End-Speech-Recognition-Models
PyTorch implementation of automatic speech recognition models.
☆38Updated 3 years ago
Edresson / Wav2Vec-Wrapper
An easy way to fine-tune Wav2Vec 2.0 for low-resource languages.
☆81Updated last year
joaoantoniocn / AM-SincNet
The Additive Margin SincNet (AM-SincNet) is a new approach for speaker recognition problems which is based in the neural network architec…
☆43Updated 11 months ago
MiniXC / LightningFastSpeech2
☆56Updated last year
auspicious3000 / SpeechSplit-Demo
Unsupervised Speech Decomposition via Triple Information Bottleneck
☆14Updated 4 years ago
resemble-ai / phonemizer
Simple text to phonemes converter for multiple languages
☆21Updated last year
unoti / voice-embeddings
Audio processing using deep neural networks. Speaker identification using voice embeddings.
☆12Updated last year
joaoantoniocn / AM-MobileNet1D
The Additive Margin MobileNet1D is a new light weight deep learning model for Speaker Recognition which is based on the MobileNetV2 archi…
☆29Updated 11 months ago