bepierre/SpeechVGG

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/bepierre/SpeechVGG)

bepierre / SpeechVGG

Feature extractor for DL speech processing.

☆66

Alternatives and similar repositories for SpeechVGG

Users that are interested in SpeechVGG are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

SpeechColab / PySpeechColab
View on GitHub
A library of speech gadgets.
☆15Oct 15, 2022Updated 3 years ago
Neclow / SERAB
View on GitHub
SERAB: a multi-lingual benchmark for speech emotion recognition
☆28Dec 16, 2022Updated 3 years ago
fanlu / wenet
View on GitHub
Transformer based ASR Engine.
☆13Aug 23, 2021Updated 4 years ago
GasserElbanna / serab-byols
View on GitHub
(Hybrid) BYOL-S feature extractor using serab-byols package in pytorch.
☆27Apr 20, 2024Updated 2 years ago
mutiann / speech_rankings
View on GitHub
A CSRankings-like index for speech researchers
☆35Oct 16, 2024Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
JozefColdenhoff / OpenACE
View on GitHub
☆11Aug 1, 2025Updated 11 months ago
ttslr / MonTTS
View on GitHub
☆16Dec 23, 2021Updated 4 years ago
andi611 / Mockingjay-Speech-Representation
View on GitHub
Official Implementation of Mockingjay in Pytorch
☆55Jul 6, 2023Updated 3 years ago
KanikeSaiPrakash / Speech-Emotion-Recognition
View on GitHub
Speech Emotion Recognition using Deep Learning
☆13May 24, 2021Updated 5 years ago
revsic / torch-retriever-vc
View on GitHub
PyTorch implementation of Retriever: Learning Content-Style Representation
☆12Jan 27, 2023Updated 3 years ago
cyfer0618 / kaldi-pytorch-rnnlm
View on GitHub
Enable RNNLM lattice rescoring with Pytorch [kaldi]
☆12Jun 5, 2020Updated 6 years ago
CMsmartvoice / Unet-TTS
View on GitHub
One-shot TTS with Improved Unseen Speaker and Style Transfer
☆37Mar 2, 2022Updated 4 years ago
wentaozhu / speechnas
View on GitHub
SpeechNAS-Better-Trade-off-between-Latency-and-Accuracy-for-Large-Scale-Speaker-Verification
☆30Mar 24, 2023Updated 3 years ago
AnkushMalaker / speech-emotion-recognition
View on GitHub
Implementation of the paper "Speech emotion recognition with deep convolutional neural networks" by Dias Issa Et al.
☆13Dec 18, 2021Updated 4 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
gudgud96 / noisy-student-emotion-training
View on GitHub
Submission to MediaEval 2021 Emotions and Themes in Music challenge. Noisy-student training for music emotion tagging
☆11Dec 2, 2021Updated 4 years ago
TeaPoly / warp-ctc-crf
View on GitHub
An extension of thu-spmi/CAT which contains a full-fledged implementation of CTC-CRF for Tensorflow.
☆12Jul 5, 2021Updated 5 years ago
MiuLab / Lattice-Transformer-SLU
View on GitHub
Source code for ASRU 2019 paper "Adapting Pretrained Transformer to Lattices for Spoken Language Understanding"
☆10Jul 8, 2020Updated 6 years ago
awslabs / speech-representations
View on GitHub
Code for DeCoAR (ICASSP 2020) and BERTphone (Odyssey 2020)
☆104Nov 26, 2022Updated 3 years ago
Ydkwim / CTAL
View on GitHub
Pre-training Cross-modal Transformer for Audio-and-Language Representations
☆39Apr 20, 2021Updated 5 years ago
xuchennlp / S2T
View on GitHub
The project for speech translation
☆12Sep 28, 2023Updated 2 years ago
monatis / asr-annotation-bot
View on GitHub
Simple Telegram bot to annotate and varify automatic speech recognition datasets
☆12Mar 30, 2021Updated 5 years ago
ga642381 / RobustVC
View on GitHub
**ICASSP 2022** 《Toward Degradation-Robust Voice Conversion》Using speech enhancement and end-to-end denoising training to improve degrada…
☆24Sep 27, 2022Updated 3 years ago
shahrukhx01 / bert-probe
View on GitHub
BERT Probe: A python package for probing attention based robustness to character and word based adversarial evaluation. Also, with recipe…
☆18Jun 24, 2022Updated 4 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
vBaiCai / python-pesq
View on GitHub
A python package for calculating the PESQ.
☆410Jul 16, 2025Updated last year
ssprl / Real-time-Blind-source-separation-using-IVA
View on GitHub
☆16Apr 24, 2021Updated 5 years ago
cyhuang-tw / robust-vc
View on GitHub
☆11May 7, 2022Updated 4 years ago
eloimoliner / audio-inpainting-diffusion
View on GitHub
☆74Apr 4, 2024Updated 2 years ago
dcaulley / av_diarization
View on GitHub
AudioVisual Diarization - Supervised and Unsupervised
☆15Nov 22, 2022Updated 3 years ago
iamanigeeit / present
View on GitHub
☆14Aug 19, 2024Updated last year
pariajm / e2e-asr-and-disfluency-removal-evaluator
View on GitHub
A new metric for evaluating end-to-end speech recognition and disfluency removal systems
☆19Mar 7, 2021Updated 5 years ago
csukuangfj / icefall
View on GitHub
☆11Jul 16, 2026Updated last week
fgnt / paderbox
View on GitHub
Paderbox: A collection of utilities for audio / speech processing
☆43Jul 21, 2025Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
atosystem / SpeechCLIP
View on GitHub
SpeechCLIP: Integrating Speech with Pre-Trained Vision and Language Model, Accepted to IEEE SLT 2022
☆120Nov 25, 2022Updated 3 years ago
Quint-e / musicnn_keras
View on GitHub
Keras implementation of musicnn, a set of pre-trained deep convolutional neural networks for music audio tagging
☆27May 17, 2021Updated 5 years ago
NeuroWave-ai / CUCVAE-TTS
View on GitHub
☆25Mar 12, 2022Updated 4 years ago
popcornell / OSDC
View on GitHub
☆18Jan 26, 2021Updated 5 years ago
yuhangear / wenet-android
View on GitHub
☆13Oct 27, 2021Updated 4 years ago
popcornell / SparseLibriMix
View on GitHub
☆73Feb 15, 2021Updated 5 years ago
ga642381 / SpeechPrompt
View on GitHub
**Interspeech 2022** 《SpeechPrompt: An Exploration of Prompt Tuning on Generative Spoken Language Model for Speech Processing Tasks》Speec…
☆102Apr 10, 2025Updated last year