s3prl / s3prlLinks

Self-Supervised Speech Pre-training and Representation Learning Toolkit

☆2,526

Alternatives and similar repositories for s3prl

Users that are interested in s3prl are comparing it to the libraries listed below

Sorting:

asteroid-team / asteroid
The PyTorch-based audio source separation toolkit for researchers
☆2,535Updated 4 months ago
sooftware / conformer
[Unofficial] PyTorch implementation of "Conformer: Convolution-augmented Transformer for Speech Recognition" (INTERSPEECH 2020)
☆1,100Updated last month
kan-bayashi / ParallelWaveGAN
Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch
☆1,637Updated last year
Alexander-H-Liu / End-to-end-ASR-Pytorch
This is an open source project (formerly named Listen, Attend and Spell - PyTorch Implementation) for end-to-end ASR implemented with Pyt…
☆1,210Updated 5 years ago
wenet-e2e / speech-synthesis-paper
List of speech synthesis papers.
☆1,063Updated 2 years ago
iver56 / audiomentations
A Python library for audio data augmentation. Useful for making audio ML models work well in the real world, not just in the lab.
☆2,230Updated last month
coqui-ai / open-speech-corpora
💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
☆1,384Updated last year
iver56 / torch-audiomentations
Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.
☆1,132Updated 2 months ago
clovaai / voxceleb_trainer
In defence of metric learning for speaker recognition
☆1,161Updated last year
lhotse-speech / lhotse
Tools for handling multimodal data in machine learning projects.
☆1,109Updated last week
k2-fsa / k2
FSA/FST algorithms, differentiable, with PyTorch compatibility.
☆1,304Updated 2 months ago
DmitryRyumin / INTERSPEECH-2023-24-Papers
INTERSPEECH 2023-2024 Papers: A complete collection of influential and exciting research papers from the INTERSPEECH 2023-24 conference. …
☆687Updated last year
mravanelli / SincNet
SincNet is a neural architecture for efficiently processing raw audio samples.
☆1,228Updated 4 years ago
jik876 / hifi-gan
HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis
☆2,309Updated last year
k2-fsa / icefall
☆1,359Updated 2 months ago
JusperLee / Speech-Separation-Paper-Tutorial
A must-read paper for speech separation based on neural networks
☆904Updated 5 months ago
microsoft / DNS-Challenge
This repo contains the scripts, models, and required files for the Deep Noise Suppression (DNS) Challenge.
☆1,365Updated last year
facebookresearch / denoiser
Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020)We provide a PyTorch implementation of the paper Real Time Speech E…
☆1,878Updated 2 years ago
YuanGongND / ast
Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".
☆1,419Updated 2 years ago
nanahou / Awesome-Speech-Enhancement
A tutorial for Speech Enhancement researchers and practitioners. The purpose of this repo is to organize the world’s resources for speech…
☆815Updated 5 years ago
qiuqiangkong / audioset_tagging_cnn
☆1,661Updated last year
wq2012 / awesome-diarization
A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
☆1,844Updated 6 months ago
auspicious3000 / autovc
AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss
☆1,090Updated last year
SpeechColab / GigaSpeech
Large, modern dataset for speech recognition
☆718Updated last year
kaituoxu / Speech-Transformer
A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.
☆809Updated 2 years ago
gabrielmittag / NISQA
NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment
☆911Updated last year
WenzheLiu-Speech / awesome-speech-enhancement
speech enhancement\speech seperation\sound source localization
☆1,222Updated 2 years ago
jim-schwoebel / voice_datasets
🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).
☆2,126Updated last year
descriptinc / melgan-neurips
GAN-based Mel-Spectrogram Inversion Network for Text-to-Speech Synthesis
☆1,035Updated 2 years ago
DemisEom / SpecAugment
A Implementation of SpecAugment with Tensorflow & Pytorch, introduced by Google Brain
☆656Updated 3 years ago