s3prl / s3prl
Self-Supervised Speech Pre-training and Representation Learning Toolkit
☆2,526Updated 7 months ago
Alternatives and similar repositories for s3prl
Users who are interested in s3prl compare it to the libraries listed below.
- The PyTorch-based audio source separation toolkit for researchers☆2,535Updated 4 months ago
- [Unofficial] PyTorch implementation of "Conformer: Convolution-augmented Transformer for Speech Recognition" (INTERSPEECH 2020)☆1,100Updated last month
- Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch☆1,637Updated last year
- This is an open source project (formerly named Listen, Attend and Spell - PyTorch Implementation) for end-to-end ASR implemented with Pyt…☆1,210Updated 5 years ago
- List of speech synthesis papers.☆1,063Updated 2 years ago
- A Python library for audio data augmentation. Useful for making audio ML models work well in the real world, not just in the lab.☆2,230Updated last month
- 💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies☆1,384Updated last year
- Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning.☆1,132Updated 2 months ago
- In defence of metric learning for speaker recognition☆1,161Updated last year
- Tools for handling multimodal data in machine learning projects.☆1,109Updated last week
- FSA/FST algorithms, differentiable, with PyTorch compatibility.☆1,304Updated 2 months ago
- INTERSPEECH 2023-2024 Papers: A complete collection of influential and exciting research papers from the INTERSPEECH 2023-24 conference. …☆687Updated last year
- SincNet is a neural architecture for efficiently processing raw audio samples.☆1,228Updated 4 years ago
- HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis☆2,309Updated last year
- ☆1,359Updated 2 months ago
- A must-read paper for speech separation based on neural networks☆904Updated 5 months ago
- This repo contains the scripts, models, and required files for the Deep Noise Suppression (DNS) Challenge.☆1,365Updated last year
- Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020). We provide a PyTorch implementation of the paper Real Time Speech E…☆1,878Updated 2 years ago
- Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".☆1,419Updated 2 years ago
- A tutorial for Speech Enhancement researchers and practitioners. The purpose of this repo is to organize the world’s resources for speech…☆815Updated 5 years ago
- ☆1,661Updated last year
- A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.☆1,844Updated 6 months ago
- AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss☆1,090Updated last year
- Large, modern dataset for speech recognition☆718Updated last year
- A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.☆809Updated 2 years ago
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment☆911Updated last year
- speech enhancement\speech separation\sound source localization☆1,222Updated 2 years ago
- 🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).☆2,126Updated last year
- GAN-based Mel-Spectrogram Inversion Network for Text-to-Speech Synthesis☆1,035Updated 2 years ago
- An implementation of SpecAugment with TensorFlow & PyTorch, introduced by Google Brain☆656Updated 3 years ago