s3prl / s3prl
Self-Supervised Speech Pre-training and Representation Learning Toolkit
☆2,432 Updated last month
Alternatives and similar repositories for s3prl
Users who are interested in s3prl are comparing it to the repositories listed below:
- [Unofficial] PyTorch implementation of "Conformer: Convolution-augmented Transformer for Speech Recognition" (INTERSPEECH 2020) ☆1,045 Updated last year
- The PyTorch-based audio source separation toolkit for researchers ☆2,414 Updated 6 months ago
- This is an open source project (formerly named Listen, Attend and Spell - PyTorch Implementation) for end-to-end ASR implemented with Pyt… ☆1,203 Updated 4 years ago
- Tools for handling multimodal data in machine learning projects. ☆1,037 Updated last month
- Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch ☆1,609 Updated last year
- In defence of metric learning for speaker recognition ☆1,113 Updated last year
- List of speech synthesis papers. ☆1,052 Updated last year
- ☆1,160 Updated this week
- Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer". ☆1,312 Updated 2 years ago
- 🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets). ☆1,966 Updated last year
- FSA/FST algorithms, differentiable, with PyTorch compatibility. ☆1,219 Updated 2 weeks ago
- A Python library for audio data augmentation. Useful for making audio ML models work well in the real world, not just in the lab. ☆2,082 Updated last week
- Fast audio data augmentation in PyTorch. Inspired by audiomentations. Useful for deep learning. ☆1,060 Updated 5 months ago
- HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis ☆2,178 Updated 11 months ago
- SincNet is a neural architecture for efficiently processing raw audio samples. ☆1,186 Updated 4 years ago
- This repo contains the scripts, models, and required files for the Deep Noise Suppression (DNS) Challenge. ☆1,249 Updated 11 months ago
- Real Time Speech Enhancement in the Waveform Domain (Interspeech 2020). We provide a PyTorch implementation of the paper Real Time Speech E… ☆1,799 Updated 2 years ago
- A must-read paper for speech separation based on neural networks ☆785 Updated last month
- The Implementation of FastSpeech based on pytorch. ☆873 Updated 2 years ago
- 💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies ☆1,341 Updated last year
- pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch,… ☆2,387 Updated 3 years ago
- A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources. ☆1,769 Updated 8 months ago
- TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2. Supported languages that can use characters or subw… ☆984 Updated last month
- An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech" ☆2,055 Updated last year
- Large, modern dataset for speech recognition ☆678 Updated last year
- ☆1,508 Updated 11 months ago
- INTERSPEECH 2023-2024 Papers: A complete collection of influential and exciting research papers from the INTERSPEECH 2023-24 conference. … ☆676 Updated 6 months ago
- Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit ☆959 Updated last week
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment ☆811 Updated 7 months ago
- AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss ☆1,068 Updated 8 months ago