Self-Supervised Speech Pre-training and Representation Learning Toolkit.
☆10Feb 29, 2024Updated 2 years ago
Alternatives and similar repositories for s3prl
Users that are interested in s3prl are comparing it to the libraries listed below.
- S3PRL for Speech Emotion Recognition (see s3prl > downstream)☆15Feb 28, 2026Updated last month
- This repo is related to the paper "A Framework for Phoneme-Level Pronunciation Assessment Using CTC" for INTERSPEECH 2024☆38Feb 5, 2026Updated last month
- Transformer-based visually grounded speech models☆19Sep 22, 2022Updated 3 years ago
- a catch-all repo☆11Dec 28, 2023Updated 2 years ago
- Koel Labs innovates open-source speech research, inclusive speech technologies, and real-time pronunciation feedback for language learner…☆18Mar 14, 2026Updated 2 weeks ago
- Pybind11 bindings for Kaldi☆15Feb 1, 2026Updated last month
- Transfer learning approach to pronunciation scoring☆12Jan 17, 2024Updated 2 years ago
- A mini, simple, and fast end-to-end automatic speech recognition toolkit.☆53Dec 6, 2022Updated 3 years ago
- Python/numpy/pandas convenience wrapper for the TIMIT database.☆11Nov 26, 2018Updated 7 years ago
- code companion to manuscript☆14Feb 21, 2024Updated 2 years ago
- A Weakly Supervised Forced Alignment for disfluent speech☆15Nov 12, 2023Updated 2 years ago
- R Code recipes for Functional Data Analysis for phonetic analysis.☆13Jul 31, 2024Updated last year
- ☆16Jun 13, 2024Updated last year
- A data set for Taboo games. Plain JSON files which contain the keyword and some buzzwords like in the original Taboo game☆12Oct 24, 2023Updated 2 years ago
- Command line tool for forced-alignment of Spanish speech data☆13Dec 31, 2025Updated 2 months ago
- Datasets of short-story texts by various Latin American authors. Benchmark datasets for different sentiment analysis libraries in…☆16Sep 8, 2024Updated last year
- Mason-Alberta Phonetic Segmenter☆15Feb 24, 2026Updated last month
- Visual Search in Natural Scenes benchmark☆19Sep 19, 2024Updated last year
- The Pytorch implementation of paper Multimodal fusion for alzheimer's disease recognition☆17Aug 23, 2022Updated 3 years ago
- Zero-Shot Emotion Style Transfer☆49Apr 23, 2025Updated 11 months ago
- (Hybrid) BYOL-S feature extractor using serab-byols package in pytorch.☆27Apr 20, 2024Updated last year
- Project for HIDING SPEAKER’S SEX IN SPEECH USING ZERO-EVIDENCE SPEAKER REPRESENTATION IN AN ANALYSIS/SYNTHESIS PIPELINE☆15Nov 30, 2022Updated 3 years ago
- Top 3 solution for CVPR24 SEGMENT ANYTHING IN MEDICAL IMAGES ON LAPTOP Challenge☆10Apr 8, 2025Updated 11 months ago
- Enable RNNLM lattice rescoring with Pytorch [kaldi]☆12Jun 5, 2020Updated 5 years ago
- ☆11Jul 3, 2023Updated 2 years ago
- Can't time series forecasting be done with attention?☆10Apr 30, 2021Updated 4 years ago
- Instance-Dependent Noisy Label Learning via Graphical Modelling (WACV 2023 Round 1)☆13Jul 30, 2023Updated 2 years ago
- MTalk-Bench: Evaluating Speech-to-Speech Models in Multi-Turn Dialogues via Arena-style and Rubrics Protocols☆18Nov 19, 2025Updated 4 months ago
- Implementation of semi-supervised learning using PyTorch Lightning☆14Jul 25, 2024Updated last year
- Deep Semi-Supervised Learning with Holistic methods for audio classification.☆11Dec 14, 2024Updated last year
- Self-Supervised Speech/Sound Pre-training and Representation Learning Toolkit☆13Nov 18, 2022Updated 3 years ago
- ☆13Sep 25, 2024Updated last year
- Wenet speech to text for react native☆10Nov 1, 2022Updated 3 years ago
- The History of Speech Recognition to the Year 2030☆13Aug 14, 2021Updated 4 years ago
- Data & Code for FEDD published @ MICCAI 23☆12Oct 11, 2023Updated 2 years ago
- ☆19Jun 28, 2022Updated 3 years ago
- TensorFlow implementation of Disentangled Generative Model (DGM) with MNIST dataset.☆12Nov 24, 2020Updated 5 years ago
- Notes from studying DICOM☆10Mar 20, 2019Updated 7 years ago
- libsoni: A Python Toolbox for Sonifying Music Annotations and Feature Representations☆27Mar 24, 2025Updated last year