Self-Supervised Speech Pre-training and Representation Learning Toolkit.
☆10Feb 29, 2024Updated 2 years ago
Alternatives and similar repositories for s3prl
Users that are interested in s3prl are comparing it to the libraries listed below
Sorting:
- S3PRL for Speech Emotion Recognition (see s3prl > downstream)☆15Feb 28, 2026Updated last week
- Transformer-based visually grounded speech models☆19Sep 22, 2022Updated 3 years ago
- A mini, simple, and fast end-to-end automatic speech recognition toolkit.☆53Dec 6, 2022Updated 3 years ago
- (Hybrid) BYOL-S feature extractor using serab-byols package in pytorch.☆27Apr 20, 2024Updated last year
- This repo related to the paper "A Framework for Phoneme-Level Pronunciation Assessment Using CTC" for INTERSPEECH2024☆36Feb 5, 2026Updated last month
- Koel Labs innovates open-source speech research, inclusive speech technologies, and real-time pronunciation feedback for language learner…☆18Feb 25, 2026Updated last week
- Self-Supervised Speech/Sound Pre-training and Representation Learning Toolkit☆13Nov 18, 2022Updated 3 years ago
- Directed masked autoencoders☆14Feb 20, 2026Updated 2 weeks ago
- MTalk-Bench: Evaluating Speech-to-Speech Models in Multi-Turn Dialogues via Arena-style and Rubrics Protocols☆17Nov 19, 2025Updated 3 months ago
- Instance-Dependent Noisy Label Learning via Graphical Modelling (WACV 2023 Round 1)☆13Jul 30, 2023Updated 2 years ago
- COLA contrastive pre-training method implemented in PyTorch☆43Jan 27, 2021Updated 5 years ago
- Hed and supporting files for Chinese NNSVS Dataset Creation☆13Oct 14, 2025Updated 4 months ago
- ☆11Jul 3, 2023Updated 2 years ago
- DICOM 공부 내용 정리☆10Mar 20, 2019Updated 6 years ago
- A collection of self-supervised papers in medical imaging.☆40Mar 16, 2021Updated 4 years ago
- attention으로 시계열 예측은 할 수 없을까☆10Apr 30, 2021Updated 4 years ago
- Project for HIDING SPEAKER’S SEX IN SPEECH USING ZERO-EVIDENCE SPEAKER REPRESENTATION IN AN ANALYSIS/SYNTHESIS PIPELINE☆15Nov 30, 2022Updated 3 years ago
- ☆10Oct 20, 2022Updated 3 years ago
- ☆13Sep 25, 2024Updated last year
- Python/numpy/pandas convenience wrapper for the TIMIT database.☆11Nov 26, 2018Updated 7 years ago
- AD-TUNING: An Adaptive CHILD-TUNING Approach to Efficient Hyperparameter Optimization of Child Networks for Speech Processing Tasks in th…☆11Feb 23, 2024Updated 2 years ago
- Wenet speech to text for react native☆10Nov 1, 2022Updated 3 years ago
- [WACV2023] This is the official PyTorch impelementation of our paper "[Rethinking Rotation in Self-Supervised Contrastive Learning: Adapt…☆12Feb 24, 2023Updated 3 years ago
- Deep Semi-Supervised Learning with Holistic methods for audio classification.☆11Dec 14, 2024Updated last year
- ☆11Nov 27, 2022Updated 3 years ago
- Speechflow for emotion recognition related information decomposition☆10Jul 27, 2021Updated 4 years ago
- Tidy handling and navigation of the valuable Student-Life mHealth dataset☆21Apr 22, 2021Updated 4 years ago
- ☆21Sep 27, 2024Updated last year
- Real Time STT model with GPU by Whisper and VAD(Voice Activity Detector) model☆15Jul 15, 2024Updated last year
- Pybind11 bindings for Kaldi☆15Feb 1, 2026Updated last month
- An ASR toolkit with the freedom of topology☆10Dec 18, 2023Updated 2 years ago
- a catch-all repo☆11Dec 28, 2023Updated 2 years ago
- R Code recipes for Functional Data Analysis for phonetic analysis.☆13Jul 31, 2024Updated last year
- Official PyTorch Implementation for CAiD: Context-Aware Instance Discrimination for Self-supervised Learning in Medical Imaging - MIDL 20…☆11Apr 15, 2022Updated 3 years ago
- TensorFlow implementation of Disentangled Generative Model (DGM) with MNIST dataset.☆12Nov 24, 2020Updated 5 years ago
- Official Codebase of "A Unified Audio-Visual Learning Framework for Localization, Separation, and Recognition" (ICML 2023)☆12Jun 1, 2023Updated 2 years ago
- Enable RNNLM lattice rescoring with Pytorch [kaldi]☆12Jun 5, 2020Updated 5 years ago
- Top 3 solution for CVPR24 SEGMENT ANYTHING IN MEDICAL IMAGES ON LAPTOP Challenge☆10Apr 8, 2025Updated 11 months ago
- Visual Search in Natural Scenes benchmark☆19Sep 19, 2024Updated last year