dayanavivolab/s3prl

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/dayanavivolab/s3prl)

dayanavivolab / s3prl

Self-Supervised Speech Pre-training and Representation Learning Toolkit.

☆10

Alternatives and similar repositories for s3prl

Users that are interested in s3prl are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

bagustris / s3prl-ser
View on GitHub
S3PRL for Speech Emotion Recognition (see s3prl > downstream)
☆15Feb 28, 2026Updated 5 months ago
FlorinAndrei / misc
View on GitHub
a catch-all repo
☆11Dec 28, 2023Updated 2 years ago
jasonppy / FaST-VGS-Family
View on GitHub
Transformer-based visually grounded speech models
☆19Sep 22, 2022Updated 3 years ago
MontrealCorpusTools / kalpy
View on GitHub
Pybind11 bindings for Kaldi
☆15Jul 11, 2026Updated 2 weeks ago
uasolo / FDA-DH
View on GitHub
R Code recipes for Functional Data Analysis for phonetic analysis.
☆13Jul 31, 2024Updated last year
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
JazminVidal / gop-ft
View on GitHub
Transfer learning approach to pronunciation scoring
☆12Jan 17, 2024Updated 2 years ago
frank613 / CTC-based-GOP
View on GitHub
This repo related to the paper "A Framework for Phoneme-Level Pronunciation Assessment Using CTC" for INTERSPEECH2024
☆41Feb 5, 2026Updated 5 months ago
zelaki / DisfluentFA
View on GitHub
A Weakly Supervised Forced Alignment for disluent speech
☆15Nov 12, 2023Updated 2 years ago
al1563 / ADprediction_code
View on GitHub
code companion to manuscript
☆14Feb 21, 2024Updated 2 years ago
vectominist / MiniASR
View on GitHub
A mini, simple, and fast end-to-end automatic speech recognition toolkit.
☆53Dec 6, 2022Updated 3 years ago
colinator / timit_utils
View on GitHub
Python/numpy/pandas convenience wrapper for the TIMIT database.
☆11Nov 26, 2018Updated 7 years ago
Kovah / Taboo-Data
View on GitHub
A data set for Taboo games. Plain JSON files which contain the keyword and some buzzwords like in the original Taboo game
☆14Oct 24, 2023Updated 2 years ago
realyinchen / pytorch-deep-learning
View on GitHub
☆16Jun 13, 2024Updated 2 years ago
EricWilbanks / faseAlign
View on GitHub
Command line tool for forced-alignment of Spanish speech data
☆13Dec 31, 2025Updated 6 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
crazycloud / mispronunciation-detection-diagnosis-wav2vec2-and-llm
View on GitHub
Mispronunciation Detection using a pretrained and finetuned wav2vec2 model for phoneme recognition and diagnosis and feedback using large…
☆59May 6, 2024Updated 2 years ago
MasonPhonLab / MAPS
View on GitHub
Mason-Alberta Phonetic Segmenter
☆15Feb 24, 2026Updated 5 months ago
Yougnway / MultimodalADRecognition
View on GitHub
The Pytorch implementation of paper Multimodal fusion for alzheimer's disease recognition
☆17Aug 23, 2022Updated 3 years ago
nii-yamagishilab / speaker_sex_attribute_privacy
View on GitHub
Project for HIDING SPEAKER’S SEX IN SPEECH USING ZERO-EVIDENCE SPEAKER REPRESENTATION IN AN ANALYSIS/SYNTHESIS PIPELINE
☆15Nov 30, 2022Updated 3 years ago
Lion-shine / Segment-Membranes-and-Nuclei-from-Histopathological-Images-via-Nuclei-Point-level-Supervision
View on GitHub
☆12Jul 3, 2023Updated 3 years ago
GasserElbanna / serab-byols
View on GitHub
(Hybrid) BYOL-S feature extractor using serab-byols package in pytorch.
☆27Apr 20, 2024Updated 2 years ago
muxin-wei / Rep-MedSAM
View on GitHub
Top 3 solution for CVPR24 SEGMENT ANYTHING IN MEDICAL IMAGES ON LAPTOP Challenge
☆11Apr 8, 2025Updated last year
iiscleap / ZEST
View on GitHub
Zero-Shot Emotion Style Transfer
☆49Apr 23, 2025Updated last year
cyfer0618 / kaldi-pytorch-rnnlm
View on GitHub
Enable RNNLM lattice rescoring with Pytorch [kaldi]
☆12Jun 5, 2020Updated 6 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
YooSungHyun / attention-time-forecast
View on GitHub
attention으로 시계열 예측은 할 수 없을까
☆10Apr 30, 2021Updated 5 years ago
skakouros / s3prl_attentive_correlation
View on GitHub
Self-Supervised Speech/Sound Pre-training and Representation Learning Toolkit
☆13Nov 18, 2022Updated 3 years ago
arpit2412 / InstanceGM
View on GitHub
Instance-Dependent Noisy Label Learning via Graphical Modelling (WACV 2023 Round 1)
☆13Jul 30, 2023Updated 2 years ago
ylsung / lightning-semi-supervised-learning
View on GitHub
Implementation of semi-supervised learning using PyTorch Lightning
☆14Jul 25, 2024Updated 2 years ago
Hannes1 / react-native-wenet
View on GitHub
Wenet speech to text for react native
☆10Nov 1, 2022Updated 3 years ago
Labbeti / SSLH
View on GitHub
Deep Semi-Supervised Learning with Holistic methods for audio classification.
☆11Dec 14, 2024Updated last year
nervjack2 / Speech2Unit
View on GitHub
☆13Sep 25, 2024Updated last year
awni / future_speech
View on GitHub
The History of Speech Recognition to the Year 2030
☆13Aug 14, 2021Updated 4 years ago
vocaliodmiku / wav2vec2mdd-Text
View on GitHub
☆19Jun 28, 2022Updated 4 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
hectorcarrion / FEDD
View on GitHub
Data & Code for FEDD published @ MICCAI 23
☆12Oct 11, 2023Updated 2 years ago
stoneMo / OneAVM
View on GitHub
Official Codebase of "A Unified Audio-Visual Learning Framework for Localization, Separation, and Recognition" (ICML 2023)
☆12Jun 1, 2023Updated 3 years ago
YeongHyeon / DGM-TF
View on GitHub
TensorFlow implementation of Disentangled Generative Model (DGM) with MNIST dataset.
☆12Nov 24, 2020Updated 5 years ago
vuno-bmkim / dicom
View on GitHub
DICOM 공부 내용 정리
☆10Mar 20, 2019Updated 7 years ago
akashsara / fusion-dance
View on GitHub
Pixel VQ-VAEs for Improved Pixel Art Representation
☆17Feb 11, 2023Updated 3 years ago
groupmm / libsoni
View on GitHub
A Python Toolbox for Sonifying Music Annotations and Feature Representations
☆26Mar 24, 2025Updated last year
AusterweilLab / snafu-py
View on GitHub
Library for analyzing semantic fluency data and estimating semantic networks
☆25Jan 12, 2026Updated 6 months ago