K-STMLab/SSL4PR

This repository contains the code for the paper "Exploiting Foundation Models and Speech Enhancement for Parkinson's Disease Detection from Speech in Real-World Operative Conditions" submitted to the INTERSPEECH 2024 conference.

☆12

Alternatives and similar repositories for SSL4PR

Users who are interested in SSL4PR are comparing it to the repositories listed below.

Annafavaro / PARKCELEB
☆11 · Jun 13, 2026 · Updated last month
lokendarjangid / gemini-chat
Gemini Chat-Bot is a full-fledged conversational bot developed using Python, HTML, CSS, JavaScript, and Flask.
☆10 · Apr 26, 2024 · Updated 2 years ago
gallipoligiuseppe / TST-CycleGAN
This repository contains the code for the paper "Self-supervised Text Style Transfer using Cycle-Consistent Adversarial Networks".
☆11 · Dec 2, 2024 · Updated last year
david-gimeno / tailored-avsr
Official source code for the paper "Tailored Design of Audio-Visual Speech Recognition Models using Branchformers"
☆14 · Feb 24, 2025 · Updated last year
cosmaadrian / psymo
Repository for the WACV 2024 paper "PsyMo: A Dataset for Estimating Self-Reported Psychological Traits from Gait"
☆14 · Feb 22, 2024 · Updated 2 years ago
hmartelb / avlit
Official source code of the INTERSPEECH 2023 paper: "Audio-Visual Speech Separation in Noisy Environments with a Lightweight Iterative Mo…
☆20 · Sep 1, 2023 · Updated 2 years ago
koudounasalkis / AI4Voice
This repo contains the code for "Voice Disorder Analysis: A Transformer-based Approach", accepted at Interspeech 2024
☆15 · Jun 11, 2024 · Updated 2 years ago
JeongHun0716 / VoxLRS-SA
This repository contains the speaker labeled information of VoxCeleb2 and LRS3 audio-visual datasets. (AAAI 2025)
☆13 · Sep 6, 2024 · Updated last year
eipm / bridge2ai-redcap
Bridge2AI Voice | REDCap
☆16 · Jul 14, 2026 · Updated last week
hlt-mt / Speech-MASSIVE
Speech-MASSIVE is a multilingual Spoken Language Understanding (SLU) dataset comprising the speech counterpart for a portion of the MASSI…
☆25 · Oct 8, 2025 · Updated 9 months ago
ALM-LAB / PACE
PACE (Podcast AI for Chapters and Episodes) is a semantic search engine that helps you find the information you need, inter- and intra-po…
☆17 · Dec 11, 2022 · Updated 3 years ago
eleonorapoeta / benchmarking-KAN
This repository contains the official implementation of "A Benchmarking Study of Kolmogorov-Arnold Networks on Tabular Data" (under revie…
☆17 · Jul 10, 2024 · Updated 2 years ago
Kuray107 / S4ND-U-Net_speech_enhancement
☆33 · May 17, 2024 · Updated 2 years ago
MIND-Lab / SemEval2022-Task-5-Multimedia-Automatic-Misogyny-Identification-MAMI-
SemEval 2022 Task 5: Multimedia Automatic Misogyny Identification - baseline models and dataset
☆15 · Nov 22, 2022 · Updated 3 years ago
koudounasalkis / Audio-Speech-Tutorial
This repository contains a short introduction on the topic of audio and speech processing -- from basics to applications.
☆19 · Dec 20, 2023 · Updated 2 years ago
joactr / AnnoTheia
AnnoTheia is a data annotation toolkit that identifies when a person speaks in a scene and transcribes their speech, also offering flexib…
☆27 · Jul 26, 2024 · Updated last year
PNNL-Comp-Mass-Spec / proteomics-data-analysis-tutorial
A comprehensive tutorial for proteomics data analysis in R that utilizes packages developed by researchers at PNNL and from Bioconductor.
☆11 · May 28, 2022 · Updated 4 years ago
SHAHFAISAL80 / Crowd-localization-and-counting
Accurately locating each head's position in the crowd scenes is a crucial task in the field of crowd analysis. However, traditional densi…
☆21 · Mar 16, 2024 · Updated 2 years ago
chorowski-lab / CPC_audio
An implementation of the Contrast Predictive Coding (CPC) method to train audio features in an unsupervised fashion.
☆10 · Feb 22, 2022 · Updated 4 years ago
umbertocappellazzo / Llama-AVSR
Official Pytorch implementation of "Large Language Models are Strong Audio-Visual Speech Recognition Learners" [ICASSP 2025] and "Mitigat…
☆64 · Jan 18, 2026 · Updated 6 months ago
RefDawn-XD / TEFDTA
☆15 · Jun 4, 2024 · Updated 2 years ago
vivraj17 / Detection-Of-Parkinson-s-Disesase-Using-Voice-Impairments-With-ML-and-LSTM
Several studies have been carried out to analyse Parkinson’s disease using speech impairments. Various tools and techniques have been use…
☆12 · Apr 1, 2019 · Updated 7 years ago
Fraunhofer-AISEC / towards-resistant-audio-adversarial-examples
Generation tool for offset-resistant audio adversarial examples against Deepspeech
☆10 · Oct 5, 2020 · Updated 5 years ago
wiebket / bt4vt
Bias Tests for Voice Technologies (bt4vt)
☆11 · Jun 16, 2024 · Updated 2 years ago
ttslr / M2S-ADD
[InterSpeech'2023] "Betray Oneself: A Novel Audio DeepFake Detection Model via Mono-to-Stereo Conversion"
☆14 · Mar 14, 2024 · Updated 2 years ago
rahul-t-p / ASVspoof-2019
☆10 · Oct 25, 2019 · Updated 6 years ago
AI-secure / Characterizing-Audio-Adversarial-Examples-using-Temporal-Dependency
ICLR 2019 Paper, "Characterizing Audio Adversarial Examples using Temporal Dependency".
☆11 · Apr 3, 2019 · Updated 7 years ago
yangdongchao / Target-sound-event-detection
The source code for target sound detection
☆15 · Feb 26, 2022 · Updated 4 years ago
Honee-W / CPTNN
unofficial implementation of "CPTNN: CROSS-PARALLEL TRANSFORMER NEURAL NETWORK FOR TIME-DOMAIN SPEECH ENHANCEMENT"
☆15 · Nov 14, 2023 · Updated 2 years ago
AONE-NLP / DiFiNet
[ACL 2024] DiFiNet: Boundary-Aware Semantic Differentiation and Filtration Network for Nested Named Entity Recognition
☆19 · Oct 2, 2024 · Updated last year
imadtoubal / Parkinson-s-Disease-Classification-from-Speech-Data
Parkinson’s Disease Classification from Speech Data using multiple Machine Learning approaches. This was implemented using scikit-learn P…
☆14 · Feb 2, 2020 · Updated 6 years ago
ishine / ContextNet
Tensorflow2 based implementation of ContextNet, an improved convolutional rnn-transducer-based architecture for end-to-end speech recogni…
☆18 · Oct 19, 2020 · Updated 5 years ago
Oshlack / ALLSorts
ALLSorts is a B-Cell Acute Lymphoblastic Leukemia (B-ALL) subtype classifier. From gene expression counts to over 18 subtypes.
☆18 · Jul 30, 2025 · Updated 11 months ago
mmmmayi / ExPO
official implementation of paper ExPO: Explainable Phonetic Trait-Oriented Network for Speaker Verification
☆14 · Mar 14, 2025 · Updated last year
hongfeixue / StutteringSpeechChallenge
SLT 2024 Mandarin Stuttering Event Detection and Automatic Speech Recognition Challenge
☆12 · Jun 11, 2024 · Updated 2 years ago
seorim0 / ResUNet-LC
2D residual U-Net (ResUNet) and a lead combiner (LC) for 12-lead ECG Abnormality Classification
☆15 · Jan 4, 2024 · Updated 2 years ago
OSU-slatelab / LibriStutter
A recipe for disfluency detection on the LibriStutter dataset using SpeechBrain
☆11 · Mar 13, 2021 · Updated 5 years ago
mozturan / AutonomousDrive2D-DRL
Autonomous Driving W/ Deep Reinforcement Learning in Lane Keeping - DDQN and SAC with kinematics/birdview-images
☆13 · Mar 24, 2026 · Updated 3 months ago
wpiszlogin / driver_critic
Solution for CarRacing-v0 environment from OpenAI Gym. It uses the Deep Deterministic Policy Gradient algorithm.
☆12 · Nov 18, 2022 · Updated 3 years ago