This repository contains the code for the paper "Exploiting Foundation Models and Speech Enhancement for Parkinson's Disease Detection from Speech in Real-World Operative Conditions" submitted to the INTERSPEECH 2024 conference.
☆12 · Dec 19, 2025 · Updated 3 months ago
Alternatives and similar repositories for SSL4PR
Users who are interested in SSL4PR are comparing it to the repositories listed below
- Official source code for the paper "Tailored Design of Audio-Visual Speech Recognition Models using Branchformers" ☆14 · Feb 24, 2025 · Updated last year
- Repository for the WACV 2024 paper "PsyMo: A Dataset for Estimating Self-Reported Psychological Traits from Gait" ☆13 · Feb 22, 2024 · Updated 2 years ago
- Official source code of the INTERSPEECH 2023 paper: "Audio-Visual Speech Separation in Noisy Environments with a Lightweight Iterative Mo… ☆20 · Sep 1, 2023 · Updated 2 years ago
- This repository contains the code for the paper "Self-supervised Text Style Transfer using Cycle-Consistent Adversarial Networks". ☆11 · Dec 2, 2024 · Updated last year
- This repo contains the code for "Voice Disorder Analysis: A Transformer-based Approach", accepted at Interspeech 2024 ☆15 · Jun 11, 2024 · Updated last year
- This repository contains the speaker labeled information of VoxCeleb2 and LRS3 audio-visual datasets. (AAAI 2025) ☆13 · Sep 6, 2024 · Updated last year
- Gemini Chat-Bot is a full-fledged conversational bot developed using Python, HTML, CSS, JavaScript, and Flask. ☆10 · Apr 26, 2024 · Updated last year
- Speech-MASSIVE is a multilingual Spoken Language Understanding (SLU) dataset comprising the speech counterpart for a portion of the MASSI… ☆24 · Oct 8, 2025 · Updated 5 months ago
- PACE (Podcast AI for Chapters and Episodes) is a semantic search engine that helps you find the information you need, inter- and intra-po… ☆17 · Dec 11, 2022 · Updated 3 years ago
- This repository contains the official implementation of "A Benchmarking Study of Kolmogorov-Arnold Networks on Tabular Data" (under revie… ☆17 · Jul 10, 2024 · Updated last year
- ☆32 · May 17, 2024 · Updated last year
- This repository contains a short introduction on the topic of audio and speech processing -- from basics to applications. ☆21 · Dec 20, 2023 · Updated 2 years ago
- SChunk-Encoder (Transformer or Conformer) for streaming E2E ASR ☆11 · Oct 21, 2022 · Updated 3 years ago
- An implementation of the Contrast Predictive Coding (CPC) method to train audio features in an unsupervised fashion. ☆10 · Feb 22, 2022 · Updated 4 years ago
- Generation tool for offset-resistant audio adversarial examples against Deepspeech ☆10 · Oct 5, 2020 · Updated 5 years ago
- SemEval 2022 Task 5: Multimedia Automatic Misogyny Identification - baseline models and dataset ☆13 · Nov 22, 2022 · Updated 3 years ago
- Bias Tests for Voice Technologies (bt4vt) ☆11 · Jun 16, 2024 · Updated last year
- ☆15 · Jun 4, 2024 · Updated last year
- Several studies have been carried out to analyse Parkinson’s disease using speech impairments. Various tools and techniques have been use… ☆12 · Apr 1, 2019 · Updated 6 years ago
- A comprehensive tutorial for proteomics data analysis in R that utilizes packages developed by researchers at PNNL and from Bioconductor. ☆10 · May 28, 2022 · Updated 3 years ago
- [InterSpeech'2023] "Betray Oneself: A Novel Audio DeepFake Detection Model via Mono-to-Stereo Conversion" ☆13 · Mar 14, 2024 · Updated 2 years ago
- ☆10 · Oct 25, 2019 · Updated 6 years ago
- [ACL 2024] DiFiNet: Boundary-Aware Semantic Differentiation and Filtration Network for Nested Named Entity Recognition ☆16 · Oct 2, 2024 · Updated last year
- ICLR 2019 Paper, "Characterizing Audio Adversarial Examples using Temporal Dependency". ☆12 · Apr 3, 2019 · Updated 6 years ago
- Accurately locating each head's position in the crowd scenes is a crucial task in the field of crowd analysis. However, traditional densi… ☆21 · Mar 16, 2024 · Updated 2 years ago
- unofficial implementation of "CPTNN: CROSS-PARALLEL TRANSFORMER NEURAL NETWORK FOR TIME-DOMAIN SPEECH ENHANCEMENT" ☆15 · Nov 14, 2023 · Updated 2 years ago
- AnnoTheia is a data annotation toolkit that identifies when a person speaks in a scene and transcribes their speech, also offering flexib… ☆27 · Jul 26, 2024 · Updated last year
- The source code for target sound detection ☆15 · Feb 26, 2022 · Updated 4 years ago
- Repository for the LLM course ☆31 · Jan 4, 2026 · Updated 2 months ago
- [BIBM2024 ML4BMI] Official PyTorch implementation of the paper "LV-UNet: A Lightweight and Vanilla Model for Medical Image Segmentation" ☆18 · Oct 31, 2025 · Updated 4 months ago
- ALLSorts is a B-Cell Acute Lymphoblastic Leukemia (B-ALL) subtype classifier. From gene expression counts to over 18 subtypes. ☆17 · Jul 30, 2025 · Updated 7 months ago
- official implementation of paper ExPO: Explainable Phonetic Trait-Oriented Network for Speaker Verification ☆14 · Mar 14, 2025 · Updated last year
- Parkinson’s Disease Classification from Speech Data using multiple Machine Learning approaches. This was implemented using scikit-learn P… ☆14 · Feb 2, 2020 · Updated 6 years ago
- Tensorflow2 based implementation of ContextNet, an improved convolutional rnn-transducer-based architecture for end-to-end speech recogni… ☆18 · Oct 19, 2020 · Updated 5 years ago
- 2D residual U-Net (ResUNet) and a lead combiner (LC) for 12-lead ECG Abnormality Classification ☆15 · Jan 4, 2024 · Updated 2 years ago
- A recipe for disfluency detection on the LibriStutter dataset using SpeechBrain ☆11 · Mar 13, 2021 · Updated 5 years ago
- SLT 2024 Mandarin Stuttering Event Detection and Automatic Speech Recognition Challenge ☆12 · Jun 11, 2024 · Updated last year
- Autonomous Driving W/ Deep Reinforcement Learning in Lane Keeping - DDQN and SAC with kinematics/birdview-images ☆13 · Jul 17, 2024 · Updated last year
- Solution for CarRacing-v0 environment from OpenAI Gym. It uses the Deep Deterministic Policy Gradient algorithm. ☆12 · Nov 18, 2022 · Updated 3 years ago