nikvaessen/disjoint-mtl

nikvaessen / disjoint-mtl

Research code for "Towards multi-task learning of speech and speaker recognition" at https://arxiv.org/pdf/2302.12773.pdf

☆12

Alternatives and similar repositories for disjoint-mtl

Users who are interested in disjoint-mtl are comparing it to the repositories listed below.

sopankhosla / MedFilter
Code for the EMNLP paper "Improving Detection and Categorization of Task-relevant Utterances through Integration of Discourse Structure a…
☆12 · Nov 23, 2022 · Updated 3 years ago
roman-vygon / BCResNet
Broadcasted Residual Learning for Efficient Keyword Spotting
☆24 · Jul 9, 2021 · Updated 5 years ago
zzmdy520 / x86-assembly
"x86 Assembly Language: From Real Mode to Protected Mode": chapter source code and answers to the review questions
☆13 · Aug 13, 2020 · Updated 5 years ago
SSTC-Challenge / SSTC2024_baseline_system
☆12 · Jun 14, 2024 · Updated 2 years ago
HappyColor / DST
Deformable Speech Transformer (DST)
☆35 · Aug 8, 2024 · Updated last year
wspr-ncsu / robocall-audio-dataset
A dataset of real-world robocall audio recordings
☆15 · Jul 25, 2024 · Updated 2 years ago
keithnoguchi / do-in-action
DO with Terraform and Ansible
☆11 · Jun 5, 2018 · Updated 8 years ago
BriansIDP / RTLM
☆12 · Oct 19, 2020 · Updated 5 years ago
backspacetg / distilXLSR
Models and code for the INTERSPEECH 2023 paper "DistilXLSR: A Light Weight Cross-Lingual Speech Representation Model"
☆13 · Mar 30, 2025 · Updated last year
rsarka34 / RDLINet
RDLINet: A Novel Lightweight Inception Network for Respiratory Disease Classification Using Lung Sounds (IEEE TIM-2024)
☆11 · Mar 24, 2025 · Updated last year
MiukkaZh / MGT
Learning Domain-Invariant Transformation for Speaker Verification.
☆11 · Jun 13, 2023 · Updated 3 years ago
antoniocavalcante / mustache
☆14 · Jun 21, 2022 · Updated 4 years ago
duskmoon314 / THU_EXP
A collection of lab report templates and data processing tools for THU lab courses
☆19 · Dec 15, 2023 · Updated 2 years ago
zds-potato / multilingual-phonetic-sv
☆10 · Dec 22, 2023 · Updated 2 years ago
sadiela / ml-for-audio
☆18 · Feb 11, 2025 · Updated last year
noiseux1523 / NIST-SRE-2019
Score Normalization for NIST 2019 Speaker Recognition Evaluation
☆10 · Nov 8, 2019 · Updated 6 years ago
fgnt / speaker_reassignment
Once more Diarization: Improving meeting transcription systems through segment-level speaker reassignment
☆14 · Feb 5, 2025 · Updated last year
h4nwei / STI-VQA
[TCSVT'22] Official Implementation of STI-VQA
☆12 · Oct 18, 2023 · Updated 2 years ago
vTAD2025-Challenge / vTAD
☆17 · Oct 24, 2025 · Updated 9 months ago
qinxiaoyi / TimeVarying_ASV
☆12 · Oct 17, 2024 · Updated last year
psaylor / spoke
A framework for building speech-enabled websites.
☆10 · Jul 10, 2015 · Updated 11 years ago
wumacms / TaskManager
A task management app built with SwiftUI
☆11 · Nov 8, 2023 · Updated 2 years ago
KDL-umass / saliency_maps
Code for building and experimenting on saliency maps for RL agents.
☆12 · Feb 13, 2020 · Updated 6 years ago
mavceleb / mavceleb_baseline
☆11 · Nov 5, 2025 · Updated 8 months ago
Sreyan88 / Toxicity-Detection-in-Spoken-Utterances
This repository contains the code for the paper: "DeToxy: A Large-Scale Multimodal Dataset for Toxicity Classification in Spoken Utteranc…
☆21 · Oct 13, 2022 · Updated 3 years ago
Genius1237 / TyDiP
TyDiP Multilingual Politeness dataset and code
☆12 · Oct 15, 2023 · Updated 2 years ago
AdaLogics / paper-analyser
☆25 · May 18, 2021 · Updated 5 years ago
kaen2891 / adversarial_fine-tuning_using_generated_respiratory_sound
(NeurIPS 2023 Workshop on DGM4H) Official Implementation of "Adversarial Fine-tuning using Generated Respiratory Sound to Address Class I…
☆19 · Dec 5, 2024 · Updated last year
mmmmayi / ExPO
Official implementation of the paper "ExPO: Explainable Phonetic Trait-Oriented Network for Speaker Verification"
☆15 · Mar 14, 2025 · Updated last year
nku-zhichengzhang / MART
[CVPR 2024] This is the official implementation of "MART: Masked Affective RepresenTation Learning via Masked Temporal Distribution Disti…
☆22 · Jun 14, 2025 · Updated last year
dojeon-ai / SimTPR
Code for the paper "On the Importance of Feature Decorrelation for Unsupervised Representation Learning for RL" (ICML 2023)
☆12 · Jun 13, 2023 · Updated 3 years ago
amazon-science / multiatis
Data and code for the paper "End-to-End Slot Alignment and Recognition for Cross-Lingual NLU" (Accepted to EMNLP 2020)
☆27 · Jan 13, 2022 · Updated 4 years ago
BoyuanChen / boombox
Code release for paper: The Boombox: Visual Reconstruction from Acoustic Vibrations
☆15 · May 18, 2021 · Updated 5 years ago
Bartelds / ctc-dro
Code associated with the paper: CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition.
☆17 · May 16, 2025 · Updated last year
rithiksachdev / PostASR-Correction-SLT2024
☆18 · Jul 22, 2024 · Updated 2 years ago
COINS-SS21 / moody
Moody is a web application allowing the host of online meetings (e.g. via Zoom, Microsoft Teams or Google Meet) to collect real-time feed…
☆21 · Aug 5, 2024 · Updated last year
DTaoo / DMC
Code for Deep Multimodal Clustering for Unsupervised Audiovisual Learning (CVPR2019)
☆15 · May 27, 2020 · Updated 6 years ago
Saurabhbhati / DASS
☆12 · Apr 26, 2025 · Updated last year
kaistmm / voxceleb-disentangler
[INTERSPEECH 2024] Official pytorch code for the paper "Disentangled Representation Learning for Environment-agnostic Speaker Recognition…
☆18 · Jul 23, 2024 · Updated 2 years ago