Research code for "Towards multi-task learning of speech and speaker recognition" at https://arxiv.org/pdf/2302.12773.pdf
☆12Dec 2, 2024Updated last year
Alternatives and similar repositories for disjoint-mtl
Users that are interested in disjoint-mtl are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code for the EMNLP paper "Improving Detection and Categorization of Task-relevant Utterances through Integration of Discourse Structure a…☆12Nov 23, 2022Updated 3 years ago
- Broadcasted Residual Learning for Efficient Keyword Spotting☆23Jul 9, 2021Updated 4 years ago
- x86汇编语言:从实模式到保护模式_章节源码及检测题答案☆13Aug 13, 2020Updated 5 years ago
- ☆11Jun 14, 2024Updated last year
- Official Pytorch implementation of "Large Language Models are Strong Audio-Visual Speech Recognition Learners" [ICASSP 2025] and "Mitigat…☆62Jan 18, 2026Updated 3 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Deformable Speech Transformer (DST)☆35Aug 8, 2024Updated last year
- Learning Domain-Invariant Transformation for Speaker Verification.☆11Jun 13, 2023Updated 2 years ago
- A dataset of real-world robocall audio recordings☆15Jul 25, 2024Updated last year
- ☆12Oct 19, 2020Updated 5 years ago
- ☆14Jun 21, 2022Updated 3 years ago
- DO with Terraform and Ansible☆11Jun 5, 2018Updated 7 years ago
- Models and codes for INTERSPEECH 2023 paper DistilXLSR: A Light Weight Cross-Lingual Speech Representation Model☆13Mar 30, 2025Updated last year
- ☆27Jun 5, 2024Updated last year
- ☆10Dec 22, 2023Updated 2 years ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- ☆18Feb 11, 2025Updated last year
- Once more Diarization: Improving meeting transcription systems through segment-level speaker reassignment☆14Feb 5, 2025Updated last year
- Score Normalization for NIST 2019 Speaker Recognition Evaluation☆10Nov 8, 2019Updated 6 years ago
- THU实验课实验报告模板与数据处理工具整理☆19Dec 15, 2023Updated 2 years ago
- Compute WER and SER for speech recognition evaluation☆27Mar 18, 2026Updated last month
- [TCSVT'22] Official Implementation of STI-VQA☆12Oct 18, 2023Updated 2 years ago
- ☆12Oct 17, 2024Updated last year
- Code for the Interspeech 2023 paper "A Joint Model for Pronunciation Assessment and Mispronunciation Detection and Diagnosis with Multi-t…☆25Nov 9, 2023Updated 2 years ago
- RDLINet: A Novel Lightweight Inception Network for Respiratory Disease Classification Using Lung Sounds (IEEE TIM-2024)☆11Mar 24, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Code for building and experimenting on saliency maps for RL agents.☆12Feb 13, 2020Updated 6 years ago
- This repository contains the code for the paper: "DeToxy: A Large-Scale Multimodal Dataset for Toxicity Classification in Spoken Utteranc…☆20Oct 13, 2022Updated 3 years ago
- ☆11Nov 5, 2025Updated 5 months ago
- 使用SwiftUI开发的任务管理APP☆11Nov 8, 2023Updated 2 years ago
- A framework for building speech-enabled websites.☆10Jul 10, 2015Updated 10 years ago
- TyDiP Multilingual Politeness dataset and code☆12Oct 15, 2023Updated 2 years ago
- official implementation of paper ExPO: Explainable Phonetic Trait-Oriented Network for Speaker Verification☆14Mar 14, 2025Updated last year
- (NeurIPS 2023 Workshop on DGM4H) Official Implementation of "Adversarial Fine-tuning using Generated Respiratory Sound to Address Class I…☆19Dec 5, 2024Updated last year
- ☆24May 18, 2021Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Code for the paper "On the Importance of Feature Decorrelation for Unsupervised Representation Learning for RL" (ICML 2023)☆12Jun 13, 2023Updated 2 years ago
- [CVPR 2024] This is the official implementation of "MART: Masked Affective RepresenTation Learning via Masked Temporal Distribution Disti…☆22Jun 14, 2025Updated 10 months ago
- The official implementation of the method discussed in the paper Improving Spoken Language Identification with Map-Mix(work accepted at I…☆18Feb 17, 2023Updated 3 years ago
- ☆17Jul 22, 2024Updated last year
- ☆16Apr 1, 2026Updated 2 weeks ago
- Data and code for the paper "End-to-End Slot Alignment and Recognition for Cross-Lingual NLU" (Accepted to EMNLP 2020)☆27Jan 13, 2022Updated 4 years ago
- ☆19Dec 29, 2024Updated last year