Coordinate-wise meta-learner for speaker adaptation of ASR models.
☆20Dec 30, 2019Updated 6 years ago
Alternatives and similar repositories for learning_to_adapt
Users that are interested in learning_to_adapt are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Empirical Evaluation of Speaker Adaptation on DNN based Acoustic Model☆13Nov 25, 2019Updated 6 years ago
- VoxCeleb plugin for pyannote.database☆30Aug 4, 2021Updated 4 years ago
- Old language modeling tool that's used in kaldi☆17Apr 20, 2023Updated 2 years ago
- Multiobjective Optimization Training of PLDA for Speaker Verification☆10Jun 14, 2018Updated 7 years ago
- 基于Kaldi的小词汇量汉语语音识别,使用DNN训练☆27Jan 15, 2019Updated 7 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- The dataset used in COVID-DA: Deep Domain Adaptation from Typical Pneumonia to COVID-19☆12Nov 22, 2022Updated 3 years ago
- 텐서플로우 뽀개기 - python 코드를 R로 변경하기☆14Sep 5, 2017Updated 8 years ago
- Extended speech recognition neural network based on Kaldi for reproducible research☆15Aug 28, 2015Updated 10 years ago
- ☆13Jan 8, 2020Updated 6 years ago
- Training neural networks with back-prop, feedback-alignment and direct feedback-alignment☆11Mar 20, 2017Updated 9 years ago
- A Unified Framework for Metric Transfer Learning☆17Oct 28, 2017Updated 8 years ago
- ☆38May 16, 2022Updated 3 years ago
- ☆15Nov 15, 2017Updated 8 years ago
- Grapheme to phoneme converter for Estonian☆14May 27, 2021Updated 4 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Denoising autoencoders for speaker identification on MCE 2018 challenge☆12Nov 8, 2018Updated 7 years ago
- This repository contains prompts & best practices to annotate audio clips with a very high degree of details using Audio-Language-Models☆35Oct 13, 2024Updated last year
- TensorFlow and deep learning without a PhD, translated to Chinese☆17Feb 18, 2017Updated 9 years ago
- Hilbert-Schmidt Independence Criterion☆18Jun 17, 2014Updated 11 years ago
- ☆16Mar 7, 2019Updated 7 years ago
- Open source data for data visualization enthusiasts.☆22Dec 20, 2021Updated 4 years ago
- ☆11Apr 18, 2021Updated 4 years ago
- ☆16Jun 30, 2018Updated 7 years ago
- Source code for 'Transfer Learning for Speech Recognition on a Budget' published at ACL 2017☆46May 30, 2017Updated 8 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Pytorch implementation of Meta-Learned Confidence for Few-shot Learning☆71Jan 24, 2021Updated 5 years ago
- ☆40Jul 19, 2018Updated 7 years ago
- Weakly Supervised CRNN System for Sound Event Detection With Large-scale Unlabeled In-domain Data☆10Oct 31, 2018Updated 7 years ago
- ViSpeR: Multilingual Audio-Visual Speech Recognition☆56Apr 17, 2025Updated 11 months ago
- Autoregressive Predictive Coding: An unsupervised autoregressive model for speech representation learning☆191Jan 29, 2020Updated 6 years ago
- Code release for "Transfer Adversarial Hashing for Hamming Space Retrieval" (AAAI 2018)☆13Jun 15, 2018Updated 7 years ago
- Tensor2tensor experiment with SpecAugment☆46May 13, 2019Updated 6 years ago
- This code implements a basic MLP for speech recognition. The MLP is trained with pytorch, while feature extraction, alignments, and dec…☆40Feb 10, 2018Updated 8 years ago
- ☆14Mar 24, 2023Updated 3 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Problem Agnostic Speech Encoder☆447Jul 6, 2023Updated 2 years ago
- Simple VAD (voice activity detection) algorithm written in C☆14Jan 5, 2026Updated 2 months ago
- style token with tacotron2☆62Jul 6, 2023Updated 2 years ago
- A library for adding punctuation into a text from ASR.☆19May 8, 2023Updated 2 years ago
- Audio-Visual Speech Recognition☆21Jul 7, 2025Updated 8 months ago
- Estimate the number of concurrent speakers from single channel mixtures to crack the "cocktail-party” problem.☆22Mar 4, 2020Updated 6 years ago
- ☆13Mar 5, 2020Updated 6 years ago