kaldi cnn-tdnnf baseline
☆13Aug 31, 2021Updated 4 years ago
Alternatives and similar repositories for kaldi-baseline
Users that are interested in kaldi-baseline are comparing it to the libraries listed below
Sorting:
- SChunk-Encoder (Transformer or Conformer) for streaming E2E ASR☆11Oct 21, 2022Updated 3 years ago
- Getting confidences from any end-to-end systems☆11May 24, 2023Updated 2 years ago
- Target speaker automatic speech recognition (TS-ASR)☆12Oct 14, 2023Updated 2 years ago
- Auto-KWS 2021 Challenge 1st place solution.☆11Jul 20, 2021Updated 4 years ago
- The case study and multilingfual performance of ICASSP submission☆24Sep 24, 2022Updated 3 years ago
- End-to-End Keyword Spotting (E2E-KWS) using a character level LSTM☆43Nov 18, 2022Updated 3 years ago
- Code for the paper "FastAdaSP: An Efficient Multitask Inference Framework for Large Speech Language Models". @ EMNLP'24(Oral)☆13Nov 14, 2024Updated last year
- Framework for Detection Evaluation (F4DE) : set of evaluation tools for detection evaluations and for specific NIST-coordinated evaluatio…☆25Jul 6, 2017Updated 8 years ago
- Models and codes for INTERSPEECH 2023 paper DistilXLSR: A Light Weight Cross-Lingual Speech Representation Model☆13Mar 30, 2025Updated 11 months ago
- ☆29Jun 15, 2022Updated 3 years ago
- Keyword Search Recipe for Subword ASR☆30Jul 12, 2019Updated 6 years ago
- ☆13Mar 25, 2021Updated 4 years ago
- E2E system with LF-MMI; word N-gram for Mandarin☆166Apr 29, 2022Updated 3 years ago
- ☆15Aug 25, 2022Updated 3 years ago
- This is a public repository for RATS Channel-A Speech Data, which is a chargeable noisy speech dataset under LDC. Here we release its Log…☆16Oct 22, 2022Updated 3 years ago
- Automatic Speech Recognition (ASR) system for the Samrómur speech corpus using Kaldi☆12Sep 30, 2022Updated 3 years ago
- Light-weight transfer learning framework for on-device speech and audio recognition using pre-trained image convolutional neural networks…☆18Apr 16, 2022Updated 3 years ago
- ☆17Jul 22, 2024Updated last year
- Neural network density models for speech separation.☆20Nov 26, 2020Updated 5 years ago
- ☆15Jul 4, 2024Updated last year
- A wrapper for Audeering's wav2vec-based dimensional speech emotion recognition☆21Aug 9, 2023Updated 2 years ago
- 3M: Multi-loss, Multi-path and Multi-level Neural Networks for speech recognition☆118Jun 22, 2022Updated 3 years ago
- magicspeech competition recipe☆18Jun 29, 2020Updated 5 years ago
- MicRank is a Learning to Rank neural channel selection framework where a DNN is trained to rank microphone channels.☆22Apr 8, 2021Updated 4 years ago
- Python implementation of CTC beam search decoder + agnostic LM scorer☆20Dec 16, 2020Updated 5 years ago
- Scripts for data generation, scoring and data manifest preparation for CHiME-8 DASR task.☆24Feb 25, 2025Updated last year
- Script to perform statistical significance test between ASR hypotheses.☆22Aug 13, 2017Updated 8 years ago
- This repository contains the code for our upcoming paper An Investigation of End-to-End Models for Robust Speech Recognition at ICASSP 20…☆49Dec 25, 2024Updated last year
- ☆22Jun 24, 2024Updated last year
- MSP-Podcast Challenge Baseline Code for Interspeech 2025☆28Dec 4, 2024Updated last year
- ☆50Dec 26, 2020Updated 5 years ago
- [ASRU 2021] Efficient Conformer: Progressive Downsampling and Grouped Attention for Automatic Speech Recognition☆219Jun 22, 2023Updated 2 years ago
- ☆28Nov 15, 2023Updated 2 years ago
- Just another FastSpeech 2 but cleaner code :)☆29Jun 28, 2024Updated last year
- The baseline system for the ICASSP2024 ICMC-ASR Challenge.☆54Dec 6, 2023Updated 2 years ago
- ☆27Aug 31, 2022Updated 3 years ago
- Production first, nn-based on-device signal processing toolkit.☆65May 30, 2023Updated 2 years ago
- ☆27Oct 25, 2024Updated last year
- CMU multilingual speech repository☆30Apr 15, 2022Updated 3 years ago