Implementation of CTC alignment-based single step non-autoregressive transformer
☆13Jun 2, 2023Updated 3 years ago
Alternatives and similar repositories for cassnat_asr
Users that are interested in cassnat_asr are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Hpyformer base FunASR☆31Nov 5, 2024Updated last year
- End-to-End Speech Processing Toolkit☆16Jan 20, 2025Updated last year
- A toolkit for Spoken Language Understanding Evaluation (SLUE) benchmark. Refer paper https://arxiv.org/abs/2111.10367 for more details. O…☆66Feb 26, 2024Updated 2 years ago
- 来自于文章Paraformer-v2: An improved non-autoregressive transformer for noise-robust speech recognition☆29Nov 20, 2024Updated last year
- The baseline system for the ICASSP2024 ICMC-ASR Challenge.☆57Dec 6, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆16May 25, 2019Updated 7 years ago
- ☆18Sep 19, 2023Updated 2 years ago
- Computes the MWER (minimum WER) Loss with CTC beam search. Knowledge distillation for CTC loss.☆59Sep 6, 2023Updated 2 years ago
- Data and code related to the ICASSP submission "A comparison of methods for OOV-word recognition"☆17Nov 28, 2021Updated 4 years ago
- ☆67Mar 25, 2022Updated 4 years ago
- ☆10Jun 23, 2023Updated 2 years ago
- PyTorch implementation of LF-MMI for End-to-end ASR☆221Jan 14, 2021Updated 5 years ago
- Numpydocs -> mkdocs friendly markdown☆12Jun 10, 2022Updated 4 years ago
- C++ PyTorch Examples☆10Aug 18, 2019Updated 6 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆11Aug 23, 2025Updated 9 months ago
- ☆55Jan 13, 2023Updated 3 years ago
- mWER loss implementation in tensorflow☆31Sep 7, 2020Updated 5 years ago
- Lightweight Bayesian deep learning library for fast prototyping based on PyTorch☆14Feb 24, 2023Updated 3 years ago
- PyTorch bindings for Warp-CTC☆42Dec 6, 2019Updated 6 years ago
- A recipe for disfluency detection on the LibriStutter dataset using SpeechBrain☆11Mar 13, 2021Updated 5 years ago
- SChunk-Encoder (Transformer or Conformer) for streaming E2E ASR☆11Oct 21, 2022Updated 3 years ago
- ☆25May 26, 2026Updated 3 weeks ago
- CLEAR benchmark (NeurIPS 2021 Dataset & Benchmark)☆27Apr 23, 2023Updated 3 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Raw waveform adaptation with SincNet☆12Mar 19, 2024Updated 2 years ago
- The project for speech translation☆12Sep 28, 2023Updated 2 years ago
- Speech enhancement using mimic loss☆16Oct 25, 2019Updated 6 years ago
- This is now the official location of the Kaldi project.☆24Nov 13, 2019Updated 6 years ago
- ☆11Oct 20, 2022Updated 3 years ago
- kaldi cnn-tdnnf baseline☆13Aug 31, 2021Updated 4 years ago
- This repo contains the code for "Voice Disorder Analysis: A Transformer-based Approach", accepted at Interspeech 2024☆14Jun 11, 2024Updated 2 years ago
- ☆11Dec 28, 2023Updated 2 years ago
- Ship remote sensing dataset☆12Jun 28, 2022Updated 3 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Autoregressive Predictive Coding: An unsupervised autoregressive model for speech representation learning☆191Jan 29, 2020Updated 6 years ago
- PyTorch implementation of the Factorized TDNN (TDNN-F) from "Semi-Orthogonal Low-Rank Matrix Factorization for Deep Neural Networks" and …☆149Jan 6, 2020Updated 6 years ago
- A database of clean and noisy speech for audio research☆10Jan 26, 2018Updated 8 years ago
- Voice Conversion with GANs☆15Jul 5, 2018Updated 7 years ago
- Pytorch implementation of "Towards Practical and Efficient Image-to-Speech Captioning with Vision-Language Pre-training and Multi-modal T…☆12Apr 29, 2026Updated last month
- ☆45Sep 28, 2025Updated 8 months ago
- ☆18Jun 5, 2026Updated last week