E2E-SincNet: Toward fully end-to-end speech recognition
☆30Feb 1, 2020Updated 6 years ago
Alternatives and similar repositories for E2E-SincNet
Users that are interested in E2E-SincNet are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- HMM, CTC, RNN-Transducer, forward-backward algorithm☆20Sep 5, 2023Updated 2 years ago
- Semi-supervised spoken language understanding (SLU) via self-supervised speech and language model pretraining☆12Mar 23, 2021Updated 5 years ago
- ☆15Apr 20, 2018Updated 7 years ago
- PyTorch CTC Decoder bindings☆14Nov 2, 2017Updated 8 years ago
- A lightweight real-time captioning application for macOS, powered by whisper.cpp and DeepSeek-V3.☆24Oct 11, 2025Updated 6 months ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Non-Autoregressive Predictive Coding☆51Nov 3, 2020Updated 5 years ago
- Implementation of Imputer: Sequence Modelling via Imputation and Dynamic Programming in PyTorch☆58May 3, 2020Updated 5 years ago
- BUT Multilingual Bottleneck Features☆15Mar 22, 2019Updated 7 years ago
- A pytorch wrapper for LF-MMI training and parallel training in Kaldi☆73Jun 8, 2022Updated 3 years ago
- An implementation of the Contrast Predictive Coding (CPC) method to train audio features in an unsupervised fashion.☆370Oct 12, 2021Updated 4 years ago
- unsupervised ASR (mainly phone classifier) using EODM and GAN☆12Oct 22, 2020Updated 5 years ago
- experiments with RETURNN☆161Feb 7, 2026Updated 2 months ago
- ☆15Jun 17, 2019Updated 6 years ago
- Python wrappers for Kaldi Levenshtein's distance and alignment code.☆68Updated this week
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆54Oct 17, 2023Updated 2 years ago
- End-to-end ASR/LM implementation with PyTorch☆594Aug 30, 2021Updated 4 years ago
- Recurrent Neural Aligner☆51Apr 14, 2020Updated 6 years ago
- code for paper "learning to fool the speaker recognition"☆10Jun 12, 2020Updated 5 years ago
- Memory efficient transducer loss computation☆70Jun 10, 2022Updated 3 years ago
- PyTorch implementation of LF-MMI for End-to-end ASR☆221Jan 14, 2021Updated 5 years ago
- A packaged convolutional voice activity detector for noisy environments.☆14Jun 15, 2019Updated 6 years ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆13Feb 13, 2021Updated 5 years ago
- ☆24Jun 17, 2020Updated 5 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆61Jan 31, 2023Updated 3 years ago
- CUDA-Warp RNN-Transducer☆216Feb 22, 2023Updated 3 years ago
- Tensor2tensor experiment with SpecAugment☆46May 13, 2019Updated 6 years ago
- Auto Segmentation Criterion (ASG) implemented in pytorch☆51Oct 1, 2021Updated 4 years ago
- Espresso: A Fast End-to-End Neural Speech Recognition Toolkit☆939Sep 4, 2024Updated last year
- A complete speech segmentation system using Kaldi and x-vectors for voice activity detection (VAD) and speaker diarisation.☆31Jun 17, 2024Updated last year
- Implementation for paper "iMetricGAN: Intelligibility Enhancement for Speech-in-Noise using Generative Adversarial Network-based Metric L…☆56Jul 6, 2023Updated 2 years ago
- ☆18Feb 9, 2020Updated 6 years ago
- Transcribing Speech with Multinomial Diffusion, training code and models.☆80Sep 27, 2023Updated 2 years ago
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- SincNet is a neural architecture for efficiently processing raw audio samples.☆1,238Apr 28, 2021Updated 4 years ago
- Code for DeCoAR (ICASSP 2020) and BERTphone (Odyssey 2020)☆104Nov 26, 2022Updated 3 years ago
- A library for speech data augmentation in time-domain☆685Aug 30, 2021Updated 4 years ago
- Repository for reproducing result in journal "Self-supervised learning for Speech Emotion Recognition"☆10Mar 15, 2023Updated 3 years ago
- Deep learning based speech source separation using Pytorch☆319Nov 20, 2020Updated 5 years ago
- A fast parallel implementation of RNN Transducer.☆314Jun 7, 2023Updated 2 years ago
- 几种VAD算法的测评☆25Jul 31, 2020Updated 5 years ago