TParcollet/E2E-SincNet

E2E-SincNet: Toward fully end-to-end speech recognition

☆30

Alternatives and similar repositories for E2E-SincNet

Users that are interested in E2E-SincNet are comparing it to the libraries listed below.

athena-team / athena-decoder
☆76 · Mar 18, 2022 · Updated 4 years ago
shanguanma / Aligners
HMM, CTC, RNN-Transducer, forward-backward algorithm
☆20 · Sep 5, 2023 · Updated 2 years ago
boknilev / asr-repr-analysis
☆15 · Apr 20, 2018 · Updated 8 years ago
ShigekiKarita / espnet-semi-supervised
ESPnet extensions for semi-supervised end-to-end speech recognition. See also https://github.com/ShigekiKarita/espnet-semi-supervised/tre…
☆38 · Feb 13, 2020 · Updated 6 years ago
jefflai108 / Semi-Supervsied-Spoken-Language-Understanding-PyTorch
Semi-supervised spoken language understanding (SLU) via self-supervised speech and language model pretraining
☆12 · Mar 23, 2021 · Updated 5 years ago
amoliu / pytorch-ctc
PyTorch CTC Decoder bindings
☆14 · Nov 2, 2017 · Updated 8 years ago
Alexander-H-Liu / NPC
Non-Autoregressive Predictive Coding
☆51 · Nov 3, 2020 · Updated 5 years ago
rosinality / imputer-pytorch
Implementation of Imputer: Sequence Modelling via Imputation and Dynamic Programming in PyTorch
☆58 · May 3, 2020 · Updated 6 years ago
lucasondel / multilingual-bottleneck-features
BUT Multilingual Bottleneck Features
☆15 · Mar 22, 2019 · Updated 7 years ago
idiap / pkwrap
A pytorch wrapper for LF-MMI training and parallel training in Kaldi
☆74 · Jun 8, 2022 · Updated 4 years ago
facebookresearch / CPC_audio
An implementation of the Contrast Predictive Coding (CPC) method to train audio features in an unsupervised fashion.
☆374 · Oct 12, 2021 · Updated 4 years ago
eastonYi / Unsupervised-ASR
unsupervised ASR (mainly phone classifier) using EODM and GAN
☆12 · Oct 22, 2020 · Updated 5 years ago
rwth-i6 / returnn-experiments
experiments with RETURNN
☆162 · Jun 18, 2026 · Updated last month
pzelasko / kaldialign
Python wrappers for Kaldi Levenshtein's distance and alignment code.
☆70 · Jun 15, 2026 · Updated last month
BUTSpeechFIT / AMI-diarization-setup
☆54 · Oct 17, 2023 · Updated 2 years ago
isl-mt / fluent-fisher
☆15 · Jun 17, 2019 · Updated 7 years ago
hirofumi0810 / neural_sp
End-to-end ASR/LM implementation with PyTorch
☆594 · Aug 30, 2021 · Updated 4 years ago
1ytic / warp-rna
Recurrent Neural Aligner
☆51 · Apr 14, 2020 · Updated 6 years ago
csukuangfj / optimized_transducer
Memory efficient transducer loss computation
☆70 · Jun 10, 2022 · Updated 4 years ago
desh2608 / kaldi-noise-vectors
Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.
☆13 · Feb 13, 2021 · Updated 5 years ago
YiwenShaoStephen / pychain
PyTorch implementation of LF-MMI for End-to-end ASR
☆221 · Jan 14, 2021 · Updated 5 years ago
smallflyingpig / learning-to-fool-the-speaker-recognition
code for paper "learning to fool the speaker recognition"
☆10 · Jun 12, 2020 · Updated 6 years ago
ZuoFuhong / subtitle
A lightweight real-time captioning application for macOS, powered by whisper.cpp and DeepSeek-V3.
☆26 · Oct 11, 2025 · Updated 9 months ago
bbc / bbc-speech-segmenter
A complete speech segmentation system using Kaldi and x-vectors for voice activity detection (VAD) and speaker diarisation.
☆31 · Jun 17, 2024 · Updated 2 years ago
hfutami / distill-bert-for-seq2seq-asr
☆24 · Jun 17, 2020 · Updated 6 years ago
sshh12 / Conv-VAD
A packaged convolutional voice activity detector for noisy environments.
☆14 · Jun 15, 2019 · Updated 7 years ago
zh217 / torch-asg
Auto Segmentation Criterion (ASG) implemented in pytorch
☆51 · Oct 1, 2021 · Updated 4 years ago
Kyubyong / specAugment
Tensor2tensor experiment with SpecAugment
☆46 · May 13, 2019 · Updated 7 years ago
wenet-e2e / WeTextProcessing.deprecated
☆61 · Jan 31, 2023 · Updated 3 years ago
1ytic / warp-rnnt
CUDA-Warp RNN-Transducer
☆216 · Feb 22, 2023 · Updated 3 years ago
freewym / espresso
Espresso: A Fast End-to-End Neural Speech Recognition Toolkit
☆939 · Sep 4, 2024 · Updated last year
nii-yamagishilab / Intelligibility-MetricGAN
Implementation for paper "iMetricGAN: Intelligibility Enhancement for Speech-in-Noise using Generative Adversarial Network-based Metric L…
☆56 · Jul 6, 2023 · Updated 3 years ago
RF5 / transfusion-asr
Transcribing Speech with Multinomial Diffusion, training code and models.
☆80 · Sep 27, 2023 · Updated 2 years ago
patrickltobing / shallow-wavenet
☆18 · Feb 9, 2020 · Updated 6 years ago
awslabs / speech-representations
Code for DeCoAR (ICASSP 2020) and BERTphone (Odyssey 2020)
☆104 · Nov 26, 2022 · Updated 3 years ago
narVidhai / Speech-Transcription-Benchmarking
Example python scripts to evaluate various ASR methods
☆11 · Dec 22, 2021 · Updated 4 years ago
facebookresearch / WavAugment
A library for speech data augmentation in time-domain
☆689 · Aug 30, 2021 · Updated 4 years ago
AppleHolic / source_separation
Deep learning based speech source separation using Pytorch
☆319 · Nov 20, 2020 · Updated 5 years ago
bagustris / ssl-ser
Repository for reproducing result in journal "Self-supervised learning for Speech Emotion Recognition"
☆10 · Mar 15, 2023 · Updated 3 years ago