PyTorch implementation of "ContextNet: Improving Convolutional Neural Networks for Automatic Speech Recognition with Global Context" (INTERSPEECH 2020)
☆38Feb 27, 2022Updated 4 years ago
Alternatives and similar repositories for ContextNet
Users who are interested in ContextNet compare it to the libraries listed below
- End-to-End Korean Automatic Speech Recognition leveraging PyTorch and Hydra.☆10Jan 21, 2022Updated 4 years ago
- Tensorflow2 based implementation of ContextNet, an improved convolutional rnn-transducer-based architecture for end-to-end speech recogni…☆18Oct 19, 2020Updated 5 years ago
- Paper Review about Speech Recognition · NLP☆10Mar 25, 2021Updated 4 years ago
- PyTorch implementation of "Transformer Transducer: A Streamable Speech Recognition Model with Transformer Encoders and RNN-T Loss" (ICASS…☆113Feb 27, 2022Updated 4 years ago
- Korean speech recognition with Quartznet, a quantized Jasper-based model☆22Jul 21, 2021Updated 4 years ago
- Automatic Speech Recognition (ASR) model QuartzNet trained on English CommonVoice. Implemented in PyTorch with CTC loss and beam search.☆16Nov 5, 2020Updated 5 years ago
- PyTorch implementation of "Deep Speech 2: End-to-End Speech Recognition in English and Mandarin" (ICML, 2016)☆26Mar 5, 2021Updated 5 years ago
- A streamable speech recognition model with transformer encoders and RNN-T loss☆11Mar 1, 2021Updated 5 years ago
- PyTorch implementation of RNN-Transducer(RNN-T).☆81May 6, 2021Updated 4 years ago
- PyTorch implementation of "Jasper: An End-to-End Convolutional Neural Acoustic Model" (INTERSPEECH 2019)☆32Mar 4, 2021Updated 5 years ago
- ☆21Feb 21, 2022Updated 4 years ago
- Repository for speech paper reading☆33Aug 19, 2021Updated 4 years ago
- Utilities for manipulating finite state transducers with the OpenFst library.☆32Sep 22, 2017Updated 8 years ago
- Tiny Transducer: A Highly-Efficient Speech Recognition Model on Edge Devices☆26Aug 4, 2022Updated 3 years ago
- A simplified version of wav2vec (1.0, vq, 2.0) in fairseq☆170Sep 21, 2020Updated 5 years ago
- Korean embedding model specialized for the financial domain☆22Aug 8, 2024Updated last year
- [ASRU 2021] Efficient Conformer: Progressive Downsampling and Grouped Attention for Automatic Speech Recognition☆219Jun 22, 2023Updated 2 years ago
- We can crawl NaverBlog, Twitter, Youtube!!☆14Sep 13, 2019Updated 6 years ago
- Dynamic Chunk Streaming and Offline Conformer based on athena-team/Athena.☆44Nov 2, 2022Updated 3 years ago
- ☆24Jan 14, 2021Updated 5 years ago
- Implementation of the paper "wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations" in PyTorch.☆57May 19, 2023Updated 2 years ago
- Open-Source Toolkit for End-to-End Speech Recognition leveraging PyTorch-Lightning and Hydra.☆717Oct 23, 2023Updated 2 years ago
- PyTorch implementation of "Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions", ICASSP, 2018.☆19Jan 21, 2021Updated 5 years ago
- Tiny configuration for Triton Inference Server☆45Jan 10, 2025Updated last year
- Final training script from the HuggingFace Whisper fine-tuning event, intended to get the best results from a fine-tuned model.☆12Dec 24, 2022Updated 3 years ago
- Distributed semi-constrained microphone arrays☆31May 4, 2024Updated last year
- ☆11Oct 3, 2021Updated 4 years ago
- Review of papers I read☆14Dec 11, 2020Updated 5 years ago
- Korean speech recognition based on Transformer☆31Feb 19, 2021Updated 5 years ago
- End-to-end speech recognition on AISHELL dataset.☆34Nov 9, 2021Updated 4 years ago
- Natural Language Processing Tasks and Examples.☆61Aug 17, 2022Updated 3 years ago
- Wav2Vec2 fine-tuning and inference code for IITP AI Grand Challenge☆36Feb 22, 2022Updated 4 years ago
- A fast and lightweight python-based CTC beam search decoder for speech recognition.☆469Jul 13, 2023Updated 2 years ago
- Neural-network-based speaker embedder☆25Jan 7, 2023Updated 3 years ago
- PyTorch implementation of "Squeezeformer: An Efficient Transformer for Automatic Speech Recognition" (NeurIPS 2022)☆148Nov 22, 2022Updated 3 years ago
- Template that combines PyTorch Lightning and Hydra☆15Aug 15, 2023Updated 2 years ago
- RNN-Transducer for Korean☆45Oct 31, 2020Updated 5 years ago
- OpenPose: A Real-Time Multi-Person Keypoint Detection And Multi-Threading C++ Library☆12Jul 13, 2017Updated 8 years ago
- A project for Korean automatic spacing☆12Aug 3, 2020Updated 5 years ago