ASRDeepspeech x Sakura-ML (English/Japanese) with deepspeech2 model in pytorch with support from Zakuro AI.
☆69Nov 3, 2022Updated 3 years ago
Alternatives and similar repositories for asr
Users that are interested in asr are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Japanese dictation kit using Julius☆164Apr 18, 2019Updated 6 years ago
- ☆90Mar 5, 2021Updated 5 years ago
- context labels and pronunciation data for JSUT corpus☆77Sep 2, 2021Updated 4 years ago
- Conformer encoder + Transformer decoder with Hybrid CTC/attention☆12Nov 11, 2021Updated 4 years ago
- Pytorch implementation for DeepSpeech 2.0☆31Jul 25, 2024Updated last year
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- VOICEVOX ENGINE、VOICEVOX NEMO ENGINE、COEIROINK用コマンドラインクライアント。複数のエンジンを使用した 並列処理もできます☆11May 4, 2024Updated last year
- PyTorch implementation of automatic speech recognition models.☆38Jan 10, 2021Updated 5 years ago
- 22人で童謡を5曲ずつ歌ってつくった歌唱データベースです。☆14Aug 7, 2022Updated 3 years ago
- A streamable speech recognition model with transformer encoders and RNN-T loss☆11Mar 1, 2021Updated 5 years ago
- Juliusを使ったセグメンテーション支援ツール☆13Feb 13, 2020Updated 6 years ago
- ☆12Mar 30, 2026Updated 2 weeks ago
- PyTorch implementation of "ContextNet: Improving Convolutional Neural Networks for Automatic Speech Recognition with Global Context" (INT…☆38Feb 27, 2022Updated 4 years ago
- DropClass and DropAdapt - repository for the paper accepted to Speaker Odyssey 2020☆22Oct 29, 2020Updated 5 years ago
- PyTorch implementation of Listen Attend and Spell Automatic Speech Recognition (ASR).☆39Jul 25, 2019Updated 6 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- RunPod Go SDK☆11May 16, 2024Updated last year
- Add pitch accent notation to Jisho.org (chrome extension)☆24Aug 14, 2019Updated 6 years ago
- NIILC QA data☆18Nov 20, 2015Updated 10 years ago
- Yolo(including yolov1 yolov2 yolov3)running on caffe windows. Anyone that is not familiar with linux can use this project to learn caffe …☆18Jun 15, 2018Updated 7 years ago
- Open-Source Toolkit for End-to-End Korean Automatic Speech Recognition leveraging PyTorch and Hydra.☆637May 27, 2023Updated 2 years ago
- Rust wrapper for the cld2 language detection library.☆16Nov 28, 2017Updated 8 years ago
- A Pytorch Implementation of paper: "Neural Color Operators for Sequential Image Retouching", ECCV 2022☆10Oct 25, 2022Updated 3 years ago
- ☆11Nov 28, 2025Updated 4 months ago
- Pytorch implemenation of the model proposed in the paper: Double Multi-Head Attention for Speaker Verification☆20Jul 25, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Awesome Quantization Paper lists with Codes☆10Feb 24, 2021Updated 5 years ago
- Python interface to Julius speech recognition engine☆27Jul 15, 2020Updated 5 years ago
- ☆43Mar 30, 2024Updated 2 years ago
- A liquify effect tool demo for the article http://geekofficedog.blogspot.com/2015/01/liquify-effect-hello-swirl-2.html☆15Apr 10, 2015Updated 11 years ago
- 時雨堂 Sphinx テーマ☆16Feb 3, 2026Updated 2 months ago
- Eyesight Detection using Openface and ML☆13Mar 28, 2022Updated 4 years ago
- Singing Style Transfer using Deep U-net for vocal separation & CycleConsistencyBoundaryEquilibrium GAN(Cycle-BEGAN) for vocal style trans…☆34Sep 17, 2019Updated 6 years ago
- A demo project demonstrating the performance improvement by cpp extension, which wrapped with pybind11.☆10Nov 16, 2021Updated 4 years ago
- A Chrome extension to display LaTeX flavoured math in GitHub Markdown previews.☆12Mar 5, 2023Updated 3 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Open-Source Large Vocabulary Continuous Speech Recognition Engine☆1,926Jun 16, 2025Updated 10 months ago
- ☆46Mar 18, 2016Updated 10 years ago
- ☆10May 15, 2021Updated 4 years ago
- ☆19May 9, 2019Updated 6 years ago
- Collection of materials and links to talks given, tools presented, software demos etc.☆13Jun 8, 2023Updated 2 years ago
- ☆10Feb 17, 2023Updated 3 years ago
- J-Net is aimed for audio separation with randomly weighted encoder.☆12Oct 23, 2019Updated 6 years ago