介绍ctc算法原理以及numpy简单实现
☆68Aug 25, 2019Updated 6 years ago
Alternatives and similar repositories for CTC-loss-introduction
Users that are interested in CTC-loss-introduction are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Computes the MWER (minimum WER) Loss with CTC beam search. Knowledge distillation for CTC loss.☆59Sep 6, 2023Updated 2 years ago
- Confidence Estimation for Black Box Automatic Speech Recognition Systems Using Lattice Recurrent Neural Networks https://arxiv.org/abs/19…☆14Apr 16, 2020Updated 6 years ago
- Connectionist Temporal Classification (CTC) decoding algorithms: best path, beam search, lexicon search, prefix search, and token passing…☆835Jan 31, 2026Updated 3 months ago
- A Python interface to OpenFst (fix FstDrawer interface issue for 1.6 version)☆17Apr 2, 2018Updated 8 years ago
- ctcloss + centerloss crnn text recognition☆200Jan 28, 2021Updated 5 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Common template for pytorch project. Easy to extent and modify for new project.☆13Dec 13, 2022Updated 3 years ago
- 中文文本合成 for OCR☆12Mar 14, 2023Updated 3 years ago
- ☆12Sep 1, 2023Updated 2 years ago
- Implements of CTC, Speech-Transformer and CIF for end-to-end speech recognition with pytorch☆23Jul 28, 2020Updated 5 years ago
- mWER loss implementation in tensorflow☆31Sep 7, 2020Updated 5 years ago
- ☆11Dec 31, 2019Updated 6 years ago
- Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Juli…☆12Aug 13, 2020Updated 5 years ago
- Optimized loss based on cross-entropy (CE), like MWER (minimum WER) Loss with beam search and negative sampling strategy, Smoothed Max Po…☆25Oct 11, 2024Updated last year
- Wrapper over Yolo5Face☆19Nov 26, 2023Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆16Mar 30, 2024Updated 2 years ago
- Project for Connectionist Temporal Classification with Maximum Entropy Regularization.☆143Jun 29, 2020Updated 5 years ago
- it's ASR decoder and make graph project☆33May 26, 2022Updated 3 years ago
- PyTorch Implementations for End-to-End Automatic Speech Recognition☆127Jun 10, 2019Updated 6 years ago
- 基于rknn的yolov5的cpp实现,包含各种依赖库,是一个完整工程,可直接编译运行☆20Feb 10, 2022Updated 4 years ago
- Automatic Speech Recognition at the University of Edinburgh.☆16Mar 14, 2021Updated 5 years ago
- ☆46Nov 2, 2023Updated 2 years ago
- Code for ACL 2023 main conference paper "Understanding and Bridging the Modality Gap for Speech Translation".☆17Oct 25, 2023Updated 2 years ago
- TIoU metric in python3. Forked from https://github.com/Yuliang-Liu/TIoU-metric.☆26Nov 30, 2019Updated 6 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Targeted synthesis of multi-temporal remote sensing images for change detection using siamese neural networks☆24Feb 15, 2019Updated 7 years ago
- Polyphonic Sound Detection Score (PSDS)☆16Jan 20, 2020Updated 6 years ago
- image-segmentation and text-localization☆12Aug 22, 2018Updated 7 years ago
- CUDA-Warp RNN-Transducer☆216Feb 22, 2023Updated 3 years ago
- ☆10Dec 6, 2019Updated 6 years ago
- simple dnn based vad☆69Dec 2, 2018Updated 7 years ago
- This project provides a face recoganization system via opencv4☆18Jan 16, 2019Updated 7 years ago
- 天池大数据竞赛2017—广东政务数据创新大赛—智能算法赛☆10Apr 1, 2018Updated 8 years ago
- Matlab codes for Rectification and 3D Reconstruction of Curved Document Images (CVPR 11)☆25Feb 15, 2020Updated 6 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- PyTorch implementation of RNN-Transducer(RNN-T).☆81May 6, 2021Updated 4 years ago
- ☆12Feb 13, 2025Updated last year
- Custom decoders for Kaldi☆13Jun 5, 2019Updated 6 years ago
- Python API for reading and querying ARPA formatted language models.☆33Sep 9, 2014Updated 11 years ago
- The implementation for "Empowering Whisper as a Joint Multi-Talker and Target-Talker Speech Recognition System".☆30Aug 2, 2025Updated 9 months ago
- An experimental project for paddle python IR.☆15Dec 4, 2023Updated 2 years ago
- A gomoku AI based on Alpha Zero paper.☆12May 1, 2023Updated 3 years ago