A speech recognition system implemented with a Transformer model
☆19Aug 11, 2020Updated 5 years ago
Alternatives and similar repositories for Speech-transformer
Users that are interested in Speech-transformer are comparing it to the libraries listed below.
- ☆15Nov 6, 2017Updated 8 years ago
- Joint magnitude estimation and phase recovery using Cycle-in-Cycle GAN for non-parallel speech enhancement☆10Jan 24, 2022Updated 4 years ago
- The speaker-labeled information of LRW dataset, which is the outcome of the paper "Speaker-adaptive Lip Reading with User-dependent Paddi…☆10Oct 12, 2023Updated 2 years ago
- ☆61Jan 31, 2023Updated 3 years ago
- Vocoder-Free Non-Parallel Conversion of Whispered Speech With Masked Cycle-Consistent Generative Adversarial Networks☆17Aug 18, 2023Updated 2 years ago
- Python implementation of CTC beam search decoder + agnostic LM scorer☆20Dec 16, 2020Updated 5 years ago
- Implementation of "Improving Whispered Speech Recognition Performance using Pseudo-whispered based Data Augmentation"☆14Oct 31, 2024Updated last year
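- This project uses the transformers module with the bert-base-chinese model to implement multi-class text classification.☆41Jan 16, 2021Updated 5 years ago
- Image recognition and classification with a CNN (convolutional neural network) built on TensorFlow☆126Nov 14, 2018Updated 7 years ago
- The latest LaTeX template for Tianjin University graduation theses (class of 2021).☆13May 25, 2021Updated 5 years ago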
- A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.☆12May 7, 2019Updated 7 years ago
- PyTorch re-implementation of Speech-Transformer☆102Nov 19, 2021Updated 4 years ago
- ☆20Nov 22, 2020Updated 5 years ago
- 🤫A Lightweight One-Shot Whisper to Normal Voice Conversion Model Using Distillation of Self-Supervised Features☆24Dec 10, 2025Updated 6 months ago
- An entry-level license plate recognition project based on PyTorch and OpenCV☆28Dec 26, 2020Updated 5 years ago
- ☆53Aug 22, 2025Updated 9 months ago
- Multi-grained Spatio-Temporal Features Perceived Network for Event-based Lip-Reading (CVPR 2022)☆14Jun 18, 2022Updated 3 years ago
- A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.☆810Apr 6, 2023Updated 3 years ago
- Pytorch implementation of DPCRN☆28Mar 31, 2024Updated 2 years ago
- ☆29Jun 15, 2022Updated 4 years ago
- End-to-End Automatic Speech Recognition on PyTorch☆304Jun 2, 2022Updated 4 years ago
- A pytorch based end2end speech recognition system.☆114Jan 16, 2021Updated 5 years ago
- ☆29Sep 29, 2022Updated 3 years ago
- This project is intended to build and deploy an SNPE model on Qualcomm Devices, which are having unsupported layers which are not part of…☆10Oct 4, 2021Updated 4 years ago
- mxnet deploy version of pseudo-3d-residual-networks(P-3D), sport1m and Kinetics pretrained model is supported☆13Jul 27, 2018Updated 7 years ago
- convert pytorch trained yolo model to ncnn for Flexible deployment☆10Aug 30, 2018Updated 7 years ago
- Operating tools for texture bank files.☆11Nov 2, 2016Updated 9 years ago
- Repository for "A Convolutional Neural Network Cascade for Face Detection", implemented with Python interface.☆13Nov 16, 2017Updated 8 years ago
- Code for the paper: Graph Jigsaw Learning for Cartoon Face Recognition☆10Jul 1, 2022Updated 3 years ago
- ☆29Jul 12, 2024Updated last year
- Comparing speed of different implementations of reading video into numpy arrays☆47Nov 19, 2022Updated 3 years ago
- ☆46Dec 17, 2018Updated 7 years ago
- A No-Recurrence Sequence-to-Sequence Model for Speech Recognition☆378Jul 21, 2022Updated 3 years ago
- PyTorch implementation of "Transformer Transducer: A Streamable Speech Recognition Model with Transformer Encoders and RNN-T Loss" (ICASS…☆114Feb 27, 2022Updated 4 years ago
- evaluation of shot detection results using the RAI dataset☆10Jun 7, 2018Updated 8 years ago
- Part of a research scholarship. I built a basic 2d driving sim with simulated lidar data to train Deep Q Neural Network. So far after abo…☆11Feb 15, 2017Updated 9 years ago
- Voice Conversion using Tacotron.☆11Dec 29, 2022Updated 3 years ago
- ☆36Dec 25, 2023Updated 2 years ago
- This repository contains the implementation of the paper -- KNOT: Knowledge Distillation using Optimal Transport for Solving NLP Tasks☆15Sep 15, 2022Updated 3 years ago