lyj157175/Speech-transformer

lyj157175 / Speech-transformer

A speech recognition system implemented with the Transformer model

☆19

Alternatives and similar repositories for Speech-transformer

Users who are interested in Speech-transformer are comparing it to the libraries listed below.

CerryXu / pytorch-transformer
A PyTorch implementation of the Transformer model
☆13 · Dec 30, 2019 · Updated 6 years ago
wuwusky / Machine-Learning-for-Weather-Recognition
Machine image algorithm track: weather recognition
☆16 · Oct 28, 2019 · Updated 6 years ago
ms-dot-k / LRW_ID
The speaker-labeled information of LRW dataset, which is the outcome of the paper "Speaker-adaptive Lip Reading with User-dependent Paddi…
☆10 · Oct 12, 2023 · Updated 2 years ago
AiIsBetter / sichuan_voice_phishing2020
2020 First Digital Sichuan Innovation Competition, algorithm track: fraud phone call recognition, rank 29/779
☆17 · Feb 20, 2021 · Updated 5 years ago
yuguochencuc / CinCGAN-SE
Joint magnitude estimation and phase recovery using Cycle-in-Cycle GAN for non-parallel speech enhancement
☆10 · Jan 24, 2022 · Updated 4 years ago
chaufanglin / Normal2Whisper
Implementation of "Improving Whispered Speech Recognition Performance using Pseudo-whispered based Data Augmentation"
☆14 · Oct 31, 2024 · Updated last year
wenet-e2e / WeTextProcessing.deprecated
☆61 · Jan 31, 2023 · Updated 3 years ago
fchest / Speech-Transformer-multi-GPUs
A PyTorch implementation of Speech Transformer with multi-GPUs, an End-to-End ASR with Transformer network on Mandarin Chinese. This code…
☆10 · Dec 25, 2019 · Updated 6 years ago
fangxu622 / FCN-RS-Imagery-Class
Remote sensing image classification experiments based on fully convolutional networks
☆18 · Sep 11, 2017 · Updated 8 years ago
percent4 / transformers_chinese_text_classification
This project uses the transformers module with the bert-base-chinese model to perform multi-class text classification.
☆41 · Jan 16, 2021 · Updated 5 years ago
zhoubill / Tensorflow-cnn
Image recognition and classification with a TensorFlow-based CNN (convolutional neural network)
☆126 · Nov 14, 2018 · Updated 7 years ago
JohnsonLee1999 / 2021TJUThesisLatexTemplate
The latest LaTeX graduation thesis template for Tianjin University's class of 2021.
☆13 · May 25, 2021 · Updated 5 years ago
xingchensong / Speech-Transformer-plus-2DAttention
A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.
☆12 · May 7, 2019 · Updated 7 years ago
SarthakYadav / audiomae-plusplus-official
Official repository for the paper "AudioMAE++: learning better masked audio representations with SwiGLU FFNs"
☆15 · Apr 30, 2026 · Updated 2 months ago
nwpuaslp / ASC_baseline
☆20 · Nov 22, 2020 · Updated 5 years ago
foamliu / Speech-Transformer
PyTorch re-implementation of Speech-Transformer
☆102 · Nov 19, 2021 · Updated 4 years ago
mligg23 / CarPlateIdentity
An entry-level license plate recognition project based on PyTorch and OpenCV
☆27 · Dec 26, 2020 · Updated 5 years ago
tgc1997 / event-based-lip-reading
Multi-grained Spatio-Temporal Features Perceived Network for Event-based Lip-Reading (CVPR 2022)
☆16 · Jun 18, 2022 · Updated 4 years ago
kaituoxu / Speech-Transformer
A PyTorch implementation of Speech Transformer, an End-to-End ASR with Transformer network on Mandarin Chinese.
☆810 · Apr 6, 2023 · Updated 3 years ago
mispchallenge / misp2021_baseline
☆29 · Jun 15, 2022 · Updated 4 years ago
TeaPoly / CTC-OptimizedLoss
Computes the MWER (minimum WER) Loss with CTC beam search. Knowledge distillation for CTC loss.
☆59 · Sep 6, 2023 · Updated 2 years ago
lansinuote / Simple_Text_to_Speech
☆24 · Mar 13, 2025 · Updated last year
SpeechColab / PySpeechColab
A library of speech gadgets.
☆15 · Oct 15, 2022 · Updated 3 years ago
by2101 / OpenASR
A pytorch based end2end speech recognition system.
☆115 · Jan 16, 2021 · Updated 5 years ago
gentaiscool / end2end-asr-pytorch
End-to-End Automatic Speech Recognition on PyTorch
☆304 · Jun 2, 2022 · Updated 4 years ago
awni / future_speech
The History of Speech Recognition to the Year 2030
☆13 · Aug 14, 2021 · Updated 4 years ago
summertriangle-dev / hatedelay
Operating tools for texture bank files.
☆11 · Nov 2, 2016 · Updated 9 years ago
liumusicforever / CNN_Face_Detection
Repository for "A Convolutional Neural Network Cascade for Face Detection", implemented with Python interface.
☆13 · Nov 16, 2017 · Updated 8 years ago
xiquan-li / FineLAP
[ACL 2026 Main] FineLAP: Taming Heterogeneous Supervision for Fine-grained Language-Audio Pre-training
☆36 · Apr 20, 2026 · Updated 3 months ago
ZZDoog / Speaker2Dubber
[ACM MM24] Official implementation of paper "From Speaker to Dubber: Movie Dubbing with Prosody and Duration Consistency Learning"
☆34 · Jul 14, 2026 · Updated last week
chenjiasheng / mwer
mWER loss implementation in tensorflow
☆31 · Sep 7, 2020 · Updated 5 years ago
ZhengkunTian / OpenTransformer
A No-Recurrence Sequence-to-Sequence Model for Speech Recognition
☆378 · Jul 21, 2022 · Updated 4 years ago
bml1g12 / benchmarking_video_reading_python
Comparing speed of different implementations of reading video into numpy arrays
☆47 · Nov 19, 2022 · Updated 3 years ago
Hyottoko / Speex_android
Records audio from the phone microphone to a PCM file, then runs it through raw Speex encoding and decoding to restore it to PCM
☆11 · Mar 8, 2018 · Updated 8 years ago
kooBH / ULCNet
[WIP] Trying to implement "Ultra Low Complexity Deep Learning Based Noise Suppression." arXiv preprint arXiv:2312.08132 (2023).
☆29 · May 29, 2024 · Updated 2 years ago
gyglim / shot-detection-evaluation
Evaluation of shot detection results using the RAI dataset
☆10 · Jun 7, 2018 · Updated 8 years ago
upskyy / Transformer-Transducer
PyTorch implementation of "Transformer Transducer: A Streamable Speech Recognition Model with Transformer Encoders and RNN-T Loss" (ICASS…
☆114 · Feb 27, 2022 · Updated 4 years ago
arxrean / LipRead-seq2seq
An unofficial (PyTorch) implementation for the paper Deep Lip Reading: A comparison of models and an online application.
☆10 · May 13, 2020 · Updated 6 years ago
chatopera / text-cfg-parser
CFG syntactic parsing for natural language processing
☆10 · Mar 27, 2018 · Updated 8 years ago