msalhab96/Listen-Attend-and-Spell

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/msalhab96/Listen-Attend-and-Spell)

msalhab96 / Listen-Attend-and-Spell

PyTorch implementation of Listen, Attend and Spell (LAS) speech recognition paper

☆12

Alternatives and similar repositories for Listen-Attend-and-Spell

Users that are interested in Listen-Attend-and-Spell are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

taskswithcode / sota_researchers_with_published_code
View on GitHub
Researchers who published code, models (in some cases), and demo apps (in few cases) along with their SOTA paper
☆12Oct 19, 2023Updated 2 years ago
alefiury / SE-R-2022-SER-Track
View on GitHub
Code for the winning solution in the SE&R 2022 Challenge - SER track.
☆16Mar 28, 2023Updated 3 years ago
johndpope / Singing-Voice-Conversion-with-conditional-VAW-GAN
View on GitHub
This is the implementation of the paper "VAW-GAN for Singing Voice Conversion withNon-parallel Training Data".
☆17Aug 12, 2020Updated 5 years ago
applicaai / pyramidions
View on GitHub
This repository contains a demonstrative implementation for pooling-based models, e.g., DeepPyramidion complementing our paper "Sparsifyi…
☆14May 15, 2022Updated 4 years ago
ex3ndr / supervoice-enhance
View on GitHub
Supervoice diffusion enhance
☆28Jul 15, 2024Updated 2 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
NoSavedDATA / Neve
View on GitHub
NSK Coding Language: Fast and Simple
☆15Jul 6, 2026Updated 3 weeks ago
esotericpig / nhkore
View on GitHub
🇯🇵📰🗻 NHK News Web (Easy) word frequency (core list) scraper for Japanese language learners.
☆16Sep 19, 2025Updated 10 months ago
msalhab96 / MultiSpeech
View on GitHub
pytorch implementation for MultiSpeech: Multi-Speaker Text to Speech with Transformer paper
☆21Jun 23, 2022Updated 4 years ago
haroldserrano / GettingStartedWithMetal
View on GitHub
Getting Started with the Metal API
☆11Jan 7, 2017Updated 9 years ago
josepegea / termux_ruby_api
View on GitHub
A Ruby Gem for interacting with Android API from within Termux
☆18Mar 24, 2019Updated 7 years ago
whull / end2end_ASR
View on GitHub
端到端语音识别实现；包含LAS、CTC、RNNT解码方式，模型SA(MHA)、LSTM、CNN、DFSMN等
☆15Jun 4, 2021Updated 5 years ago
Mitomzhou / ASRT_SR_tensorflow2.0
View on GitHub
基于深度学习识别THCHS30数据集
☆14Oct 27, 2021Updated 4 years ago
msalhab96 / RNN-Transducer
View on GitHub
PyTorch implementation of Sequence Transduction with Recurrent Neural Networks (RNN-T) speech recognition paper
☆16Mar 4, 2022Updated 4 years ago
PristineStream / ChatGPT-Chinese-Tutorial
View on GitHub
ChatGPT中文学习和实践资料汇总——LLaMA、ChatGLM等大模型的Finetune
☆14Updated this week
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
deepakbaby / isegan
View on GitHub
Improved Speech Enhancement GANs
☆13Jun 24, 2020Updated 6 years ago
Japan7 / yohane
View on GitHub
Forced alignment for karaokes
☆28Updated this week
renesas-rcar / weston
View on GitHub
The Weston Wayland Compositor
☆15Oct 11, 2018Updated 7 years ago
TXLiao / MulT-TTE
View on GitHub
Source code of the proposed method MulT-TTE in the paper "Multi-faceted Route Representation Learning for Travel Time Estimation"
☆16Apr 7, 2025Updated last year
KrishnaDN / E2E_ASR_Confidence_Estimation
View on GitHub
Implementation of the paper "Confidence estimation for attention based sequence to sequence models for speech recognition"
☆16May 9, 2021Updated 5 years ago
myungcheol / hanja
View on GitHub
한자 사전
☆20Aug 30, 2021Updated 4 years ago
ymoslem / Arabisc
View on GitHub
Context-Sensitive Neural Spelling Checker
☆20Sep 25, 2024Updated last year
Xiaoxiaohuangg / LAS-Chinese-pytorch
View on GitHub
Listen, Attend and Spell - PyTorch Implementation
☆17Dec 28, 2018Updated 7 years ago
shanguanma / Aligners
View on GitHub
HMM, CTC, RNN-Transducer, forward-backward algorithm
☆20Sep 5, 2023Updated 2 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
aquibali01 / Voice-to-text-and-voice-chatbot
View on GitHub
Voice-to-Voice Chatbot using Whisper, LLaMA, and Groq API
☆18Aug 21, 2024Updated last year
GauSyu / Humphreys
View on GitHub
Solutions for exercises in Humphreys' GTM 9
☆11Feb 18, 2016Updated 10 years ago
Prakashdeveloper03 / Image-Enhancer
View on GitHub
This repository contains source code for Image enhancer can perform various image effects created using OpenCV.
☆10Dec 7, 2022Updated 3 years ago
suhitaghosh10 / emo-stargan
View on GitHub
Implementation of Emo-StarGAN
☆48Dec 19, 2023Updated 2 years ago
phineas-pta / fine-tune-whisper-vi
View on GitHub
jupyter notebooks to fine tune whisper models on Vietnamese using Colab and/or Kaggle and/or AWS EC2
☆19Aug 15, 2025Updated 11 months ago
msalhab96 / AraSpell
View on GitHub
A framework for Arabic spelling correction using different seq2seq model architectures such as transformers and RNNs
☆25Jul 21, 2024Updated 2 years ago
nguyentrungnghia1998 / Reinforcement-Learning-for-Optimal-Feedback-Control-Simulation
View on GitHub
☆18Mar 3, 2023Updated 3 years ago
kyunghyuncho / Foundations_of_LADS
View on GitHub
☆20Jul 10, 2025Updated last year
msalhab96 / Conformer
View on GitHub
An implementation for "Conformer: Convolution-augmented Transformer for Speech Recognition" Paper
☆20Aug 16, 2022Updated 3 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
CODEJIN / XiaoiceSing2
View on GitHub
☆19Feb 2, 2023Updated 3 years ago
lucasgris / wav2vec4bp
View on GitHub
Wav2vec resources and models for Brazilian Portuguese
☆36Jul 15, 2022Updated 4 years ago
ex3ndr / supervoice-librilight-preprocessed
View on GitHub
60k hours of phoneme-aligned audio from audio books
☆19Jul 27, 2024Updated 2 years ago
sukhitashvili / pong
View on GitHub
A reimplementation of Andrej Karpathy's repository for an RL self-learning AI agent that learns to play Pong through trial and error, usi…
☆18Aug 23, 2025Updated 11 months ago
Venomtek / zsysctl-manual-gc
View on GitHub
Manual zsys garbage collection script.
☆18Mar 4, 2021Updated 5 years ago
CodeLinkIO / Vietnamese-text-normalization
View on GitHub
☆17Jul 6, 2023Updated 3 years ago
huanyu-neo / minecraft-neofetch
View on GitHub
Neofetch in minecraft!
☆14May 18, 2024Updated 2 years ago