pandeydivesh15/AVSR-Deep-Speech

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/pandeydivesh15/AVSR-Deep-Speech)

pandeydivesh15 / AVSR-Deep-Speech

Google Summer of Code 2017 Project: Development of Speech Recognition Module for Red Hen Lab

☆44

Alternatives and similar repositories for AVSR-Deep-Speech

Users that are interested in AVSR-Deep-Speech are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

lzuwei / ip-avsr
View on GitHub
Audio Visual Speech Recognition
☆23Aug 9, 2017Updated 8 years ago
georgesterpu / Taris
View on GitHub
Transformer-based online speech recognition system with TensorFlow 2
☆26Jan 22, 2021Updated 5 years ago
matthijsvk / multimodalSR
View on GitHub
Multimodal speech recognition using lipreading (with CNNs) and audio (using LSTMs). Sensor fusion is done with an attention network.
☆69Nov 19, 2022Updated 3 years ago
ajinkyaT / Lip_Reading_in_the_Wild_AVSR
View on GitHub
Audio-Visual Speech Recognition using Deep Learning
☆61Nov 14, 2018Updated 7 years ago
LCAV / localization-icassp2018
View on GitHub
Code of paper "Combining range and direction for improved localization" presented at ICASSP2018
☆10Apr 20, 2018Updated 8 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
georgesterpu / avsr-tf1
View on GitHub
Audio-Visual Speech Recognition using Sequence to Sequence Models
☆84Jul 10, 2020Updated 6 years ago
TTS-cdac-mumbai / TBT
View on GitHub
☆14May 7, 2019Updated 7 years ago
patyork / AutomaticSpeechChunker
View on GitHub
From a large speech audio file and its corresponding body of text, automatically chunk the audio and text into (phrase, audio_snippet) pa…
☆17May 15, 2015Updated 11 years ago
candlewill / Ossian
View on GitHub
Ossian: A simple language-independent Text-to-speech frontend
☆17Mar 1, 2018Updated 8 years ago
luomingshuang / k2-speechbrain
View on GitHub
In this repository, I try to combine k2 with speechbrain to decode well and fastly.
☆16Jun 17, 2022Updated 4 years ago
barthez / speaker-recognition-nn
View on GitHub
Speaker Recognition application using fast-forward NN
☆16Jun 14, 2012Updated 14 years ago
JarbasAl / kaldi_spotter
View on GitHub
wake word spotting with kaldi
☆19Dec 3, 2020Updated 5 years ago
yongxuUSTC / DNN-SpeechEnhancement
View on GitHub
DNN-based speech enhancement using Tensorflow by Haoyu Li (Tokyo univ.)
☆17Aug 31, 2017Updated 8 years ago
BUTSpeechFIT / ASR-hybrid-decoding
View on GitHub
☆17Nov 25, 2019Updated 6 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
brianlan / automatic-speech-recognition
View on GitHub
Automatic Speech Recognition using Tensorflow
☆46Aug 9, 2017Updated 8 years ago
RedHenLab / Audio
View on GitHub
Tools for parsing the audio track in television news programs
☆19Apr 24, 2021Updated 5 years ago
DeepLearn-lab / Acoustic-Feature-Fusion_Chime18
View on GitHub
Code for our paper "Acoustic Features Fusion using Attentive Multi-channel Deep Architecture" in Keras and tensorflow
☆26Nov 23, 2018Updated 7 years ago
someonefighting / tf-kaldi-speaker-master
View on GitHub
☆17Jun 30, 2020Updated 6 years ago
RedHenLab / ASR-for-Chinese-Pipeline
View on GitHub
Google Summer of Code 2018 Project: Automatic Speech Recognition for Speech-to-Text on Chinese
☆10Jan 11, 2019Updated 7 years ago
jacks205 / Spell-Check
View on GitHub
Spell Checker in Python
☆10Nov 11, 2013Updated 12 years ago
danFromTelAviv / key_words_spotting
View on GitHub
implementation of "EFFICIENT KEYWORD SPOTTING USING DILATED CONVOLUTIONS AND GATING"
☆38Dec 8, 2019Updated 6 years ago
RedHenLab / Gesture
View on GitHub
☆10Jan 27, 2017Updated 9 years ago
type-a / speechnet
View on GitHub
Automatic Speech Recognition
☆20Aug 24, 2018Updated 7 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
charlesliucn / LanMIT
View on GitHub
📖 LanMIT: A Toolkit for Improving Language Models in Low-resourced Speech Recognition based on Kaldi.
☆22Jul 12, 2019Updated 7 years ago
hirofumi0810 / asr_preprocessing
View on GitHub
Python implementation of pre-processing for End-to-End speech recognition
☆70Feb 19, 2018Updated 8 years ago
kastnerkyle / research_megarepo
View on GitHub
A monster repo for random research, not organized in any particular way
☆13Dec 27, 2016Updated 9 years ago
rakshithShetty / dnn-speech
View on GitHub
This ist the repository for the term project Speech Recognition using Deep Neural Networks for the course ELEC-E5510-Speech Recognition
☆12Dec 8, 2015Updated 10 years ago
david-ryan-snyder / kaldi
View on GitHub
This is now the official location of the Kaldi project.
☆10Aug 22, 2019Updated 6 years ago
RicherMans / AudioCaption
View on GitHub
Dataset and baseline for the first Audiocaption task
☆79Jul 25, 2024Updated last year
DistantSpeechRecognition / sweethomelisten
View on GitHub
☆17Apr 8, 2016Updated 10 years ago
idnavid / speech_activity_detection
View on GitHub
Unsupervised speech activity detection system.
☆11Jul 2, 2018Updated 8 years ago
Hannes1 / react-native-wenet
View on GitHub
Wenet speech to text for react native
☆10Nov 1, 2022Updated 3 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
uiuc-sst / asr24
View on GitHub
24-hour Automatic Speech Recognition
☆27Jun 4, 2021Updated 5 years ago
anicolson / matlab_feat
View on GitHub
Functions for creating speech features in MATLAB.
☆14Jul 7, 2020Updated 6 years ago
tachi-hi / tts_samples
View on GitHub
Demo page of our paper Efficiently Trainable Text-to-Speech System Based on Deep Convolutional Networks With Guided Attention, ICASSP 201…
☆15May 30, 2021Updated 5 years ago
kate-egorova / ASR-hybrid-decoding
View on GitHub
This is an extension of kaldi speech recognition software which allows to perform decoding of speech with hybrid word and phoneme graphs.…
☆11Feb 4, 2020Updated 6 years ago
i3thuan5 / hts_engine_python
View on GitHub
python wrap for hts engine
☆14Jan 30, 2018Updated 8 years ago
kusha / voiceid
View on GitHub
Speaker recognition/identification system in Python. Python3 port.
☆14May 2, 2015Updated 11 years ago
usc-sail / barista
View on GitHub
Barista is an open-source framework for concurrent speech processing.
☆36Mar 19, 2014Updated 12 years ago