georgesterpu/Taris

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/georgesterpu/Taris)

georgesterpu / Taris

Transformer-based online speech recognition system with TensorFlow 2

☆26

Alternatives and similar repositories for Taris

Users that are interested in Taris are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

georgesterpu / avsr-tf1
View on GitHub
Audio-Visual Speech Recognition using Sequence to Sequence Models
☆84Jul 10, 2020Updated 6 years ago
pandeydivesh15 / AVSR-Deep-Speech
View on GitHub
Google Summer of Code 2017 Project: Development of Speech Recognition Module for Red Hen Lab
☆44Aug 29, 2017Updated 8 years ago
smeetrs / deep_avsr
View on GitHub
A PyTorch implementation of the Deep Audio-Visual Speech Recognition paper.
☆244Feb 15, 2024Updated 2 years ago
georgesterpu / pyVSR
View on GitHub
Python toolkit for Visual Speech Recognition
☆37Jun 10, 2020Updated 6 years ago
lzuwei / ip-avsr
View on GitHub
Audio Visual Speech Recognition
☆23Aug 9, 2017Updated 8 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
LeeYongHyeok / DCM_vgg_transformer
View on GitHub
Dual cross modality attention audio-visual speech recognition model based on vgg transformer with hybrid CTC/attention architecture using…
☆14Jul 2, 2020Updated 6 years ago
sooftware / End-to-End-Speech-Recognition-Models
View on GitHub
PyTorch implementation of automatic speech recognition models.
☆38Jan 10, 2021Updated 5 years ago
lilianemomeni / KWS-Net
View on GitHub
Seeing Wake Words: Audio-visual Keyword Spotting
☆67Sep 16, 2020Updated 5 years ago
mpc001 / end-to-end-lipreading
View on GitHub
Pytorch code for End-to-End Audiovisual Speech Recognition
☆183Nov 18, 2022Updated 3 years ago
iamhankai / voiceMusicSeparation
View on GitHub
Voice Music Separation competing for 6th Huawei Cup in ZJU
☆11Jun 2, 2015Updated 11 years ago
georgid / AlignmentEvaluation
View on GitHub
Scripts for computing common lyrics-to-audio alignment evaluation metrics. Usable evaluation for any token-based alignment (e.g. if tok…
☆18Oct 27, 2020Updated 5 years ago
ajinkyaT / Lip_Reading_in_the_Wild_AVSR
View on GitHub
Audio-Visual Speech Recognition using Deep Learning
☆61Nov 14, 2018Updated 7 years ago
staywithme23 / lipreading-by-convolutional-neural-network-keras
View on GitHub
demo code for lip reading
☆21Dec 9, 2016Updated 9 years ago
thu-spmi / SPMILM
View on GitHub
A SPMI Lab toolkit for language models.
☆11Apr 12, 2017Updated 9 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
tstafylakis / Lipreading-ResNet
View on GitHub
Torch code for using Residual Networks with LSTMs for Lipreading
☆99Oct 8, 2018Updated 7 years ago
zengchang233 / MTGAN
View on GitHub
MTGAN: Speaker Verification through Multitasking Triplet Generative Adversarial Networks
☆19Feb 29, 2020Updated 6 years ago
voidful / wav2vec2-xlsr-multilingual-56
View on GitHub
56 language, 1 model Multilingual ASR
☆25Jul 25, 2021Updated 4 years ago
kate-egorova / ASR-hybrid-decoding
View on GitHub
This is an extension of kaldi speech recognition software which allows to perform decoding of speech with hybrid word and phoneme graphs.…
☆11Feb 4, 2020Updated 6 years ago
ms-dot-k / Visual-Audio-Memory
View on GitHub
PyTorch implementation of "Multi-modality Associative Bridging through Memory: Speech Sound Recollected from Face Video" (ICCV2021)
☆22Apr 11, 2022Updated 4 years ago
ServerSideHannes / las
View on GitHub
tf 2.0 implementation of Listen, attend and spell
☆21Jan 19, 2021Updated 5 years ago
hchung12 / espnet-asr
View on GitHub
☆37Dec 23, 2020Updated 5 years ago
ian-k-1217 / Fully-Generalized-Non-Local-Network
View on GitHub
☆10Jun 2, 2021Updated 5 years ago
mpc001 / Lipreading_using_Temporal_Convolutional_Networks
View on GitHub
ICASSP'22 Training Strategies for Improved Lip-Reading; ICASSP'21 Towards Practical Lipreading with Distilled and Efficient Models; ICASS…
☆437May 18, 2023Updated 3 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
YoungloLee / tf2-speech-recognition-transformer
View on GitHub
Tensorflow 2 Speech Recognition Code (Transformer)
☆25Jun 29, 2020Updated 6 years ago
afperezm / acoustic-images-distillation
View on GitHub
Code for the paper: Audio-Visual Model Distillation Using Acoustic Images
☆21Mar 24, 2023Updated 3 years ago
audioku / meta-transfer-learning
View on GitHub
Implementation of meta-transfer-learning for ASR and LM (ACL 2020)
☆52Jul 30, 2020Updated 5 years ago
idiap / apam
View on GitHub
APAM toolkit is built on PyTorch and provides recipes to adapt pretrained acoustic models with a variety of sequence discriminative train…
☆14Feb 15, 2021Updated 5 years ago
ms-dot-k / LRW_ID
View on GitHub
The speaker-labeled information of LRW dataset, which is the outcome of the paper "Speaker-adaptive Lip Reading with User-dependent Paddi…
☆10Oct 12, 2023Updated 2 years ago
lijuntaopku / UFD
View on GitHub
Code for Unsupervised Domain Adaptation of a Pretrained Cross-Lingual Language Model, IJCAI 2020
☆12Nov 26, 2020Updated 5 years ago
dansoutner / kaldi2htk
View on GitHub
Script for converting kaldi GMM/HMM models to HTK format
☆11Jul 18, 2024Updated 2 years ago
HLTCHKUST / CI-AVSR
View on GitHub
Code repository for the Cantonese In-car Audio-Visual Speech Recognition (CI-AVSR) dataset.
☆42Jul 16, 2024Updated 2 years ago
JRMeyer / multi-task-kaldi
View on GitHub
An example directory for running Multi-Task Learning training on Kaldi neural networks. In Kaldi-speak, this is an egs dir for nnet3 trai…
☆55Jan 2, 2020Updated 6 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
Songlin-Dong / KRT-MLCIL
View on GitHub
☆12Apr 16, 2024Updated 2 years ago
foriamweak / FastAffineMotion
View on GitHub
Official git for "Fast Affine Motion Estimation for Versatile Video Coding (VVC) Encoding"
☆11Sep 14, 2020Updated 5 years ago
BitFloyd / Shot_Segmentation
View on GitHub
Project to segment video stream into separate shots
☆13Oct 30, 2018Updated 7 years ago
asappresearch / multistream-cnn
View on GitHub
Multistream CNN for Robust Acoustic Modeling
☆40Jun 17, 2021Updated 5 years ago
cywang97 / StreamingTransformer
View on GitHub
☆277Jan 15, 2021Updated 5 years ago
HaoranMiao / streaming-attention
View on GitHub
streaming attention networks for end-to-end automatic speech recognition
☆56May 6, 2020Updated 6 years ago
sooftware / speech-transformer
View on GitHub
Transformer implementation speciaized in speech recognition tasks using Pytorch.
☆65Nov 28, 2021Updated 4 years ago