mdangschat/speech-corpus-dl

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/mdangschat/speech-corpus-dl)

mdangschat / speech-corpus-dl

Download and preperation tool for free speech corpora.

☆16

Alternatives and similar repositories for speech-corpus-dl

Users that are interested in speech-corpus-dl are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

mdangschat / ctc-asr
View on GitHub
End-to-end trained speech recognition system, based on RNNs and the connectionist temporal classification (CTC) cost function.
☆123Apr 15, 2020Updated 6 years ago
stefanpantic / asr
View on GitHub
Automatic speech recognition using neural networks
☆18Nov 21, 2020Updated 5 years ago
artbataev / end2end
View on GitHub
Losses and decoders for end-to-end ASR and OCR
☆34Oct 30, 2020Updated 5 years ago
dreasysnail / CoCon
View on GitHub
Consistent dialogue generation
☆16Oct 26, 2022Updated 3 years ago
diaoenmao / Speech-Emotion-Recognition-with-Dual-Sequence-LSTM-Architecture
View on GitHub
[ICASSP 2020] Speech Emotion Recognition with Dual-Sequence LSTM Architecture
☆12Jan 17, 2025Updated last year
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
pquochuy / dcase2020-seld
View on GitHub
Source code of the DCASE 2020 SELD submission "Audio Event Detection and Localization with Multitask Regression Network"
☆17Jul 8, 2020Updated 6 years ago
LeeYongHyeok / DCM_vgg_transformer
View on GitHub
Dual cross modality attention audio-visual speech recognition model based on vgg transformer with hybrid CTC/attention architecture using…
☆14Jul 2, 2020Updated 6 years ago
ynop / audiomate
View on GitHub
Python library for handling audio datasets.
☆139Jul 6, 2023Updated 3 years ago
tbornt / phoneme_ctc
View on GitHub
Bidirectional dynamic RNN + CTC for phoneme recognition
☆47Jun 24, 2020Updated 6 years ago
SenJia / Position-Information
View on GitHub
How Much Position Information Do Convolutional Neural Networks Encode?
☆11Sep 20, 2021Updated 4 years ago
nijatmursali / speech-recognition-lstm
View on GitHub
This is the repository for Neural Networks project called Speech Emotion Classification Using Attention-Based LSTM
☆13Apr 30, 2020Updated 6 years ago
chinshr / sctk
View on GitHub
Speech Recognition Scoring Toolkit
☆13Sep 30, 2015Updated 10 years ago
sequence-labeling / rnn-transducer
View on GitHub
An implementation of rnn transducer for sequence labeling problem
☆22Feb 24, 2018Updated 8 years ago
ducha-aiki / hardnet-in-fastai2-and-kornia
View on GitHub
Re-implementation of local descriptor HardNet training in fasta2+kornia
☆21Apr 6, 2020Updated 6 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
RuABraun / texterrors
View on GitHub
☆37Jun 9, 2026Updated last month
sdrobert / pydrobert-kaldi
View on GitHub
SWIG bindings for Kaldi I/O, built with Conda
☆15Dec 15, 2024Updated last year
vagrawal / deepsphinx
View on GitHub
☆19Aug 27, 2018Updated 7 years ago
NinedayWang / Self-Attentive-and-Gated-SLU
View on GitHub
Implementation of paper accepted by EMNLP 2018 using Pytorch named "A Self-Attentive Model with Gate Mechanism for Spoken Language Unders…
☆17Dec 11, 2018Updated 7 years ago
ywatanabe1989 / PyTorch-gaussian-YOLOv3-1D
View on GitHub
A model for event detection from 1D arrays (= vectors) based on gaussian YOLOv3
☆14Jun 26, 2020Updated 6 years ago
SuperKogito / pydiogment
View on GitHub
Python library for audio augmentation
☆84Jul 6, 2023Updated 3 years ago
zh217 / torch-asg
View on GitHub
Auto Segmentation Criterion (ASG) implemented in pytorch
☆51Oct 1, 2021Updated 4 years ago
maelfabien / build_your_blog.github.io
View on GitHub
Template and steps to build your personal blog using Jekyll and Minimal Mistake
☆10Feb 24, 2020Updated 6 years ago
domerin0 / rnn-speech
View on GitHub
Character level speech recognizer using ctc loss with deep rnns in TensorFlow.
☆78Jun 9, 2018Updated 8 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
qpuchen / nnUNet_att_position_correction
View on GitHub
Solution of Team sdkxd for MICCAI 2023 Challenges: STS - Tooth Segmentation Task Based on 3D CBCT
☆24Jan 1, 2024Updated 2 years ago
AppleHolic / pytorch_sound
View on GitHub
Sound Related Deep Learning Tasks boosting repository with pytorch
☆88Jul 25, 2024Updated 2 years ago
z-tufekci / DeepLearning
View on GitHub
☆12May 22, 2022Updated 4 years ago
bertsky / ocrd_publaynet
View on GitHub
convert PubLayNet data into METS/PAGE-XML
☆10Mar 17, 2020Updated 6 years ago
raotnameh / End-to-end-E2E-Named-Entity-Recognition-from-English-Speech
View on GitHub
☆32Dec 2, 2020Updated 5 years ago
clab / knowledge
View on GitHub
☆10Oct 6, 2015Updated 10 years ago
fannn1217 / Results-of-Deep-Learning-in-NLP
View on GitHub
SOTA results for machine learning problems in NLP .
☆11Nov 3, 2020Updated 5 years ago
Sanqiang / dl_research
View on GitHub
My Deep Learning Research Papers
☆14Jan 29, 2017Updated 9 years ago
BackNode / train_Word2Vec
View on GitHub
☆10Nov 5, 2015Updated 10 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
maximeburri / slurmwebapp
View on GitHub
Web application for SLURM cluster
☆20Oct 9, 2016Updated 9 years ago
HawkAaron / RNN-Transducer
View on GitHub
MXNet implementation of RNN Transducer (Graves 2012): Sequence Transduction with Recurrent Neural Networks
☆140Jun 7, 2021Updated 5 years ago
ryan-lowe / Ubuntu-Dialogue-Generationv2
View on GitHub
The better version of Ubuntu Dialogue Corpus
☆16Feb 20, 2016Updated 10 years ago
mayank-git-hub / ETE-Speech-Recognition
View on GitHub
Implementation of Hybrid CTC/Attention Architecture for End-to-End Speech Recognition in pure python and PyTorch
☆26Jul 25, 2024Updated 2 years ago
studio-ousia / textent
View on GitHub
Representation Learning of Entities and Documents from Knowledge Base Descriptions
☆18Oct 6, 2018Updated 7 years ago
kremerj / gan
View on GitHub
A 1D toy example of optimizing a generative model using the WGAN-GP model.
☆25Jul 24, 2017Updated 9 years ago
LHolten / DialoGPT-MMI-decoder
View on GitHub
MMI decoder for DialoGPT and discord bot
☆42Mar 3, 2021Updated 5 years ago