assafmu/wav2letter_pytorch

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/assafmu/wav2letter_pytorch)

assafmu / wav2letter_pytorch

An implementation of the Wav2Letter Speech-to-Text model using PyTorch.

☆14

Alternatives and similar repositories for wav2letter_pytorch

Users that are interested in wav2letter_pytorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

cyfer0618 / kaldi-pytorch-rnnlm
View on GitHub
Enable RNNLM lattice rescoring with Pytorch [kaldi]
☆12Jun 5, 2020Updated 6 years ago
FixelAlgorithmsTeam / FixelCourses
View on GitHub
Repository dedicated to Fixel Courses (Education)
☆19Jul 21, 2026Updated last week
desh2608 / kaldi-noise-vectors
View on GitHub
Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.
☆13Feb 13, 2021Updated 5 years ago
Richard-Burns / PerceptionNeuronTouchDesigner
View on GitHub
A component for bringing Noitoms perception neuron data into TouchDesigner and auto-rigging characters.
☆11Apr 27, 2020Updated 6 years ago
HoloLabInc / AzureKinectSharp
View on GitHub
Azure Kinect SDK C# Wrapper
☆16Jul 25, 2019Updated 7 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
termoxin / subtitles-parallelizer
View on GitHub
Library to parallelize subtitles (.srt)
☆14Jan 6, 2023Updated 3 years ago
aispeech-lab / TinyWASE
View on GitHub
PyTorch implementation of TinyWASE described in our paper "Compressing Speaker Extraction Model with Ultra-low Precision Quantization and…
☆11Jun 28, 2021Updated 5 years ago
8400TheHealthNetwork / HebSpacy
View on GitHub
Hebrew oriented NER spaCy pipeline
☆20Aug 8, 2024Updated last year
m-i-k-i / face_recognition
View on GitHub
The world's simplest facial recognition api for Python and the command line
☆11Feb 2, 2020Updated 6 years ago
idanmoradarthas / DataScienceUtils
View on GitHub
Data Science Utils: Frequently Used Methods for Data Science
☆37Jun 6, 2026Updated last month
satoruhiga / TouchDesigner-FrameDelayCalculator
View on GitHub
☆13Dec 2, 2018Updated 7 years ago
geeknam / py-xiaomi-home
View on GitHub
Pythonic bindings for Xiaomi Smart Home Suite
☆13Jan 15, 2017Updated 9 years ago
schufo / tisms
View on GitHub
This is the code of the ICASSP 2020 paper "Joint phoneme alignment and text-informed speech separation on highly corrupted speech"
☆16Apr 8, 2024Updated 2 years ago
rystylee / pix2pix-Next-Frame-Prediction
View on GitHub
pix2pix-Next-Frame-Prediction generates video by recursively generating images with pix2pix.
☆32Nov 2, 2018Updated 7 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
limitium / arduino_builder
View on GitHub
Ubuntu arduino uploader and serial monitor
☆11Feb 22, 2017Updated 9 years ago
fgnt / paderbox
View on GitHub
Paderbox: A collection of utilities for audio / speech processing
☆43Jul 21, 2025Updated last year
spiraltechnica / Neural-VJ
View on GitHub
A simple, concise tensorflow implementation of fast style transfer with Spout support for VJ enabled texture sharing
☆15Sep 4, 2017Updated 8 years ago
mkobuolys / flutter-firebase-remote-config-demo
View on GitHub
Flutter + Firebase Remote Config demo project.
☆20Feb 6, 2023Updated 3 years ago
Rookout / piper
View on GitHub
MultiBranch Pipeline For Argo Workflows
☆37May 15, 2024Updated 2 years ago
jbeliao / SLAM
View on GitHub
☆16Sep 12, 2019Updated 6 years ago
satoruhiga / TouchDesigner-FFMPEG_Pipe
View on GitHub
☆17Jan 29, 2020Updated 6 years ago
krother / refactoring_tutorial
View on GitHub
The refactoring tutorial I wrote for PyConDE 2022. You can also work through the exercises on your own.
☆18Apr 22, 2024Updated 2 years ago
ICASSP2021-tutorial9 / Distant_conversational_ASR_and_analysis
View on GitHub
☆12Jun 10, 2021Updated 5 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
popcornell / MicRank
View on GitHub
MicRank is a Learning to Rank neural channel selection framework where a DNN is trained to rank microphone channels.
☆22Apr 8, 2021Updated 5 years ago
etzinis / fedenhance
View on GitHub
Code for the paper: Separate but togerher: Unsupervised Federated Learning for Speech Enhancement from non-iid data
☆41Nov 1, 2021Updated 4 years ago
tommy-fox / streaming-source-separation
View on GitHub
Streaming source separation for music and speech files, using the Open-Unmix LSTM architecture.
☆21Dec 8, 2022Updated 3 years ago
vvestman / pytorch-ivectors
View on GitHub
GPU accelerated implementation of i-vector extractor training using PyTorch. Requires Kaldi for feature extraction and UBM training. An e…
☆63Oct 15, 2019Updated 6 years ago
RobertBrunhage / flutter_twitter_app_tutorial
View on GitHub
☆19Dec 5, 2020Updated 5 years ago
Cloud-CV / diverse-beam-search
View on GitHub
Decoding Diverse Solutions from Neural Sequence Models
☆78Aug 13, 2018Updated 7 years ago
onolab-tmu / code_2020ICASSP_iss
View on GitHub
Code to reproduce the experiments in the paper "Fast and stable blind source separation with rank-1 updates" presented at ICASSP 2020.
☆22Apr 14, 2020Updated 6 years ago
xavierfav / coala
View on GitHub
COALA: Co-Aligned Autoencoders for Learning Semantically Enriched Audio Representations
☆48Jul 25, 2024Updated 2 years ago
doubleshow / libsikuli
View on GitHub
c++ library for Sikuli
☆17Jan 4, 2011Updated 15 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
camikura / touchdesigner-atem-chop
View on GitHub
Control Blackmagic Design ATEM from TouchDesigner's CHOP Operator
☆18Apr 8, 2022Updated 4 years ago
pilot7747 / VoxDIY
View on GitHub
This repository provides data and code for "Vox Populi, Vox DIY: Benchmark Dataset for Crowdsourced Audio Transcription" paper.
☆16Jul 22, 2021Updated 5 years ago
8400TheHealthNetwork / HebSafeHarbor
View on GitHub
Hebrew PHI identification and redaction toolkit
☆21Mar 21, 2024Updated 2 years ago
NoaCahan / WavenetAutoEncoder
View on GitHub
pytorch implementation of wavenet autoencoder https://arxiv.org/pdf/1704.01279.pdf
☆12Jul 25, 2018Updated 8 years ago
whizzzkid / rpi-ws281x-matrix-python
View on GitHub
WS281x LED Matrix Image Rendering Library
☆18Aug 12, 2019Updated 6 years ago
Norod / hebrew-gpt_neo
View on GitHub
Hebrew text generation models based on EleutherAI's gpt-neo. Each was trained on a TPUv3-8 made avilable via TPU Research Cloud Program.
☆22Jul 6, 2022Updated 4 years ago
n3n / hasura-cloud-run
View on GitHub
Deploy Hasura on Cloud Run
☆30Jul 21, 2020Updated 6 years ago