An implementation of the Wav2Letter Speech-to-Text model using PyTorch.
☆14Mar 8, 2023Updated 3 years ago
Alternatives and similar repositories for wav2letter_pytorch
Users that are interested in wav2letter_pytorch are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Enable RNNLM lattice rescoring with Pytorch [kaldi]☆12Jun 5, 2020Updated 5 years ago
- Repository dedicated to Fixel Courses (Education)☆17Mar 20, 2026Updated last week
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆13Feb 13, 2021Updated 5 years ago
- A component for bringing Noitoms perception neuron data into TouchDesigner and auto-rigging characters.☆11Apr 27, 2020Updated 5 years ago
- Azure Kinect SDK C# Wrapper☆16Jul 25, 2019Updated 6 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Data Science Utils: Frequently Used Methods for Data Science☆37Updated this week
- <In Development> Transformers for Keras that support sklearn's .fit .predict .☆30Jun 23, 2020Updated 5 years ago
- PyTorch implementation of TinyWASE described in our paper "Compressing Speaker Extraction Model with Ultra-low Precision Quantization and…☆11Jun 28, 2021Updated 4 years ago
- Library to parallelize subtitles (.srt)☆14Jan 6, 2023Updated 3 years ago
- The world's simplest facial recognition api for Python and the command line☆11Feb 2, 2020Updated 6 years ago
- Hebrew oriented NER spaCy pipeline☆21Aug 8, 2024Updated last year
- Pythonic bindings for Xiaomi Smart Home Suite☆13Jan 15, 2017Updated 9 years ago
- This is the code of the ICASSP 2020 paper "Joint phoneme alignment and text-informed speech separation on highly corrupted speech"☆15Apr 8, 2024Updated last year
- ☆13Dec 2, 2018Updated 7 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Ubuntu arduino uploader and serial monitor☆11Feb 22, 2017Updated 9 years ago
- pix2pix-Next-Frame-Prediction generates video by recursively generating images with pix2pix.☆33Nov 2, 2018Updated 7 years ago
- Hebrew PHI identification and redaction toolkit☆20Mar 21, 2024Updated 2 years ago
- Paderbox: A collection of utilities for audio / speech processing☆43Jul 21, 2025Updated 8 months ago
- A simple, concise tensorflow implementation of fast style transfer with Spout support for VJ enabled texture sharing☆15Sep 4, 2017Updated 8 years ago
- Flutter + Firebase Remote Config demo project.☆19Feb 6, 2023Updated 3 years ago
- The refactoring tutorial I wrote for PyConDE 2022. You can also work through the exercises on your own.☆18Apr 22, 2024Updated last year
- ☆12Jun 10, 2021Updated 4 years ago
- GPU accelerated implementation of i-vector extractor training using PyTorch. Requires Kaldi for feature extraction and UBM training. An e…☆63Oct 15, 2019Updated 6 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- pytorch implementation of wavenet autoencoder https://arxiv.org/pdf/1704.01279.pdf☆12Jul 25, 2018Updated 7 years ago
- MicRank is a Learning to Rank neural channel selection framework where a DNN is trained to rank microphone channels.☆22Apr 8, 2021Updated 4 years ago
- ☆16Sep 12, 2019Updated 6 years ago
- Streaming source separation for music and speech files, using the Open-Unmix LSTM architecture.☆21Dec 8, 2022Updated 3 years ago
- This repository provides data and code for "Vox Populi, Vox DIY: Benchmark Dataset for Crowdsourced Audio Transcription" paper.☆16Jul 22, 2021Updated 4 years ago
- MultiBranch Pipeline For Argo Workflows☆37May 15, 2024Updated last year
- Decoding Diverse Solutions from Neural Sequence Models☆77Aug 13, 2018Updated 7 years ago
- Code to reproduce the experiments in the paper "Fast and stable blind source separation with rank-1 updates" presented at ICASSP 2020.☆21Apr 14, 2020Updated 5 years ago
- Control Blackmagic Design ATEM from TouchDesigner's CHOP Operator☆16Apr 8, 2022Updated 3 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- COALA: Co-Aligned Autoencoders for Learning Semantically Enriched Audio Representations☆48Jul 25, 2024Updated last year
- WS281x LED Matrix Image Rendering Library☆18Aug 12, 2019Updated 6 years ago
- Hebrew text generation models based on EleutherAI's gpt-neo. Each was trained on a TPUv3-8 made avilable via TPU Research Cloud Program.☆22Jul 6, 2022Updated 3 years ago
- Code for the Paper Speech Recognition and Multi-Speaker Diarization of Long Conversations☆38Jun 12, 2023Updated 2 years ago
- ☆19Dec 5, 2020Updated 5 years ago
- Sample .toe file using OpenPose on TouchDesigner☆16Dec 1, 2018Updated 7 years ago
- Deploy Hasura on Cloud Run☆30Jul 21, 2020Updated 5 years ago