thevasudevgupta / speech-jaxLinks

Speech in Flax/JAX

☆15

Alternatives and similar repositories for speech-jax

Users that are interested in speech-jax are comparing it to the libraries listed below

Sorting:

sanchit-gandhi / seq2seq-speech
Repository for fine-tuning Transformers 🤗 based seq2seq speech models in JAX/Flax.
☆36Updated 2 years ago
asappresearch / sew
☆76Updated 3 years ago
frozentoad9 / CMST
Prabhupadavani: A Code-mixed Speech Translation Data for 25 languages
☆13Updated 2 years ago
Open-Speech-EkStep / audio-to-speech-pipeline
This will hold the data pipeline to convert raw audio data to speech which will act as input dataset for speech-to-text pipeline
☆32Updated 2 years ago
besacier / ASR2022
☆56Updated 2 years ago
google-deepmind / dm_aux
☆66Updated 10 months ago
google-research / last
A JAX library for building lattice-based speech transducer models
☆45Updated 7 months ago
SarthakYadav / audax
A home for audio ML in JAX. Has common features, learnable frontends, pretrained supervised and self-supervised models.
☆68Updated 2 years ago
egorsmkv / asr-corpus-creator
This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.
☆27Updated last year
skit-ai / speech-to-intent-dataset
Dataset Release for Intent Classification from Speech
☆47Updated 4 months ago
xinjli / asr2k
asr2k
☆51Updated last year
facebookresearch / flashy
Framework for writing deep learning training loops. Lightweight, and retaining full freedom to design as you see fits. It handles checkpo…
☆113Updated last year
salesforce / speech-datasets
Simplified recipes for preparing commonly used speech datasets, and a PyTorch-compatible Python data loader that can perform standard fea…
☆15Updated 2 years ago
facebookresearch / gtn_applications
Applications using the GTN library and code to reproduce experiments in "Differentiable Weighted Finite-State Transducers"
☆84Updated 2 years ago
Edresson / Wav2Vec-Wrapper
An easy way to fine-tune Wav2Vec 2.0 for low-resource languages.
☆82Updated 2 years ago
MiniXC / phones
A collection of utilities for handling IPA phones.
☆25Updated last year
Edresson / SC-GlowTTS
SC-GlowTTS: an Efficient Zero-Shot Multi-Speaker Text-To-Speech Model
☆107Updated 3 years ago
jonatasgrosman / asrecognition
ASRecognition: just an easy-to-use library for Automatic Speech Recognition.
☆51Updated 2 years ago
Open-Speech-EkStep / vakyansh-tts
Text to Speech for Indic languages
☆51Updated 3 years ago
krylm / whisper-event-tuning
Final training script from HuggingFace Whisper Fine tuning event - to get best results on finetuned model.
☆12Updated 2 years ago
awni / future_speech
The History of Speech Recognition to the Year 2030
☆13Updated 3 years ago
patrickvonplaten / audio-gen-dreambooth
☆23Updated 2 years ago
sayakpaul / BiT-jax2tf
This repository hosts the code to port NumPy model weights of BiT-ResNets to TensorFlow SavedModel format.
☆14Updated 3 years ago
ANLGBOY / WaveNODE
Pytorch Implementation of WaveNODE
☆64Updated 4 years ago
yoyolicoris / variational-diffwave
☆31Updated 2 years ago
hbredin / pyannotebook
🎹 pyannote + 🗒 notebook = pyannotebook
☆26Updated 2 years ago
RF5 / transfusion-asr
Transcribing Speech with Multinomial Diffusion, training code and models.
☆77Updated last year
patrickvonplaten / Wav2Vec2_ParlanceCTCDecode
☆11Updated 3 years ago
bepierre / SpeechVGG
Feature extractor for DL speech processing.
☆66Updated 3 years ago
asappresearch / wav2seq
Official code for Wav2Seq
☆95Updated 2 years ago