bastibe / PySoundFile
DEPRECATED version of SoundFile
★14 · Updated 4 years ago
Related projects
Alternatives and complementary repositories for PySoundFile
- Repository for fine-tuning Transformers-based seq2seq speech models in JAX/Flax. ★34 · Updated last year
- ★74 · Updated 3 years ago
- Interface for using TTS and vocoder models in the form of a text editor ★19 · Updated 2 years ago
- TTS Client for Coqui TTS server ★13 · Updated last year
- A bidirectional recurrent neural network model with attention mechanism for restoring missing punctuation in unsegmented text ★36 · Updated 4 years ago
- ★11 · Updated 5 years ago
- Zero-shot Audio Classification using Whisper ★74 · Updated last year
- Conditional lyrics generator: a pre-trained GPT-2 model fine-tuned on a lyrics-with-features dataset. ★40 · Updated 4 years ago
- pyannote + notebook = pyannotebook ★25 · Updated last year
- Easily turn large sets of audio URLs into an audio dataset. ★20 · Updated last year
- Running Mozilla's implementation of Baidu DeepSpeech on Google Colaboratory ★16 · Updated 5 years ago
- Python bindings for SoX, aiming to replicate a subset of the command line sox utility. ★53 · Updated 3 years ago
- OpenAI Whisper Prompt Examples ★48 · Updated last year
- Simple text-to-phonemes converter for multiple languages ★20 · Updated 2 years ago
- Fast and high-quality sample-rate conversion library for Python ★79 · Updated last month
- Simple PyTorch Denoisers for Waveform Audio ★32 · Updated last month
- This repository contains the implementation of the paper: "Span Classification with Structured Information for Disfluency Detection in Sp… ★12 · Updated last year
- Dataset Release for Intent Classification from Speech ★46 · Updated last year
- Another implementation of the paper "Compound Word Transformer: Learning to Compose Full-Song Music over Dynamic Directed Hypergraphs" in… ★13 · Updated 3 years ago
- A high-resolution implementation of the HiFi-GAN vocoder for voice conversion. ★30 · Updated last year
- Speaker change detection using SincNet and an LSTM/Transformer ★44 · Updated 4 months ago
- Easily apply audio-related machine learning models trained on the AudioSet dataset (527+ models/classes). ★29 · Updated 5 months ago
- Nala is an agile open-source voice assistant framework (20+ actions). ★35 · Updated last year
- Automatically align transcribed audio and generate a wav2letter training corpus ★35 · Updated last year
- A Python library to find differences between audio and transcriptions ★15 · Updated last year
- QuartzNet implementation in PyTorch [https://arxiv.org/abs/1910.10261] ★26 · Updated 3 years ago
- ★32 · Updated 2 years ago
- Simple Python library, distributed via binary wheels with few direct dependencies, for easily using wav2vec 2.0 models for speech recogni… ★24 · Updated 3 years ago
- 🐸TTS recipes for different datasets ★85 · Updated 2 years ago
- Experiments with Hugging Face ★45 · Updated 3 months ago