bastibe / PySoundFile
DEPRECATED version of SoundFile
☆14 · Updated 4 years ago
Related projects:
- Streamlit app to visualize and edit TTS datasets ☆14 · Updated 2 years ago
- Zero-shot Audio Classification using Whisper ☆74 · Updated last year
- Repository for fine-tuning Transformers 🤗 based seq2seq speech models in JAX/Flax. ☆34 · Updated last year
- TTS Client for Coqui TTS server ☆13 · Updated last year
- Dataset Release for Intent Classification from Speech ☆43 · Updated last year
- Python C extension for the eSpeak speech synthesizer ☆10 · Updated 3 years ago
- Speech in Flax/JAX ☆15 · Updated 2 years ago
- Simple text to phonemes converter for multiple languages ☆21 · Updated last year
- 🫠 check your data, before you wreck your model ☆16 · Updated 2 years ago
- Finally, some decent sample sentences ☆21 · Updated 9 months ago
- A bidirectional recurrent neural network model with attention mechanism for restoring missing punctuation in unsegmented text ☆35 · Updated 4 years ago
- Easily turn large sets of audio urls to an audio dataset. ☆20 · Updated last year
- This repository contains the implementation of the paper: "Span Classification with Structured Information for Disfluency Detection in Sp… ☆12 · Updated last year
- [DEPRECATED] [PyTorch 2.0] [638M] [85.33% acc] Full-attention multi-instrumental music transformer for supervised music generation, opti… ☆31 · Updated 9 months ago
- Generate accompaniment part with chords using Evolutionary algorithm. ☆8 · Updated 2 years ago
- Final training script from HuggingFace Whisper Fine tuning event - to get best results on finetuned model. ☆12 · Updated last year
- Another implementation of the paper "Compound Word Transformer: Learning to Compose Full-Song Music over Dynamic Directed Hypergraphs" in… ☆13 · Updated 3 years ago
- Interface for using TTS and vocoder models in the form of a text editor ☆19 · Updated last year
- Opus codec support for Python. ☆25 · Updated last year
- 🎤 quick library to extract pause lengths from audio files. ☆31 · Updated 5 years ago
- ☆74 · Updated 2 years ago
- ☆15 · Updated last year
- Speaker change detection using SincNet and an LSTM/Transformer ☆39 · Updated 2 months ago
- Self-supervised neural network for music recommendations. ☆17 · Updated last year
- Transcribing audio files using Hugging Face's implementation of Wav2Vec2 + "chain-linking" NLP tasks to combine speech-to-text with downs… ☆31 · Updated 3 years ago
- A collection of scripts to preprocess ASR datasets and finetune language-specific Wav2Vec2 XLSR models ☆31 · Updated 3 years ago
- Keras(Tensorflow) implementations of Automatic Speech Recognition ☆22 · Updated 2 years ago
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo ☆24 · Updated last year
- ☆23 · Updated last year
- Running Mozilla's implementation of Baidu DeepSpeech on Google Colaboratory ☆16 · Updated 5 years ago