bastibe / PySoundFile
DEPRECATED version of SoundFile
☆14 Updated 5 years ago
Alternatives and similar repositories for PySoundFile
Users who are interested in PySoundFile compare it to the libraries listed below.
- TTS Client for Coqui TTS server ☆13 Updated 2 years ago
- Repository for fine-tuning Transformers 🤗 based seq2seq speech models in JAX/Flax. ☆36 Updated 2 years ago
- A Python library to find differences between audio and transcriptions ☆20 Updated last year
- Streamlit app to visualize and edit TTS datasets ☆14 Updated 3 years ago
- Experiments with Hugging Face 🔬 🤗 ☆44 Updated 9 months ago
- Easily turn large sets of audio URLs into an audio dataset. ☆21 Updated 2 years ago
- Dataset Release for Intent Classification from Speech ☆46 Updated 3 months ago
- A bidirectional recurrent neural network model with attention mechanism for restoring missing punctuation in unsegmented text ☆36 Updated 4 years ago
- Zero-shot Audio Classification using Whisper ☆79 Updated 2 years ago
- Audio processing using deep neural networks. Speaker identification using voice embeddings. ☆13 Updated 2 years ago
- Wrapper for pydub AudioSegment objects ☆96 Updated 2 years ago
- Self-supervised neural network for music recommendations. ☆18 Updated last year
- A collection of scripts to preprocess ASR datasets and finetune language-specific Wav2Vec2 XLSR models ☆31 Updated 4 years ago
- A 🔥 cookiecutter template for building Hugging Face Spaces ☆11 Updated 3 years ago
- Tunable pipelines ☆34 Updated 3 months ago
- Rhyme with AI ☆44 Updated 4 years ago
- Keras (TensorFlow) implementations of Automatic Speech Recognition ☆23 Updated 3 years ago
- Python C extension for the eSpeak speech synthesizer ☆11 Updated 4 years ago
- Gentle and praatio scripts for easy forced alignment ☆18 Updated 2 years ago
- This repository contains the implementation of the paper: "Span Classification with Structured Information for Disfluency Detection in Sp… ☆13 Updated 2 years ago
- TorchServe+Streamlit for easily serving your HuggingFace NER models ☆33 Updated 2 years ago
- ☆10 Updated last year
- ☆103 Updated last week
- A high-resolution implementation of HiFi-GAN Vocoder for Voice Conversion. ☆31 Updated 2 years ago
- Reproducible experimental protocols for multimedia (audio, video, text) databases ☆100 Updated 3 months ago
- Docker for HF wav2vec2-sprint ☆13 Updated 4 years ago
- ☆76 Updated 3 years ago
- [DEPRECATED] Audio Module for fastai v2 ☆65 Updated 2 years ago
- A crash course for training speech recognition models using DeepSpeech. ☆25 Updated 4 years ago
- Compute useful transcription metrics (CER, WER, SER, ...) ☆27 Updated 10 years ago