bastibe / PySoundFileLinks
DEPRECATED version of SoundFile
β14Updated 5 years ago
Alternatives and similar repositories for PySoundFile
Users that are interested in PySoundFile are comparing it to the libraries listed below
Sorting:
- Text to Speech for Indic languagesβ52Updated 3 years ago
- πΈTTS recipes for different datasetsβ86Updated 3 years ago
- A bidirectional recurrent neural network model with attention mechanism for restoring missing punctuation in unsegmented textβ34Updated 5 years ago
- Parse and convert numbers written in French, English, Spanish, Portuguese, German and Catalan into their digit representation.β112Updated 2 weeks ago
- Zero-shot Audio Classification using Whisperβ79Updated 3 years ago
- A repo listing known open source voice tools, ordered by where they sit in the voice stackβ27Updated 3 years ago
- Transcribing audio files using Hugging Face's implementation of Wav2Vec2 + "chain-linking" NLP tasks to combine speech-to-text with downsβ¦β32Updated 4 years ago
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of codeβ154Updated last year
- Model for recasing and repunctuating ASR transcriptsβ143Updated last year
- Real-time lossless audio compression in Pythonβ143Updated last year
- β76Updated 4 years ago
- A collection of scripts to preprocess ASR datasets and finetune language-specific Wav2Vec2 XLSR modelsβ30Updated 4 years ago
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper.β137Updated 2 years ago
- Advanced data structures for handling temporal segments with attached labels.β124Updated 4 months ago
- A list of scripts/notebooks I'd like to keep handyβ18Updated last year
- Automatically generates TTS dataset using audio and associated text. Make cuts under a custom length. Uses Google Speech to text API to pβ¦β52Updated 3 years ago
- πAn easy-to-use package to restore punctuation of the text.β119Updated 2 years ago
- Running Mozilla's implementation of Baidu DeepSpeech on Google Colaboratoryβ16Updated 6 years ago
- Small repo describing how to use Hugging Face's Wav2Vec2 with PyCTCDecodeβ111Updated 3 years ago
- Interface for using TTS and vocoder models in the form of a text editorβ19Updated 2 months ago
- TTS Client for Coqui TTS serverβ13Updated 3 years ago
- On-device noise suppression powered by deep learningβ81Updated last week
- An in-browser app for labeling audio clips at random, using Docker and Flask.β53Updated 8 years ago
- A crash course for training speech recognition models using DeepSpeech.β24Updated 4 years ago
- Python library for handling audio datasets.β138Updated 2 years ago
- A tokenizer, text cleaner, and phonemizer for many human languages.β332Updated last year
- β13Updated 3 years ago
- Repository for fine-tuning Transformers π€ based seq2seq speech models in JAX/Flax.β38Updated 2 years ago
- π« check your data, before you wreck your modelβ16Updated 3 years ago
- Experiments with Hugging Face π¬ π€β45Updated last year