bastibe / PySoundFile
DEPRECATED version of SoundFile
★14 · Updated 5 years ago
Alternatives and similar repositories for PySoundFile
Users who are interested in PySoundFile are comparing it to the libraries listed below.
- Text to Speech for Indic languages ★52 · Updated 3 years ago
- 🐸TTS recipes for different datasets ★86 · Updated 3 years ago
- ★63 · Updated 4 years ago
- ★13 · Updated 2 years ago
- Zero-shot Audio Classification using Whisper ★79 · Updated 3 years ago
- Extremely easy to use sequence to sequence library with attention, for text to text conversion tasks. ★39 · Updated 5 years ago
- An easy-to-use package to restore punctuation of the text. ★119 · Updated 2 years ago
- Streamlit app to visualize and edit TTS datasets ★15 · Updated 3 years ago
- ★76 · Updated 4 years ago
- A bidirectional recurrent neural network model with attention mechanism for restoring missing punctuation in unsegmented text ★34 · Updated 5 years ago
- Transcribing audio files using Hugging Face's implementation of Wav2Vec2 + "chain-linking" NLP tasks to combine speech-to-text with downs… ★32 · Updated 4 years ago
- Simple OGG Vorbis, Opus and FLAC bindings for Python ★76 · Updated last year
- Real-time lossless audio compression in Python ★143 · Updated last year
- A list of scripts/notebooks I'd like to keep handy ★18 · Updated last year
- Repository for fine-tuning Transformers 🤗 based seq2seq speech models in JAX/Flax. ★38 · Updated 2 years ago
- Conditional lyrics generator -> pre-trained GPT2 model fine-tuned on lyrics with features dataset. ★41 · Updated 5 years ago
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code ★153 · Updated last year
- **ARCHIVED** Filesystem interface to 🤗 Hub ★58 · Updated 2 years ago
- Python C extension for the eSpeak speech synthesizer ★12 · Updated 4 years ago
- A python package for whisper normalizer ★70 · Updated 2 months ago
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper. ★137 · Updated 2 years ago
- Automated lyrics-to-audio alignment using syllabic nuclei detection. Developed during Google Summer of Code 2019. ★53 · Updated 2 years ago
- Model for recasing and repunctuating ASR transcripts ★142 · Updated last year
- On-device noise suppression powered by deep learning ★77 · Updated this week
- Audio and fastai v2 ★169 · Updated last year
- A collection of scripts to preprocess ASR datasets and finetune language-specific Wav2Vec2 XLSR models ★30 · Updated 4 years ago
- Experiments with Hugging Face 🤗 ★44 · Updated last year
- Automatically generates TTS dataset using audio and associated text. Make cuts under a custom length. Uses Google Speech to text API to p… ★52 · Updated 3 years ago
- REPeating Pattern Extraction Technique (REPET) in Python for audio source separation: original REPET, REPET extended, adaptive REPET, REP… ★33 · Updated last year
- TalkNet 2: Non-Autoregressive Depth-Wise Separable Convolutional Model for Speech Synthesis with Explicit Pitch and Duration Prediction. ★89 · Updated 4 years ago