TiagoBras / audio-clip-extractor
This utility cuts multiple clips from one or more audio files.
☆19 · Updated 3 years ago
Alternatives and similar repositories for audio-clip-extractor:
Users interested in audio-clip-extractor are comparing it to the libraries listed below.
- A minimalistic Streamlit-based web app for automatic speech recognition, powered by OpenAI's state-of-the-art Whisper models ☆66 · Updated 2 years ago
- Running Mozilla's implementation of Baidu DeepSpeech on Google Colaboratory ☆16 · Updated 5 years ago
- TTS Client for Coqui TTS server ☆13 · Updated 2 years ago
- OCTRA is a web-application for the orthographic transcription of audio files. ☆37 · Updated this week
- A repo listing known open source voice tools, ordered by where they sit in the voice stack ☆26 · Updated 2 years ago
- DEPRECATED version of SoundFile ☆14 · Updated 4 years ago
- Python C extension for the eSpeak speech synthesizer ☆11 · Updated 4 years ago
- A bidirectional recurrent neural network model with attention mechanism for restoring missing punctuation in unsegmented text ☆36 · Updated 4 years ago
- My guide to creating an Italian TTS with Coqui ☆14 · Updated 3 years ago
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo ☆25 · Updated last year
- Wrapper for pydub AudioSegment objects ☆96 · Updated 2 years ago
- Zero-shot Audio Classification using Whisper ☆78 · Updated 2 years ago
- Python library for audio augmentation ☆83 · Updated last year
- Speaker diarization via transfer learning ☆27 · Updated 5 years ago
- A simple Streamlit-based web app to process text and correct punctuation, built using the "fullstop-punctuation-multilang-large" model from Hug… ☆11 · Updated last year
- LogMMSE speech enhancement/noise reduction ☆30 · Updated 4 years ago
- Automatically generates TTS dataset using audio and associated text. Make cuts under a custom length. Uses Google Speech to text API to p… ☆52 · Updated 2 years ago
- A deep learning model that predicts a speaker's native country from their spoken English accent ☆47 · Updated 4 years ago
- [deprecated] Pretrained models for pyannote-audio 1.x ☆72 · Updated 2 years ago
- A deep-learning-powered accessibility application that turns PDFs into audio files, featuring OCR improvement and TTS with inflection ☆23 · Updated this week
- A Gradio interface for making transcribed and translated subtitles for videos ☆34 · Updated this week
- Web-based tool for straightforward class annotation of audio files ☆11 · Updated 4 years ago
- Listen to any audio stream on your machine and print out the transcribed or translated audio. ☆117 · Updated last year
- PyTorch implementation of DeepMind's WaveRNN model ☆121 · Updated 5 years ago
- This repository contains the implementation of the paper: "Span Classification with Structured Information for Disfluency Detection in Sp… ☆12 · Updated last year
- 24-hour Automatic Speech Recognition ☆27 · Updated 3 years ago
- A PyTorch demo of the paper "Voice Separation with an Unknown Number of Multiple Speakers", using Gradio and an NVIDIA NeMo ASR model ☆35 · Updated last year
- Whisper combined with Silero VAD, for improved long-form transcriptions ☆46 · Updated 2 years ago
- Tools to create your own voice dataset for TTS training ☆66 · Updated 4 years ago
- This repository is for wake-word detection in speech using recurrent neural networks ☆17 · Updated 5 years ago