ngbala6 / Audio-Processing
This repo covers audio processing techniques and silence removal using Python
☆17Updated 4 years ago
Alternatives and similar repositories for Audio-Processing
Users that are interested in Audio-Processing are comparing it to the libraries listed below
- TTS Client for Coqui TTS server☆13Updated 2 years ago
- A free & open tool for transcribing audio interviews with offline ASR support☆24Updated last year
- ☆40Updated last year
- Model for recasing and repunctuating ASR transcripts☆133Updated last year
- Adnabod lleferydd Cymraeg i'r Gymraeg gyda HuggingFace // Speech Recognition for Welsh with HuggingFace☆14Updated 2 years ago
- Keras(Tensorflow) implementations of Automatic Speech Recognition☆23Updated 3 years ago
- Reproducible experimental protocols for multimedia (audio, video, text) database☆100Updated 3 months ago
- A deep-learning powered accessibility application which turns pdfs into audio files. Featuring ocr improvement and tts with inflection!☆23Updated 3 months ago
- Traditional ASR (Signal & Cepstral Analysis, DTW, HMM) & DNNs (Custom Models + DeepSpeech) on Indian Accent Speech☆92Updated last year
- SoundPy (alpha stage) is a research-based python package for speech and sound. Applications include deep-learning, filtering, speech-enha…☆74Updated 4 months ago
- A python package for whisper normalizer☆60Updated 3 weeks ago
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo☆25Updated 2 years ago
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper.☆137Updated last year
- Some simple wrappers around eSpeak NG intended to make using this excellent TTS for waveform and IPA generation as convenient as possible…☆42Updated 8 months ago
- Stable timestamps and confidence score for words of OpenAI's Whisper outputs down to word-level.☆25Updated 2 years ago
- A repo listing known open source voice tools, ordered by where they sit in the voice stack☆26Updated 2 years ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.☆102Updated 2 years ago
- This project is about performing Speaker diarization for Hindi Language.☆50Updated 4 years ago
- ⇨ The Speaker Recognition System consists of two phases, Feature Extraction and Recognition. ⇨ In the Extraction phase, the Speaker's vo…☆38Updated 5 years ago
- Web app for keyword spotting using TensorflowJS☆71Updated 2 years ago
- This will hold the data pipeline to convert raw audio data to speech which will act as input dataset for speech-to-text pipeline☆32Updated 2 years ago
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code☆149Updated last year
- End-to-end spoken language identification out of the box.☆48Updated 4 years ago
- Speaker diarization python system based on binary key speaker modelling☆61Updated 3 years ago
- Adapting your own Language Model for Kaldi☆63Updated 6 years ago
- A crash course for training speech recognition models using DeepSpeech.☆25Updated 4 years ago
- 🫠 check your data, before you wreck your model☆16Updated 2 years ago
- Command line tool to create corpora for Common Voice☆76Updated last year
- ☆41Updated 2 months ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisper☆114Updated 2 years ago