ngbala6 / Audio-Processing
This repo covers audio processing techniques and silence removal using Python
☆16 Updated 4 years ago
Alternatives and similar repositories for Audio-Processing:
Users that are interested in Audio-Processing are comparing it to the libraries listed below
- Stable timestamps and confidence score for words of OpenAI's Whisper outputs down to word-level. ☆25 Updated 2 years ago
- TTS Client for Coqui TTS server ☆13 Updated 2 years ago
- 🐸TTS recipes for different datasets ☆85 Updated 2 years ago
- Official Repository of the Deep Diacritization Paper ☆16 Updated 4 years ago
- A python package for whisper normalizer ☆47 Updated 2 months ago
- check your data, before you wreck your model ☆16 Updated 2 years ago
- This project is about performing Speaker diarization for Hindi Language. ☆48 Updated 3 years ago
- A free & open tool for transcribing audio interviews with offline ASR support ☆24 Updated last year
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo ☆25 Updated last year
- Speaker diarization python system based on binary key speaker modelling ☆61 Updated 3 years ago
- 🐸STT integration examples ☆125 Updated 2 years ago
- Adnabod lleferydd Cymraeg i'r Gymraeg gyda HuggingFace // Speech Recognition for Welsh with HuggingFace ☆14 Updated 2 years ago
- 24-hour Automatic Speech Recognition ☆27 Updated 3 years ago
- ☆43 Updated 2 years ago
- An easy way to fine-tune Wav2Vec 2.0 for low-resource languages. ☆81 Updated last year
- Reproducible experimental protocols for multimedia (audio, video, text) database ☆96 Updated last week
- A crash course for training speech recognition models using DeepSpeech. ☆24 Updated 3 years ago
- Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023. ☆80 Updated last year
- End-to-end spoken language identification out of the box. ☆48 Updated 4 years ago
- Speaker change detection using SincNet and an LSTM/Transformer ☆46 Updated 7 months ago
- Linguistic processing for Common Voice ☆53 Updated last year
- A bidirectional recurrent neural network model with attention mechanism for restoring missing punctuation in unsegmented text ☆36 Updated 4 years ago
- ☆39 Updated last year
- Running Mozilla's implementation of Baidu DeepSpeech on Google Colaboratory ☆16 Updated 5 years ago
- Tools to create your own voice dataset for TTS training ☆66 Updated 4 years ago
- Model for recasing and repunctuating ASR transcripts ☆133 Updated 10 months ago
- [deprecated] Pretrained models for pyannote-audio 1.x ☆72 Updated 2 years ago
- A complete speech segmentation system using Kaldi and x-vectors for voice activity detection (VAD) and speaker diarisation. ☆27 Updated 8 months ago
- This repository contains all the codes used in a thesis at Information Technology University (ITU). The topic of the thesis is pronunciat… ☆25 Updated 5 years ago
- SoundPy (alpha stage) is a research-based python package for speech and sound. Applications include deep-learning, filtering, speech-enha… ☆73 Updated last month