PranavPutsa1006 / Speaker-Diarization
Identifying individual speakers in an audio stream based on the unique characteristics found in individual voices using Python
☆18Updated last year
Alternatives and similar repositories for Speaker-Diarization:
Users that are interested in Speaker-Diarization are comparing it to the libraries listed below
- OCTRA is a web-application for the orthographic transcription of audio files.☆37Updated this week
- Speaker change detection using SincNet and an LSTM/Transformer☆46Updated 7 months ago
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using…☆28Updated last year
- Parallelized automatic corpus collection for ASR. Forked from https://github.com/EgorLakomkin/KTSpeechCrawler☆23Updated 3 years ago
- 🎹 pyannote + 🗒 notebook = pyannotebook☆26Updated last year
- ☆56Updated 2 years ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.☆27Updated last year
- asr2k☆49Updated 8 months ago
- Audio processing using deep neural networks. Speaker identification using voice embeddings.☆13Updated 2 years ago
- Real-time Speech Separation, Noise Suppression & Speaker Recognition☆18Updated 5 years ago
- Speaker diarization service☆21Updated this week
- A mini, simple, and fast end-to-end automatic speech recognition toolkit.☆50Updated 2 years ago
- 24-hour Automatic Speech Recognition☆27Updated 3 years ago
- Reproducible experimental protocols for multimedia (audio, video, text) database☆96Updated last week
- Simple text to phonemes converter for multiple languages☆20Updated 2 years ago
- TTS Client for Coqui TTS server☆13Updated 2 years ago
- Swarah: Indian-English speech dataset collected across the country☆27Updated last year
- TurnGPT: a Transformer-based Language Model for Predicting Turn-taking in Spoken Dialog☆46Updated 9 months ago
- Uses machine learning to denoise audio containing speech☆31Updated 7 months ago
- Zero-Shot Foreign Accent Conversion without a Native Reference☆29Updated 9 months ago
- Code for AccentDB.☆20Updated 3 years ago
- Tunable pipelines☆31Updated last week
- Running Mozilla's implementation of Baidu DeepSpeech on Google Colaboratory☆16Updated 5 years ago
- Spoken Language Identification on Common Voice and AudioSet using Deep Learning☆37Updated 2 years ago
- Similarity Learning applied to Speaker Verification and Semantic Textual Similarity☆12Updated 4 years ago
- A simple voice conversion tool☆17Updated 2 years ago
- Deep Learning model for lexical stress detection in spoken English☆27Updated 4 years ago
- A python package for whisper normalizer☆47Updated 2 months ago
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech.☆13Updated last year
- Prabhupadavani: A Code-mixed Speech Translation Data for 25 languages☆13Updated 2 years ago