PranavPutsa1006 / Speaker-Diarization
Identifying individual speakers in an audio stream based on the unique characteristics found in individual voices using Python
☆18Updated last year
Alternatives and similar repositories for Speaker-Diarization:
Users that are interested in Speaker-Diarization are comparing it to the libraries listed below
- Speaker diarization service☆21Updated 3 weeks ago
- Reproducible experimental protocols for multimedia (audio, video, text) database☆100Updated 2 months ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆62Updated 3 weeks ago
- Speaker change detection using SincNet and an LSTM/Transformer☆50Updated 10 months ago
- 🎹 pyannote + 🗒 notebook = pyannotebook☆26Updated last year
- Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023.☆83Updated last year
- Prabhupadavani: A Code-mixed Speech Translation Data for 25 languages☆13Updated 2 years ago
- Create an LJSpeech structured voice dataset on wave input☆29Updated 7 months ago
- A free & open tool for transcribing audio interviews with offline ASR support☆24Updated last year
- Tunable pipelines☆33Updated 2 months ago
- This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speech…☆17Updated 2 years ago
- Zero-shot Audio Classification using Whisper☆80Updated 2 years ago
- Efficient approach to speaker diarization using voice characteristics extraction☆92Updated last year
- This repository contains the implementation of the paper: "Span Classification with Structured Information for Disfluency Detection in Sp…☆12Updated last year
- This project is about performing Speaker diarization for Hindi Language.☆49Updated 4 years ago
- ☆11Updated last month
- A simple voice conversion tool☆17Updated 3 years ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.☆27Updated last year
- PyTorch Implementation of Google's Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions. This implementation supp…☆48Updated last year
- TurnGPT: a Transformer-based Language Model for Predicting Turn-taking in Spoken Dialog☆48Updated 11 months ago
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using…☆28Updated last year
- A curated list of awesome voice activity detection☆50Updated 5 months ago
- A minimalistic automatic speech recognition streamlit based webapp powered by OpenAI's Whisper "State of the Art" models☆66Updated 2 years ago
- On-device speaker diarization powered by deep learning☆44Updated last month
- a simple system for 2-way interruptible voice interactions between human and LLM☆28Updated last year
- Real-time Speech Separation, Noise Suppression & Speaker Recognition☆18Updated 6 years ago
- asr2k☆50Updated 11 months ago
- Speaker diarization model☆27Updated 2 years ago
- Mispronunciation Detection using a pretrained and finetuned wav2vec2 model for phoneme recognition and diagnosis and feedback using large…☆20Updated 11 months ago
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo☆25Updated 2 years ago