douglas125 / SpeechIdentity
Identity verification from speech
☆18 Updated 2 years ago
Alternatives and similar repositories for SpeechIdentity:
Users who are interested in SpeechIdentity are comparing it to the libraries listed below.
- Speaker diarization model ☆23 Updated last year
- Create an LJSpeech structured voice dataset on wave input ☆26 Updated 4 months ago
- speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with… ☆190 Updated last week
- Efficient approach to speaker diarization using voice characteristics extraction ☆88 Updated 9 months ago
- Transcription and diarization (speaker identification) ☆31 Updated last year
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster. ☆57 Updated last week
- Fine-tune and evaluate Whisper models for Automatic Speech Recognition (ASR) on custom datasets or datasets from huggingface. ☆286 Updated last year
- On-device voice activity detection (VAD) powered by deep learning ☆198 Updated this week
- Reproducible experimental protocols for multimedia (audio, video, text) database ☆96 Updated last week
- An automatic speech recognition API ☆49 Updated this week
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines ☆92 Updated 9 months ago
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code ☆145 Updated 9 months ago
- ONNX Inference of Pyannote Segmentation ☆80 Updated last month
- Finetune VITS and MMS using HuggingFace's tools ☆134 Updated 10 months ago
- Tools for making LJSpeech datasets ☆24 Updated last year
- 🌼 Daisy-TTS: Simulating Wider Spectrum of Emotions via Prosody Embedding Decomposition ☆15 Updated 11 months ago
- Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023. ☆80 Updated last year
- ☆273 Updated 8 months ago
- Speaker Diarization with Transformers ☆64 Updated 9 months ago
- On-device speaker diarization powered by deep learning ☆38 Updated last week
- Collection of Open Source Speech Data ☆151 Updated 3 months ago
- 🎧 | RunPod worker of the faster-whisper model for Serverless Endpoint. ☆84 Updated 2 weeks ago
- Diarization scoring tools. ☆235 Updated last year
- Companion repo for the paper "PixIT: Joint Training of Speaker Diarization and Speech Separation from Real-world Multi-speaker Recordings… ☆79 Updated last month
- The official Pytorch implementation of "Frame-wise streaming end-to-end speaker diarization with non-autoregressive self-attention-based … ☆114 Updated this week
- 🔊 Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset types. 🎧👥📊 Advanced audio processing. ☆235 Updated 8 months ago
- NVIDIA Riva runnable tutorials ☆124 Updated this week
- Variational Bayes HMM over x-vectors diarization ☆263 Updated last year
- On-device speaker recognition engine powered by deep learning ☆32 Updated this week
- Zero-shot Audio Classification using Whisper ☆78 Updated 2 years ago