linto-ai / linto-diarization
Speaker diarization service
☆21Updated last week
Alternatives and similar repositories for linto-diarization:
Users that are interested in linto-diarization are comparing it to the libraries listed below
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆61Updated this week
- Joint speech-language model - respond directly to audio!☆30Updated 11 months ago
- Open TTS models, built for streaming on the edge☆39Updated 3 weeks ago
- A lightweight Python library for running TTS models with a unified API.☆17Updated last month
- Faster Whisper ASR transcription with CTranslate2☆20Updated 5 months ago
- 🎹 pyannote + 🗒 notebook = pyannotebook☆26Updated last year
- Tunable pipelines☆32Updated last month
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.☆27Updated last year
- Speaker Diarization with Transformers☆64Updated 10 months ago
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using…☆28Updated last year
- Identifying individual speakers in an audio stream based on the unique characteristics found in individual voices using Python☆18Updated last year
- A multilingual phoneme recognizer capable of generalizing zero-shot to unseen phoneme inventories.☆21Updated 3 weeks ago
- Audio tokenization, in the fastest way possible!☆50Updated 7 months ago
- a simple system for 2-way interruptible voice interactions between human and LLM☆25Updated last year
- Code associated with the paper: CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition.☆14Updated last month
- Speaker change detection using SincNet and an LSTM/Transformer☆49Updated 9 months ago
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech.☆13Updated last year
- This is a fork of the original fairseq repository (version 0.12.2) with added classes for training mHuBERT-147.☆17Updated 4 months ago
- Zero-Shot Foreign Accent Conversion without a Native Reference☆33Updated 11 months ago
- C++ version of pyannote audio overlapped speech detection pipeline☆12Updated last year
- On-device speaker diarization powered by deep learning☆43Updated 3 weeks ago
- Voice activity detection and speaker gender segmentation audiovisual corpus☆10Updated 2 months ago
- OpenAI Whisper Prompt Examples☆52Updated last year
- A streaming whisper server for on-prem transcription☆20Updated 7 months ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆94Updated 11 months ago
- A curated list of awesome voice activity detection☆48Updated 4 months ago
- Coqui Inference Engine☆38Updated 3 years ago
- Use quantized versions of Whisper to speed up inference☆12Updated 5 months ago
- Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023.☆81Updated last year
- ☆10Updated last month