linto-ai / linto-diarization
Speaker diarization service
ā19Updated this week
Related projects ā
Alternatives and complementary repositories for linto-diarization
- Tunable pipelinesā30Updated last month
- š¹ pyannote + š notebook = pyannotebookā25Updated last year
- Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023.ā71Updated last year
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.ā45Updated 2 weeks ago
- Stable timestamps and confidence score for words of OpenAI's Whisper outputs down to word-level.ā25Updated last year
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.ā27Updated 9 months ago
- Joint speech-language model - respond directly to audio!ā30Updated 6 months ago
- Final training script from HuggingFace Whisper Fine tuning event - to get best results on finetuned model.ā12Updated last year
- Speaker change detection using SincNet and an LSTM/Transformerā44Updated 4 months ago
- ā16Updated 3 years ago
- Speaker Diarization with Transformersā59Updated 6 months ago
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution.ā40Updated 3 weeks ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisperā99Updated last year
- Reproducible experimental protocols for multimedia (audio, video, text) databaseā84Updated last month
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GPā¦ā83Updated last month
- Identifying individual speakers in an audio stream based on the unique characteristics found in individual voices using Pythonā16Updated last year
- OpenAI Whisper Prompt Examplesā48Updated last year
- This is a fork of the original fairseq repository (version 0.12.2) with added classes for training mHuBERT-147.ā12Updated this week
- Audio tokenization, in the fastest way possible!ā45Updated 2 months ago
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech.ā13Updated last year
- ā11Updated 9 years ago
- Accelerate Whisper tasks such as transcription, by multiprocesing through parallelizationā25Updated 2 years ago
- Repository for fine-tuning Transformers š¤ based seq2seq speech models in JAX/Flax.ā34Updated last year
- ā54Updated this week
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelinesā84Updated 6 months ago
- A JAX library for building lattice-based speech transducer modelsā40Updated 3 weeks ago
- šÆ Speech Recognition Challenge by Speech Lab - IIT Madrasā11Updated 4 years ago
- Various speech datasets made available to the publicā99Updated last month
- Zero-shot Audio Classification using Whisperā74Updated last year
- A streaming whisper server for on-prem transcriptionā17Updated 3 months ago