Picovoice / eagle
On-device speaker recognition engine powered by deep learning
☆27Updated this week
Related projects ⓘ
Alternatives and complementary repositories for eagle
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆84Updated 6 months ago
- On-device streaming text-to-speech engine powered by deep learning☆56Updated 2 weeks ago
- On-device noise suppression powered by deep learning☆63Updated last month
- Open models for Coqui STT☆122Updated last year
- On-device voice activity detection (VAD) powered by deep learning☆179Updated last week
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆45Updated 2 weeks ago
- Efficient approach to speaker diarization using voice characteristics extraction☆68Updated 6 months ago
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution.☆40Updated 3 weeks ago
- Speech To Speech: an effort for an open-sourced and modular GPT4-o☆23Updated last month
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper.☆133Updated last year
- On-device speaker diarization powered by deep learning☆25Updated this week
- Create an LJSpeech structured voice dataset on wave input☆21Updated last month
- C++ library for converting text to phonemes for Piper☆89Updated 8 months ago
- A quick and optimized solution to manage llama based gguf quantized models, download gguf files, retreive messege formatting, add more mo…☆12Updated 10 months ago
- Open dubbing is an AI dubbing system which uses machine learning models to automatically translate and synchronize audio dialogue into di…☆63Updated this week
- Live transcription with OpenAi Whisper☆50Updated 2 years ago
- Speaker diarization service☆19Updated this week
- A minimalistic automatic speech recognition streamlit based webapp powered by OpenAI's Whisper "State of the Art" models☆65Updated 2 years ago
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo☆24Updated last year
- This package is the Python implementation of Deepgram's WebVTT and SRT formatting. Given a transcription, this package can return a valid…☆18Updated last month
- Self hosted high quality voice recognition for de-googled Android using whisper. Like Siri or OK Google.☆53Updated 10 months ago
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code☆141Updated 6 months ago
- AI-augmented, conversational information retrieval and data exploration☆37Updated 8 months ago
- Speaker diarization model☆20Updated last year
- A WebRTC server that allows you to interact with an LLM using your speech and responds back with generated audio.☆110Updated 5 months ago
- Tunable pipelines☆30Updated last month
- ☆87Updated 6 months ago
- Zero-shot Audio Classification using Whisper☆74Updated last year
- Runpod WhisperX Docker Container Repo☆11Updated 8 months ago
- Claudetools is a Python library that enables function calling with the Claude 3 family of language models from Anthropic.☆36Updated 3 months ago