Picovoice / eagle
On-device speaker recognition engine powered by deep learning
☆35 Updated this week
Alternatives and similar repositories for eagle
Users that are interested in eagle are comparing it to the libraries listed below
- On-device streaming text-to-speech engine powered by deep learning ☆85 Updated this week
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster. ☆62 Updated last week
- Real-time Voice Activity Detection (VAD) with example use cases such as a simple voice bot and live (real-time) transcription ☆81 Updated last year
- On-device speaker diarization powered by deep learning ☆47 Updated this week
- Using FastChat-T5 Large Language Model, Vosk API for automatic speech recognition, and Piper for text-to-speech ☆119 Updated last year
- On-device noise suppression powered by deep learning ☆70 Updated last month
- Real-time processing and delivery of sentences from a continuous stream of characters or text chunks. ☆57 Updated last month
- Speaker diarization service ☆23 Updated last month
- A curated list of awesome voice activity detection ☆55 Updated 6 months ago
- Efficient approach to speaker diarization using voice characteristics extraction ☆94 Updated last year
- This is a Raspberry Pi 5 whisper C++ voice assistant - backwards compatible with Pi4 ☆21 Updated last year
- This is a repository that collects common audio noise reduction models, using Gradio to demonstrate the use of each model, which is very … ☆37 Updated 6 months ago
- Create an LJSpeech structured voice dataset on wave input ☆30 Updated 8 months ago
- On-device voice activity detection (VAD) powered by deep learning ☆217 Updated this week
- Faster Whisper ASR transcription with CTranslate2 ☆21 Updated 7 months ago
- This package is the Python implementation of Deepgram's WebVTT and SRT formatting. Given a transcription, this package can return a valid… ☆20 Updated 8 months ago
- Identifying individual speakers in an audio stream based on the unique characteristics found in individual voices using Python ☆18 Updated last year
- Pybind11 bindings for Whisper.cpp ☆57 Updated last week
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines ☆95 Updated last year
- Benchmark for Speech-to-Intent engines ☆17 Updated last year
- A WebRTC server that allows you to interact with an LLM using your speech and responds back with generated audio. ☆132 Updated 11 months ago
- Speech To Speech: an effort for an open-sourced and modular GPT-4o ☆62 Updated 7 months ago
- Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023. ☆83 Updated last year
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution. ☆52 Updated 5 months ago
- A streaming whisper server for on-prem transcription ☆20 Updated 9 months ago
- Speech to Speech conversation using the OpenAI RealTime API in Python 🐍 ☆26 Updated 6 months ago
- An automatic speech recognition API ☆60 Updated last week
- C++ library for converting text to phonemes for Piper