Picovoice / eagle
On-device speaker recognition engine powered by deep learning
☆24Updated last week
Related projects:
- On-device noise suppression powered by deep learning☆59Updated 2 weeks ago
- On-device streaming text-to-speech engine powered by deep learning☆43Updated last week
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆34Updated last week
- Create an LJSpeech-structured voice dataset from wave input☆16Updated 2 months ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆81Updated 4 months ago
- On-device voice activity detection (VAD) powered by deep learning☆165Updated 2 weeks ago
- Your one-stop solution for voice dataset creation☆106Updated 9 months ago
- This package is the Python implementation of Deepgram's WebVTT and SRT formatting. Given a transcription, this package can return a valid…☆15Updated 7 months ago
- 🐸Coqui Dialogue Audio Pack contains more than 2000 audio files of synthetic human voices over dialogue created specifically for video ga…☆39Updated last year
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper.☆132Updated last year
- Coqui AI TTS plugin☆65Updated last week
- Zero-shot Audio Classification using Whisper☆74Updated last year
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of code☆132Updated 4 months ago
- whisper.cpp bindings for python☆68Updated last year
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo☆24Updated last year
- Code for OpenAI Whisper Web App Demo☆95Updated last year
- Tools for making LJSpeech datasets☆17Updated 7 months ago
- Efficient approach to speaker diarization using voice characteristics extraction☆56Updated 4 months ago
- Accelerate Whisper tasks such as transcription by multiprocessing through parallelization☆24Updated last year
- A voice assistant with WhisperAI speech recognition☆28Updated 2 weeks ago
- Speaker diarization model☆18Updated last year
- Transcription and diarization (speaker identification)☆26Updated last year
- ☆62Updated 4 months ago
- Open models for Coqui STT☆119Updated last year
- 🔊 Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset types. 🎧👥📊 Advanced audio processing.☆194Updated 3 months ago
- 🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning☆119Updated 2 months ago
- A python library to find differences between audio and transcriptions☆14Updated 10 months ago
- 😎 Awesome lists about Speech Emotion Recognition☆57Updated this week
- C++ library for converting text to phonemes for Piper☆78Updated 6 months ago
- This is a Raspberry Pi 5 whisper C++ voice assistant - backwards compatible with Pi4☆18Updated 9 months ago