Picovoice / browser-extension
Picovoice Browser Extension
☆14Updated last month
Alternatives and similar repositories for browser-extension:
Users that are interested in browser-extension are comparing it to the libraries listed below
- On-device streaming text-to-speech engine powered by deep learning☆73Updated this week
- benchmark for Speech-to-Intent engines☆15Updated 9 months ago
- On-device speaker recognition engine powered by deep learning☆33Updated this week
- A WebRTC server that allows you to interact with an LLM using your speech and responds back with generated audio.☆126Updated 9 months ago
- On-device speaker diarization powered by deep learning☆39Updated this week
- Coqui STT offline engine API for NodeJs developers. With a simple HTTP ASR server.☆27Updated 3 years ago
- On-device voice activity detection (VAD) powered by deep learning☆202Updated this week
- ☆42Updated 5 months ago
- Speaker diarization service☆21Updated last month
- 🐸STT integration examples☆126Updated 2 years ago
- On-device noise suppression powered by deep learning☆68Updated this week
- Efficient approach to speaker diarization using voice characteristics extraction☆92Updated 11 months ago
- Create an LJSpeech structured voice dataset on wave input☆26Updated 5 months ago
- C++ library for converting text to phonemes for Piper☆111Updated last year
- An automatic speech recognition API☆54Updated this week
- This is a Raspberry Pi 5 whisper C++ voice assistant - backwards compatible with Pi4☆20Updated last year
- Talk to GPT-4 and create a story together.☆88Updated last year
- ☆39Updated last year
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution.☆53Updated 3 months ago
- I have used technologies like Twilio , openai , pinecone , Mongodb, to make an automated calling agent for both inbound and outbound call…☆17Updated 6 months ago
- WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)☆45Updated 8 months ago
- This repo is for Audio Processing Techniques and the Silence Remove using Python☆17Updated 4 years ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆60Updated last week
- 🫠 check your data, before you wreck your model☆16Updated 2 years ago
- Babylon.cpp is a C and C++ library for grapheme to phoneme conversion and text to speech synthesis. For phonemization a ONNX runtime port…☆16Updated 6 months ago
- Theraxus AI: A modular conversational AI platform ⚙️ blending STT 🎙️, TTS 🗣️, and RAG 📚 for seamless, context-aware dialogues and huma…☆25Updated 4 months ago
- A collection of notebooks for the Hugging Face blog series (https://huggingface.co/blog).☆44Updated 7 months ago
- A full stack app for interruptible, low-latency and near-human quality AI phone calls built from stitching LLMs, speech understanding too…☆123Updated 7 months ago
- Docker images for Coqui AI☆57Updated 3 years ago