Picovoice / browser-extension

Picovoice Browser Extension

☆14

Alternatives and similar repositories for browser-extension:

Users that are interested in browser-extension are comparing it to the libraries listed below

Picovoice / orca
On-device streaming text-to-speech engine powered by deep learning
☆73Updated this week
Picovoice / speech-to-intent-benchmark
benchmark for Speech-to-Intent engines
☆15Updated 9 months ago
Picovoice / eagle
On-device speaker recognition engine powered by deep learning
☆33Updated this week
lalanikarim / webrtc-ai-voice-chat
A WebRTC server that allows you to interact with an LLM using your speech and responds back with generated audio.
☆126Updated 9 months ago
Picovoice / falcon
On-device speaker diarization powered by deep learning
☆39Updated this week
solyarisoftware / CoquiSTTJs
Coqui STT offline engine API for NodeJs developers. With a simple HTTP ASR server.
☆27Updated 3 years ago
Picovoice / cobra
On-device voice activity detection (VAD) powered by deep learning
☆202Updated this week
KingNish24 / Realtime-whisper-large-v3-turbo
☆42Updated 5 months ago
linto-ai / linto-diarization
Speaker diarization service
☆21Updated last month
coqui-ai / STT-examples
🐸STT integration examples
☆126Updated 2 years ago
Picovoice / koala
On-device noise suppression powered by deep learning
☆68Updated this week
KoljaB / WhoSpeaks
Efficient approach to speaker diarization using voice characteristics extraction
☆92Updated 11 months ago
thorstenMueller / Audio-to-Voice-Dataset
Create an LJSpeech structured voice dataset on wave input
☆26Updated 5 months ago
rhasspy / piper-phonemize
C++ library for converting text to phonemes for Piper
☆111Updated last year
linto-ai / linto-stt
An automatic speech recognition API
☆54Updated this week
solarsamuel / pi5_whisper_voice_assistant
This is a Raspberry Pi 5 whisper C++ voice assistant - backwards compatible with Pi4
☆20Updated last year
daily-demos / llm-talk
Talk to GPT-4 and create a story together.
☆88Updated last year
pyannote / AMI-diarization-setup
☆39Updated last year
revdotcom / reverb-self-hosted
This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution.
☆53Updated 3 months ago
revolutionarybukhari / ai-calling-agent
I have used technologies like Twilio , openai , pinecone , Mongodb, to make an automated calling agent for both inbound and outbound call…
☆17Updated 6 months ago
coqui-ai / whisperX
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
☆45Updated 8 months ago
ngbala6 / Audio-Processing
This repo is for Audio Processing Techniques and the Silence Remove using Python
☆17Updated 4 years ago
clement-pages / gryannote
Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.
☆60Updated last week
coqui-ai / data-checker
🫠 check your data, before you wreck your model
☆16Updated 2 years ago
Mobile-Artificial-Intelligence / babylon.cpp
Babylon.cpp is a C and C++ library for grapheme to phoneme conversion and text to speech synthesis. For phonemization a ONNX runtime port…
☆16Updated 6 months ago
AGIPrime / Theraxus
Theraxus AI: A modular conversational AI platform ⚙️ blending STT 🎙️, TTS 🗣️, and RAG 📚 for seamless, context-aware dialogues and huma…
☆25Updated 4 months ago
sanchit-gandhi / notebooks
A collection of notebooks for the Hugging Face blog series (https://huggingface.co/blog).
☆44Updated 7 months ago
akiani / aidialer
A full stack app for interruptible, low-latency and near-human quality AI phone calls built from stitching LLMs, speech understanding too…
☆123Updated 7 months ago
synesthesiam / coqui-docker
Docker images for Coqui AI
☆57Updated 3 years ago