daerese / stella-gpt
StellaGPT is an AI voice assistant in the form of a desktop application.
☆10Updated last year
Alternatives and similar repositories for stella-gpt
Users that are interested in stella-gpt are comparing it to the libraries listed below
Sorting:
- Real-time processing and delivery of sentences from a continuous stream of characters or text chunks.☆55Updated last month
- Bringing large-language models and chat to web browsers. Everything runs inside the browser with no server support.☆14Updated last year
- A curated list of awesome voice activity detection☆50Updated 5 months ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆62Updated last month
- Open TTS models, built for streaming on the edge☆41Updated 2 months ago
- ☆42Updated 3 weeks ago
- Use quantized versions of Whisper to speed up inference☆12Updated 7 months ago
- Joint speech-language model - respond directly to audio!☆30Updated last year
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution.☆52Updated 5 months ago
- AI Search engine☆12Updated 2 months ago
- Zeta implementation of a reusable and plug in and play feedforward from the paper "Exponentially Faster Language Modeling"☆16Updated 6 months ago
- A python library to find differences between audio and transcriptions☆20Updated last year
- Experimental sampler to make LLMs more creative☆31Updated last year
- On-device speaker diarization powered by deep learning☆45Updated last week
- Hub for Open Source AGiXT Extensions, Chains, Prompts, and Agents.☆17Updated last year
- Trying to build an all in one speech-text language model - a bit like GPT-4o☆22Updated 11 months ago
- VoiceBox neural network implementation☆107Updated 9 months ago
- NewsAgent is an enterprise-grade news aggregation agent designed to fetch, query, and summarize news from multiple sources at scale.☆17Updated 3 weeks ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆94Updated last year
- A character chat with integrated medium and long-term memory☆16Updated 4 months ago
- Yet Another (LLM) Web UI, made with Gemini☆12Updated 4 months ago
- A Voice Assistant in your Browser.☆21Updated this week
- A fast MP3 decoder for python, using minimp3☆28Updated 2 years ago
- Open Server is an OpenAI API Compatible Server for generating text, images, embeddings, and storing them in vector databases. It also inc…☆16Updated last year
- 🌼 Daisy-TTS: Simulating Wider Spectrum of Emotions via Prosody Embedding Decomposition☆15Updated last year
- Discord chatbot interface to train an LLM on user message history☆27Updated last year
- Official repository for the "Powerset multi-class cross entropy loss for neural speaker diarization" paper published in Interspeech 2023.☆83Updated last year
- Speaker diarization model☆27Updated 2 years ago
- Genie in the Box: Distill Whisper STT => Mistral-7B => Phind/Phind-CodeLlama-34B-v2 => GPT 3.5 => Coqui's TTS/OpenAI TTS☆16Updated last year
- A lightweight, efficient variation of the StyleTTS 2 text‐to‐speech model.☆16Updated last week