menloresearch / ichigoLinks

Local realtime voice AI

☆2,370

Alternatives and similar repositories for ichigo

Users that are interested in ichigo are comparing it to the libraries listed below

Sorting:

Standard-Intelligence / hertz-dev
first base model for full-duplex conversational audio
☆1,764Updated 9 months ago
ictnlp / LLaMA-Omni
LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve spee…
☆3,073Updated 4 months ago
moonshine-ai / moonshine
Fast and accurate automatic speech recognition (ASR) for edge devices
☆2,903Updated last month
fixie-ai / ultravox
A fast multimodal LLM for real-time voice
☆4,211Updated last month
pipecat-ai / smart-turn
☆961Updated 3 weeks ago
aiola-lab / whisper-medusa
Whisper with Medusa heads
☆859Updated 2 months ago
edwko / OuteTTS
Interface for OuteTTS models.
☆1,384Updated 3 months ago
kyutai-labs / hibiki
Hibiki is a model for streaming speech translation (also known as simultaneous translation). Unlike offline translation—where one waits f…
☆1,291Updated 5 months ago
mezbaul-h / june
Local voice chatbot for engaging conversations, powered by Ollama, Hugging Face Transformers, and Coqui TTS Toolkit
☆785Updated last year
facebookresearch / spiritlm
Inference code for the paper "Spirit-LM Interleaved Spoken and Written Language Model".
☆918Updated 11 months ago
sofi444 / realtime-transcription-fastrtc
Real Time Speech Transcription with FastRTC ⚡️and Local Whisper 🤗
☆683Updated 3 months ago
kyutai-labs / unmute
Make text LLMs listen and speak
☆904Updated last week
collabora / WhisperFusion
WhisperFusion builds upon the capabilities of WhisperLive and WhisperSpeech to provide a seamless conversations with an AI.
☆1,631Updated last year
lucasnewman / f5-tts-mlx
Implementation of F5-TTS in MLX
☆589Updated 6 months ago
lifeiteng / OmniSenseVoice
Omni SenseVoice: High-Speed Speech Recognition with words timestamps 🗣️🎯
☆867Updated 7 months ago
huggingface / smollm
Everything about the SmolLM and SmolVLM family of models
☆3,300Updated 3 weeks ago
lhl / voicechat2
Local SRT/LLM/TTS Voicechat
☆725Updated 11 months ago
KoljaB / RealtimeVoiceChat
Have a natural, spoken conversation with AI!
☆3,219Updated 3 months ago
WhisperSpeech / WhisperSpeech
An Open Source text-to-speech system built by inverting Whisper.
☆4,472Updated 4 months ago
dleemiller / WordLlama
Things you can do with the token embeddings of an LLM
☆1,448Updated 6 months ago
DigitalPhonetics / IMS-Toucan
Controllable and fast Text-to-Speech for over 7000 languages!
☆1,646Updated 3 months ago
mustafaaljadery / lightning-whisper-mlx
An extremely fast implementation of whisper optimized for Apple Silicon using MLX.
☆790Updated last year
Camb-ai / MARS5-TTS
MARS5 speech model (TTS) from CAMB.AI
☆2,794Updated last year
alexpinel / Dot
Text-To-Speech, RAG, and LLMs. All local!
☆1,830Updated 10 months ago
speaches-ai / speaches
☆2,472Updated this week
canopyai / Orpheus-TTS
Towards Human-Sounding Speech
☆5,603Updated 5 months ago
gabrielchua / open-notebooklm
Convert any PDF into a podcast episode!
☆2,476Updated 10 months ago
huggingface / speech-to-speech
Speech To Speech: an effort for an open-sourced and modular GPT4-o
☆4,197Updated 5 months ago
KoljaB / RealtimeTTS
Converts text to speech in realtime
☆3,563Updated 2 months ago
pipecat-ai / pipecat
Open Source framework for voice and multimodal conversational AI
☆8,317Updated this week