herrera-luis / vision-core-aiLinks
Demo python script app to interact with llama.cpp server using whisper API, microphone and webcam devices.
☆46Updated 2 years ago
Alternatives and similar repositories for vision-core-ai
Users that are interested in vision-core-ai are comparing it to the libraries listed below
Sorting:
- LLaVA server (llama.cpp).☆183Updated 2 years ago
- ☆158Updated 2 years ago
- Scripts to create your own moe models using mlx☆90Updated last year
- GRDN.AI app for garden optimization☆69Updated last month
- Command-line script for inferencing from models such as falcon-7b-instruct☆75Updated 2 years ago
- Port of Suno's Bark TTS transformer in Apple's MLX Framework☆86Updated last year
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre…☆23Updated last year
- Video+code lecture on building nanoGPT from scratch☆68Updated last year
- Inference of Large Multimodal Models in C/C++. LLaVA and others☆48Updated 2 years ago
- ☆175Updated 2 years ago
- ☆69Updated 9 months ago
- A collection of notebooks for the Hugging Face blog series (https://huggingface.co/blog).☆46Updated last year
- Joint speech-language model - respond directly to audio!☆371Updated last year
- Open-source Rewind.ai clone written in Rust and Vue running 100% locally with whisper.cpp☆51Updated 2 years ago
- ASR + diarization model server with speculative decoding☆63Updated last year
- Integrate an LLM copilot within your Keras model development workflow☆28Updated 2 years ago
- Command-line script for inferencing from models such as MPT-7B-Chat☆100Updated 2 years ago
- Chat to Compose Video☆197Updated last year
- A ggml (C++) re-implementation of tortoise-tts☆194Updated last year
- Fast approximate inference on a single GPU with sparsity aware offloading☆39Updated 2 years ago
- Port of Microsoft's BioGPT in C/C++ using ggml☆85Updated last year
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆100Updated last year
- 🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning☆161Updated last year
- ☆38Updated last year
- ☆127Updated 9 months ago
- Minimal, clean code implementation of RAG with mlx using gguf model weights☆53Updated last year
- All the world is a play, we are but actors in it.☆49Updated 5 months ago
- PlayHT Python SDK - AI Text-to-Speech Streaming & Voice Cloning API☆219Updated this week
- Local ML voice chat using high-end models.☆180Updated last month
- GGML implementation of BERT model with Python bindings and quantization.☆58Updated last year