herrera-luis / vision-core-ai
Demo python script app to interact with llama.cpp server using whisper API, microphone and webcam devices.
β46Updated last year
Alternatives and similar repositories for vision-core-ai:
Users that are interested in vision-core-ai are comparing it to the libraries listed below
- Scripts to create your own moe models using mlxβ89Updated last year
- GRDN.AI app for garden optimizationβ70Updated last year
- π Scale your RAG pipeline using Ragswift: A scalable centralized embeddings management platformβ37Updated last year
- LLaVA server (llama.cpp).β178Updated last year
- [WIP] AI Try-On plugin for Chromeβ27Updated last year
- All the world is a play, we are but actors in it.β47Updated this week
- β39Updated last year
- run ollama & gguf easily with a single commandβ49Updated 10 months ago
- Video+code lecture on building nanoGPT from scratchβ66Updated 9 months ago
- Gradio UI for a Cog APIβ66Updated 11 months ago
- A Python library to orchestrate LLMs in a neural network-inspired structureβ46Updated 5 months ago
- Inference of Large Multimodal Models in C/C++. LLaVA and othersβ46Updated last year
- auto fine tune of models with synthetic dataβ74Updated last year
- Open-source AI for voice control, rivaling Alexa and Siriβ12Updated last year
- β38Updated last year
- Port of Suno's Bark TTS transformer in Apple's MLX Frameworkβ78Updated last year
- Embed anything.β29Updated 9 months ago
- Cog wrapper for collabora/WhisperSpeechβ25Updated last year
- A collection of notebooks for the Hugging Face blog series (https://huggingface.co/blog).β44Updated 7 months ago
- β54Updated last year
- Testing and evaluating the capabilities of Vision-Language models (PaliGemma) in performing computer vision tasks such as object detectioβ¦β80Updated 9 months ago
- β20Updated last year
- Very basic framework for composable parameterized large language model (Q)LoRA / (Q)Dora fine-tuning using mlx, mlx_lm, and OgbujiPT.β37Updated 3 weeks ago
- Mistral-7B finetuned for function callingβ15Updated last year
- Transcribe and summarize videos using whisper and llms on apple mlx frameworkβ73Updated last year
- Deploy your GGML models to HuggingFace Spaces with Docker and gradioβ36Updated last year
- Python Server for C3 AI app. A project that brings the power of Large Language Models (LLM) and Retrieval-Augmented Generation (RAG) withβ¦β22Updated last year
- ASR + diarization model server with speculative decodingβ59Updated 10 months ago
- The one who calls upon functions - Function-Calling Language Modelβ36Updated last year
- β40Updated 11 months ago