OneInterface / realtime-bakllava
llama.cpp with the BakLLaVA model describes what it sees
☆383 Updated last year
Alternatives and similar repositories for realtime-bakllava:
Users that are interested in realtime-bakllava are comparing it to the libraries listed below
- ☆277 Updated 9 months ago
- Generate Synthetic Data Using OpenAI, MistralAI or AnthropicAI ☆223 Updated last year
- Run inference on replit-3B code instruct model using CPU ☆154 Updated last year
- An Autonomous LLM Agent that runs on Wizcoder-15B ☆335 Updated 6 months ago
- A simple UI / Web / Frontend for MLX mlx-lm using Streamlit. ☆252 Updated 3 months ago
- An mlx project to train a base model on your whatsapp chats using (Q)Lora finetuning ☆167 Updated last year
- A multimodal, function calling powered LLM webui. ☆214 Updated 7 months ago
- ☆135 Updated last year
- ☆706 Updated last year
- Landmark Attention: Random-Access Infinite Context Length for Transformers QLoRA ☆123 Updated last year
- LLaVA server (llama.cpp). ☆180 Updated last year
- function calling-based LLM agents ☆285 Updated 7 months ago
- Fine tune SDXL on YouTube videos ☆175 Updated 8 months ago
- FastMLX is a high performance production ready API to host MLX models. ☆297 Updated last month
- A fast batching API to serve LLM models ☆182 Updated last year
- An AI assistant beyond the chat box. ☆328 Updated last year
- Scripts to create your own moe models using mlx ☆89 Updated last year
- Bespoke Automata is a GUI and deployment pipeline for making complex AI agents locally and offline ☆221 Updated 11 months ago
- ⚙️ Zero-Shot Autonomous Robots ☆114 Updated last year
- ☆113 Updated 4 months ago
- BabyAGI to run with GPT4All ☆249 Updated last year
- ☆78 Updated last year
- Port of Suno's Bark TTS transformer in Apple's MLX Framework ☆81 Updated last year
- SiLLM simplifies the process of training and running Large Language Models (LLMs) on Apple Silicon by leveraging the MLX framework. ☆264 Updated this week
- TheBloke's Dockerfiles ☆303 Updated last year
- The open-source implementation of Q*, achieved in context as a zero-shot reprogramming of the attention mechanism. (synthetic data) Updated 4 months ago
- Mac compatible Ollama Voice ☆479 Updated last year
- ☆154 Updated 9 months ago
- Falcon LLM ggml framework with CPU and GPU support ☆246 Updated last year
- Convenient wrapper for fine-tuning and inference of Large Language Models (LLMs) with several quantization techniques (GPTQ, bitsandbytes… ☆146 Updated last year