matatonic / openedai-visionLinks
An OpenAI API compatible API for chat with image input and questions about the images. aka Multimodal.
☆266Updated 11 months ago
Alternatives and similar repositories for openedai-vision
Users that are interested in openedai-vision are comparing it to the libraries listed below
Sorting:
- automatically quant GGUF models☆219Updated last month
- ☆209Updated last month
- A multimodal, function calling powered LLM webui.☆216Updated last year
- AI stack for interacting with LLMs, Stable Diffusion, Whisper, xTTS and many other AI models☆168Updated last year
- Efficient visual programming for AI language models☆360Updated 8 months ago
- Open source LLM UI, compatible with all local LLM providers.☆177Updated last year
- Simple UI for Llama-3.2-11B-Vision & Molmo-7B-D☆134Updated last year
- A fast batching API to serve LLM models☆189Updated last year
- ☆109Updated 5 months ago
- Dagger functions to import Hugging Face GGUF models into a local ollama instance and optionally push them to ollama.com.☆119Updated last year
- Dataset Crafting w/ RAG/Wikipedia ground truth and Efficient Fine-Tuning Using MLX and Unsloth. Includes configurable dataset annotation …☆193Updated last year
- A frontend for creative writing with LLMs☆146Updated last year
- A pipeline parallel training script for LLMs.☆166Updated 9 months ago
- A real-time speech-to-speech chatbot powered by Whisper Small, Llama 3.2, and Kokoro-82M.☆244Updated last year
- A local AI companion that uses a collection of free, open source AI models in order to create two virtual companions that will follow you…☆240Updated 3 months ago
- Realtime tts reading of large textfiles by your favourite voice. +Translation via LLM (Python script)☆52Updated last year
- ☆134Updated 2 months ago
- Web UI for ExLlamaV2☆513Updated last year
- Easily view and modify JSON datasets for large language models☆87Updated 8 months ago
- AI Powered search tool offers content-based, text, and visual similarity system-wide search.☆276Updated 8 months ago
- A Conversational Speech Generation Model with Gradio UI and OpenAI compatible API. UI and API support CUDA, MLX and CPU devices.☆211Updated 9 months ago
- An extension for oobabooga/text-generation-webui that enables the LLM to search the web☆275Updated last month
- ☆83Updated 11 months ago
- This extension enhances the capabilities of textgen-webui by integrating advanced vision models, allowing users to have contextualized co…☆57Updated last year
- Python package wrapping llama.cpp for on-device LLM inference☆100Updated 3 months ago
- After my server ui improvements were successfully merged, consider this repo a playground for experimenting, tinkering and hacking around…☆53Updated last year
- An AI assistant beyond the chat box.☆329Updated last year
- AI management tool☆119Updated last year
- This is the Mixture-of-Agents (MoA) concept, adapted from the original work by TogetherAI. My version is tailored for local model usage a…☆118Updated last year
- Serving LLMs in the HF-Transformers format via a PyFlask API☆72Updated last year