abgulati / kosmos-2_5-containerizedView external linksLinks
Kosmos-2.5 is a cutting-edge Multimodal-LLM (MLLM) specializing in image OCR. However, its stringent software requirements & Python-script based invocation make it difficult to use for application development. Here, it has been containerized and made available via an API, greatly enhancing its ease-of-use.
☆67Jul 22, 2024Updated last year
Alternatives and similar repositories for kosmos-2_5-containerized
Users that are interested in kosmos-2_5-containerized are comparing it to the libraries listed below
Sorting:
- Visual Tagger is a JavaScript tool that visually highlights HTML elements for AIs, aiding in identifying interactive components on web pa…☆11Oct 28, 2024Updated last year
- LLM backed Fantasy Tribe Game☆19Nov 21, 2024Updated last year
- My version of an LLM Websearch Agent using a local SearXNG server because SearXNG is great.☆39Jan 27, 2026Updated 2 weeks ago
- JotItNow is a AI Voice Notes App☆24Mar 6, 2025Updated 11 months ago
- Personal voice assistant, with voice interruption and Twilio support☆18Feb 24, 2025Updated 11 months ago
- An fully autonomous agent that accesses the browser and performs tasks.☆17Apr 25, 2025Updated 9 months ago
- ☆23Dec 9, 2025Updated 2 months ago
- Proteus is an experimental platform that combines the power of Large Language Models with the Genesis physics engine☆25Dec 20, 2024Updated last year
- ☆15Apr 9, 2025Updated 10 months ago
- A thin cython wrapper around llama.cpp, whisper.cpp and stable-diffusion.cpp☆16Updated this week
- Yet another frontend for LLM, written using .NET and WinUI 3☆10Sep 14, 2025Updated 5 months ago
- ☆51Feb 19, 2025Updated 11 months ago
- Self-hosted AI medical scribe.☆66Jan 21, 2026Updated 3 weeks ago
- A simple no-install web UI for Ollama and OAI-Compatible APIs!☆31Jan 30, 2025Updated last year
- LLamaHTML is a simple html file to communicate with a running llamacpp llama-server☆22Aug 5, 2025Updated 6 months ago
- Desktop application for instant AI-powered text transformation. Translate, correct, summarize, and change the tone of any text, anywhere,…☆27Dec 29, 2025Updated last month
- ☆14Aug 25, 2024Updated last year
- An interface that features barely zero external dependencies beyond the Ollama API itself, making it lightweight and portable to easily i…☆12Mar 25, 2025Updated 10 months ago
- ☆17Apr 22, 2024Updated last year
- Crow is a Desktop AI Assistant☆32Aug 9, 2024Updated last year
- An OpenAI API compatible images server to generate or manipulate images.☆17Feb 2, 2025Updated last year
- Mic-controlled mouse clicks☆17Oct 6, 2025Updated 4 months ago
- Open source Speechify alternative. Read PDFs and EPUBs with local models.☆35Nov 14, 2025Updated 3 months ago
- ☆18Aug 19, 2025Updated 5 months ago
- Model Context Protocol (MCP) server that provides a flexible and configurable two-stage reasoning and response generation system☆14Mar 4, 2025Updated 11 months ago
- 🤖 AI-powered CLI for file reorganization. Runs fully locally — no data leaves your machine.☆19Jul 2, 2025Updated 7 months ago
- An application for running LLMs locally on your device, with your documents, facilitating detailed citations in generated responses.☆630Oct 29, 2024Updated last year
- ☆21Dec 22, 2024Updated last year
- LLM FX: A LLM Server Desktop Client free for everyone!☆33Dec 19, 2025Updated last month
- These agents work based on any local model. You ask your question and simply indicate the number of agents and experts who will answer it…☆19Feb 25, 2024Updated last year
- Physics Master is a model fine-tuned from llama3-8B-Instruct. It can answer your physics question!☆16Aug 24, 2024Updated last year
- ☆19Jul 4, 2025Updated 7 months ago
- A novel media player that allows you to navigate by speaker☆87Dec 22, 2025Updated last month
- An extension for oobabooga/text-generation-webui that automatically unloads and reloads your model.☆17Apr 22, 2024Updated last year
- ☆17Dec 16, 2024Updated last year
- Easily view and modify JSON datasets for large language models☆87May 16, 2025Updated 8 months ago
- Chat with AI using whisper, LLMs, and TTS☆24Jun 26, 2024Updated last year
- ☆24Jan 31, 2026Updated 2 weeks ago
- Benchmarking LLMs as Casual Card Game AIs☆20Jan 22, 2025Updated last year