Kosmos-2.5 is a cutting-edge Multimodal-LLM (MLLM) specializing in image OCR. However, its stringent software requirements & Python-script based invocation make it difficult to use for application development. Here, it has been containerized and made available via an API, greatly enhancing its ease-of-use.
☆68Jul 22, 2024Updated last year
Alternatives and similar repositories for kosmos-2_5-containerized
Users that are interested in kosmos-2_5-containerized are comparing it to the libraries listed below
Sorting:
- Visual Tagger is a JavaScript tool that visually highlights HTML elements for AIs, aiding in identifying interactive components on web pa…☆11Oct 28, 2024Updated last year
- LLM backed Fantasy Tribe Game☆19Nov 21, 2024Updated last year
- JotItNow is a AI Voice Notes App☆24Mar 6, 2025Updated last year
- My version of an LLM Websearch Agent using a local SearXNG server because SearXNG is great.☆41Jan 27, 2026Updated last month
- Personal voice assistant, with voice interruption and Twilio support☆18Feb 24, 2025Updated last year
- An fully autonomous agent that accesses the browser and performs tasks.☆17Apr 25, 2025Updated 10 months ago
- ☆23Updated this week
- Proteus is an experimental platform that combines the power of Large Language Models with the Genesis physics engine☆26Dec 20, 2024Updated last year
- ☆15Apr 9, 2025Updated 10 months ago
- A thin cython wrapper around llama.cpp, whisper.cpp and stable-diffusion.cpp☆16Feb 10, 2026Updated 3 weeks ago
- Yet another frontend for LLM, written using .NET and WinUI 3☆10Sep 14, 2025Updated 5 months ago
- ☆51Feb 19, 2025Updated last year
- A simple no-install web UI for Ollama and OAI-Compatible APIs!☆31Jan 30, 2025Updated last year
- ☆14Aug 25, 2024Updated last year
- Desktop application for instant AI-powered text transformation. Translate, correct, summarize, and change the tone of any text, anywhere,…☆28Dec 29, 2025Updated 2 months ago
- An interface that features barely zero external dependencies beyond the Ollama API itself, making it lightweight and portable to easily i…☆12Mar 25, 2025Updated 11 months ago
- ☆17Apr 22, 2024Updated last year
- Open source, local first AI medical scribe for desktop and web.☆76Updated this week
- An OpenAI API compatible images server to generate or manipulate images.☆17Feb 2, 2025Updated last year
- Crow is a Desktop AI Assistant☆32Aug 9, 2024Updated last year
- Mic-controlled mouse clicks☆17Oct 6, 2025Updated 5 months ago
- Open source Speechify alternative. Read PDFs and EPUBs with local models.☆38Nov 14, 2025Updated 3 months ago
- LLamaHTML is a simple html file to communicate with a running llamacpp llama-server☆22Aug 5, 2025Updated 7 months ago
- ☆18Aug 19, 2025Updated 6 months ago
- Model Context Protocol (MCP) server that provides a flexible and configurable two-stage reasoning and response generation system☆14Mar 4, 2025Updated last year
- An application for running LLMs locally on your device, with your documents, facilitating detailed citations in generated responses.☆631Oct 29, 2024Updated last year
- 🤖 AI-powered CLI for file reorganization. Runs fully locally — no data leaves your machine.☆20Jul 2, 2025Updated 8 months ago
- ☆21Dec 22, 2024Updated last year
- These agents work based on any local model. You ask your question and simply indicate the number of agents and experts who will answer it…☆19Feb 25, 2024Updated 2 years ago
- Physics Master is a model fine-tuned from llama3-8B-Instruct. It can answer your physics question!☆16Aug 24, 2024Updated last year
- ☆19Jul 4, 2025Updated 8 months ago
- A novel media player that allows you to navigate by speaker☆89Dec 22, 2025Updated 2 months ago
- ☆17Dec 16, 2024Updated last year
- An extension for oobabooga/text-generation-webui that automatically unloads and reloads your model.☆17Apr 22, 2024Updated last year
- Easily view and modify JSON datasets for large language models☆87May 16, 2025Updated 9 months ago
- Local first speech AI engine for transcription, TTS, and voice workflows.☆151Updated this week
- Building synthetic data for preference tuning☆27Dec 26, 2024Updated last year
- Benchmarking LLMs as Casual Card Game AIs☆20Jan 22, 2025Updated last year
- Chat with AI using whisper, LLMs, and TTS☆24Jun 26, 2024Updated last year