⚡ Python-free Rust inference server — OpenAI-API compatible. GGUF + SafeTensors, hot model swap, auto-discovery, single binary. FREE now, FREE forever.
☆3,753 · Jan 16, 2026 · Updated last month
Alternatives and similar repositories for shimmy
Users that are interested in shimmy are comparing it to the libraries listed below
- Python tool for converting files and office documents to Markdown. ☆90,316 · Feb 20, 2026 · Updated 2 weeks ago
- Fast, flexible LLM inference ☆6,653 · Feb 27, 2026 · Updated last week
- The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement, running on consumer-g… ☆43,229 · Updated this week
- Simultaneous speech-to-text model ☆9,806 · Updated this week
- LLM-powered framework for deep document understanding, semantic retrieval, and context-aware answers using RAG paradigm. ☆13,231 · Updated this week
- Run frontier AI locally. ☆42,347 · Updated this week
- Minimalist ML framework for Rust ☆19,600 · Updated this week
- ☆28,296 · Jan 12, 2026 · Updated last month
- Transforms complex documents like PDFs into LLM-ready markdown/JSON for your Agentic workflows. ☆55,756 · Updated this week
- 🐬DeepChat - A smart assistant that connects powerful AI to your personal world ☆5,552 · Updated this week
- Production-ready platform for agentic workflow development. ☆131,572 · Updated this week
- Toolkit for linearizing PDFs for LLM datasets/training ☆16,979 · Updated this week
- Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train OpenAI gpt-oss, DeepSeek, Qwen, Llama, Gemma, TTS 2x faster with 70% less VRAM. ☆53,029 · Mar 3, 2026 · Updated last week
- Universal memory layer for AI Agents ☆48,604 · Updated this week
- Memory layer for AI Agents. Replace complex RAG pipelines with a serverless, single-file memory layer. Give your agents instant retrieval… ☆13,266 · Updated this week
- The conversational control layer for customer-facing AI agents - Parlant is a context-engineering framework optimized for controlling cus… ☆17,799 · Updated this week
- Official inference framework for 1-bit LLMs ☆28,697 · Feb 3, 2026 · Updated last month
- FlowGram is an extensible workflow development framework with built-in canvas, form, variable, and materials that helps developers build … ☆7,745 · Updated this week
- 🤱🏻 Turn any webpage into a desktop app with one command. ☆46,527 · Updated this week
- AI productivity studio with smart chat, autonomous agents, and 300+ assistants. Unified access to frontier LLMs ☆40,661 · Updated this week
- Distributed inference for mobile, desktop and server. ☆2,948 · Updated this week
- Jan is an open source alternative to ChatGPT that runs 100% offline on your computer. ☆40,860 · Updated this week
- The ultimate space for work and life, to find, build, and collaborate with agent teammates that grow with you. We are taking agent harne… ☆73,318 · Updated this week
- 🔥 MaxKB is an open-source platform for building enterprise-grade agents. A powerful, easy-to-use, open-source enterprise-grade agent platform. ☆20,227 · Mar 3, 2026 · Updated last week
- A simple screen parsing tool towards pure vision based GUI agent ☆24,448 · Sep 12, 2025 · Updated 5 months ago
- Build, run, manage agentic software at scale. ☆38,516 · Updated this week
- 🚀🚀 LLM: Train a 26M-parameter GPT from scratch in just 2h! ☆40,754 · Feb 6, 2026 · Updated last month
- A visual playground for agentic workflows: Iterate over your agents 10x faster ☆5,697 · Jul 20, 2025 · Updated 7 months ago
- Get up and running with Kimi-K2.5, GLM-5, MiniMax, DeepSeek, gpt-oss, Qwen, Gemma and other models. ☆164,248 · Updated this week
- A Python library for extracting structured information from unstructured text using LLMs with precise source grounding and interactive vi… ☆34,244 · Feb 25, 2026 · Updated last week
- 🌐 Make websites accessible for AI agents. Automate tasks online with ease. ☆79,644 · Mar 3, 2026 · Updated last week
- The Open-Source Multimodal AI Agent Stack: Connecting Cutting-Edge AI Models and Agent Infra ☆28,593 · Feb 27, 2026 · Updated last week
- Turso is an in-process SQL database, compatible with SQLite. ☆17,636 · Updated this week
- screenpipe turns your computer into a personal AI that knows everything you've done. record. search. automate. all local, all private, al… ☆17,168 · Updated this week
- A research prototype of a human-centered web agent ☆9,705 · Feb 12, 2026 · Updated 3 weeks ago
- A Gemini 2.5 Flash Level MLLM for Vision, Speech, and Full-Duplex Multimodal Live Streaming on Your Phone ☆24,027 · Feb 23, 2026 · Updated 2 weeks ago
- Autonomous coding agent right in your IDE, capable of creating/editing files, executing commands, using the browser, and more with your p… ☆58,756 · Updated this week
- 🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT) ☆6,787 · Jul 4, 2025 · Updated 8 months ago
- The all-in-one AI productivity accelerator. On device and privacy first with no annoying setup or configuration. ☆55,868 · Updated this week