A semantic caching layer for LLM apps. It is meant to cut down on repeated API calls even when users phrase the same request differently.
☆14 · Jul 3, 2025 · Updated 9 months ago
Alternatives and similar repositories for cachelm
Users that are interested in cachelm are comparing it to the libraries listed below.
- ☆29 · Mar 15, 2026 · Updated last month
- Asynchronous Python framework designed to streamline the development of event-driven services ☆34 · Feb 26, 2026 · Updated last month
- A WhatsApp marketing and messaging tool MCP (Model Control Protocol) service using Titanmind. Handles free-form messages (24hr window) an… ☆18 · Jul 16, 2025 · Updated 9 months ago
- Training an LLM to use a calculator with multi-turn reinforcement learning, achieving a **62% absolute increase in evaluation accuracy**. ☆70 · May 5, 2025 · Updated 11 months ago
- AI Agent for managing your Gmail account using natural language. ☆24 · Sep 22, 2025 · Updated 6 months ago
- Shared Memory Storage for Multi-Agent Systems ☆150 · Jul 2, 2025 · Updated 9 months ago
- ☆56 · Apr 7, 2026 · Updated last week
- BIZCircularProgressView is a subclass of UIView that adds round progress view with timer. ☆14 · Jan 5, 2016 · Updated 10 years ago
- Streamable HTTP based MCP server and Client demo with auto registry, Dockerfile setup and env. ☆19 · May 30, 2025 · Updated 10 months ago
- Modified Beam Search with periodical restart ☆12 · Sep 12, 2024 · Updated last year
- Natural language → shell command, just press TAB ☆34 · Mar 6, 2026 · Updated last month
- ☆27 · Sep 13, 2024 · Updated last year
- LLM-powered macOS automation agent. Control Mail, Calendar, Reminders via natural language using AppleScript. Telegram voice commands, br… ☆26 · Mar 31, 2026 · Updated 2 weeks ago
- ☆26 · Feb 15, 2026 · Updated 2 months ago
- Anthropic's Contextual Retrieval implementation with visual chunk comparison. Preview context enrichment before/after embedding. ☆27 · Sep 25, 2025 · Updated 6 months ago
- ☆40 · Sep 7, 2025 · Updated 7 months ago
- Unofficial Claude Code SDKs for Typescript and Python ☆15 · May 20, 2025 · Updated 10 months ago
- Yeet 88 agents at a problem and see what survives. ☆24 · Feb 5, 2026 · Updated 2 months ago
- A curated collection of persona-based mcp server & tool groupings. ☆36 · Sep 11, 2025 · Updated 7 months ago
- A production-ready multi-tenant RAG as a Service (RaaS) orchestrator ☆31 · Nov 10, 2025 · Updated 5 months ago
- Evolutionary Search for expert-level performance on any task with environmental feedback ☆14 · Oct 12, 2025 · Updated 6 months ago
- ☆12 · Aug 1, 2025 · Updated 8 months ago
- OpenCode Monitor is a desktop app for monitoring and interacting with OpenCode agents across multiple workspaces. ☆40 · Mar 8, 2026 · Updated last month
- Controllable Language Model Interactions in TypeScript ☆10 · May 17, 2024 · Updated last year
- Identify and automatically fix issues in shell scripts ☆15 · Nov 24, 2023 · Updated 2 years ago
- AI_Powered_Dev_Search_Engine ☆12 · Mar 10, 2024 · Updated 2 years ago
- A simple streamlit app to play with qwen3-2b-VL to perform OCR. Dockerized set up, tested with 3060 12 GB. ☆31 · Nov 23, 2025 · Updated 4 months ago
- Evolution of Discrete data with Reinforcement Learning ☆13 · Dec 8, 2019 · Updated 6 years ago
- A sleek, customizable interface for managing LLMs with responsive design and easy agent personalization. ☆17 · Aug 30, 2024 · Updated last year
- Professional Wargaming LLM Toolbox ☆21 · Jul 9, 2025 · Updated 9 months ago
- ☆13 · Aug 12, 2024 · Updated last year
- A Kotlin based terminal command to interact with the OpenAI Assistants API in a slightly geeky way... ☆14 · Jun 23, 2024 · Updated last year
- A lightweight, high-performance text embedding model implemented in Rust. ☆76 · Mar 4, 2026 · Updated last month
- A pytest plugin to organize and track algorithm visualizations ☆18 · Dec 1, 2024 · Updated last year
- Ship agents you can audit. ☆88 · Nov 9, 2025 · Updated 5 months ago
- ☆34 · Aug 19, 2025 · Updated 7 months ago
- A silly and weirdly useful experiment where I attempt to encode one bit of information with a VAE ☆11 · Dec 31, 2016 · Updated 9 years ago
- Multi-vault, user-configured cloud hosted password manager ☆16 · Jun 22, 2025 · Updated 9 months ago
- ☆22 · Jun 10, 2025 · Updated 10 months ago