bentoml / llm-inference-handbookLinks
Everything you need to know about LLM inference
☆223Updated this week
Alternatives and similar repositories for llm-inference-handbook
Users that are interested in llm-inference-handbook are comparing it to the libraries listed below
Sorting:
- RAG Logger is an open-source logging tool designed specifically for Retrieval-Augmented Generation (RAG) applications. It serves as a lig…☆225Updated 8 months ago
- Content addressable storage with excellent search☆339Updated this week
- Securely run AI-generated code in stateful sandboxes that run forever.☆218Updated 4 months ago
- Parallel thinking for LLMs. Confidence‑gated, strategy‑driven, offline‑friendly☆146Updated last week
- High-Performance Implementation of OpenAI's TikToken.☆445Updated last month
- See Through Your Models☆400Updated last month
- This project collects GPU benchmarks from various cloud providers and compares them to fixed per token costs. Use our tool for efficient …☆220Updated 8 months ago
- LLM plugin for pulling content from Hacker News☆116Updated 3 months ago
- Your toolkit for autonomous, evolving agent ecosystems. Create, execute, govern, and evolve agents that learn from experience, collaborat…☆438Updated 2 weeks ago
- Pixelagent — Multimodal stateful agents☆216Updated 2 months ago
- A comprehensive Model Context Protocol (MCP) server implementing the latest specification.☆333Updated 2 months ago
- ☆142Updated this week
- An AI-generated book exploring how artificial intelligence development reveals hidden patterns in human cognition and communication☆69Updated 3 weeks ago
- Physical AI Assistant that illuminates your life☆151Updated last week
- CleverBee - The Open Source Deep Researcher Tool☆304Updated 2 months ago
- Your unified, shareable memory layer for AI apps. Compatible with Cursor, Claude Desktop, Claude Code, Gemini CLI, Windsurf, AWS's Kiro, …☆546Updated this week
- Git Based Memory Storage for Conversational AI Agent☆537Updated last week
- ☆151Updated last month
- A Python toolkit for chain-of-thought prompting 🐍☆172Updated this week
- Your filesystem as a vector database☆467Updated 4 months ago
- Animating R1's thoughts.☆384Updated 6 months ago
- EnrichMCP is a python framework for building data driven MCP servers☆601Updated this week
- Build Secure and Compliant AI agents and MCP Servers. YC W23☆149Updated 2 months ago
- A very simple tool to build LLM prompts from your code repositories.☆155Updated last month
- Applying the ideas of Deepseek R1 to computer use☆216Updated 6 months ago
- Heirarchical Navigable Small Worlds☆101Updated 3 weeks ago
- ☆197Updated 3 months ago
- Detect whether or not an audio file was generated by NotebookLM☆139Updated 9 months ago
- Minimal AI agent framework that just works with only seven tools.☆548Updated last month
- ai for jq☆244Updated 11 months ago