A web-based calculator for estimating GPU memory requirements and maximum concurrent requests for self-hosted LLM inference.
☆46Jun 11, 2026Updated this week
Alternatives and similar repositories for selfhostllm
Users that are interested in selfhostllm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Interact with various LLMs in your browser (LangChain.js, Angular)☆17May 7, 2026Updated last month
- Recursive Self-Aggregation evals on ARC-AGI☆36Jan 26, 2026Updated 4 months ago
- ☆19Aug 23, 2025Updated 9 months ago
- ☆41May 26, 2026Updated 2 weeks ago
- vTPM with SGX protection☆12May 30, 2019Updated 7 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- libtpms / swtpm software emulation of a Trusted Platform Module (TPM 1.2 and TPM 2.0) compile script☆13Sep 16, 2020Updated 5 years ago
- ☆21Dec 9, 2025Updated 6 months ago
- Source code to accompany research paper on training multi token prediction language models using self-distillation.☆38Feb 21, 2026Updated 3 months ago
- Verify that any MCP server is running the intended and untampered code via hardware attestation.☆19May 20, 2026Updated 3 weeks ago
- ZFS pool scrubber and monitor script☆12Feb 15, 2013Updated 13 years ago
- k8s CSI driver for FastCFS☆13Mar 17, 2024Updated 2 years ago
- 📦 Repopack is a powerful tool that packs your entire repository into a single, AI-friendly file. Perfect for when you need to feed your …☆16Oct 15, 2024Updated last year
- SnapDocs - A Modern, Open-Source Document Workspace☆25Sep 7, 2025Updated 9 months ago
- ☆16Sep 22, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆21Jun 5, 2026Updated last week
- An intelligence layer grounding autonomous agents in verified, real-time knowledge at scale.☆75Mar 14, 2026Updated 3 months ago
- Javascript library to display a calendar like a gantt planning☆15Dec 17, 2018Updated 7 years ago
- <connect-it> is a web component that allows you to create various types of diagrams, such as flowcharts, mind maps, network diagrams, org…☆16Feb 28, 2023Updated 3 years ago
- Copy My Writing is a command-line tool for generating content based on your personal writing style.☆11Oct 12, 2025Updated 8 months ago
- The core repository for Katanemo's advanced function calling models with top-tier performance. Features three collections: Arch-Function …☆22Jun 23, 2025Updated 11 months ago
- Spellbound - your multilingual AI-powered writing assistant☆13May 12, 2025Updated last year
- An MCP server implementation that integrates with SearXNG, providing privacy-focused meta search capabilities.☆32May 11, 2025Updated last year
- Socratic-Zero is a fully autonomous framework that generates high-quality training data for mathematical reasoning☆36Oct 26, 2025Updated 7 months ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- A custom Huggingface trainer which supports logging auxiliary losses returned by your model☆15Jul 27, 2025Updated 10 months ago
- ☆32Aug 27, 2024Updated last year
- [ICML2025 Oral] LoRA-One: One-Step Full Gradient Could Suffice for Fine-Tuning Large Language Models, Provably and Efficiently☆32Oct 22, 2025Updated 7 months ago
- weavetui is a modern, robust, and modular Text User Interface (TUI) framework for Rust, built on top of ratatui and tokio☆24Mar 7, 2026Updated 3 months ago
- Evaluate how vLLM and SGLang perform when running a small LLM model on a mid-range NVIDIA GPU☆21May 31, 2026Updated 2 weeks ago
- Whim is a simple and secure app for sharing secret messages anonymously. The messages are encrypted and are vanished after being read. No…☆29May 26, 2026Updated 2 weeks ago
- 🧠 High-performance persistent memory system for Model Context Protocol (MCP) powered by libSQL. Features vector search, semantic knowled…☆85Jun 6, 2026Updated last week
- Enemies for your LLM☆37Jan 20, 2026Updated 4 months ago
- ☆39Aug 4, 2025Updated 10 months ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Crossword puzzles in your terminal.☆23Feb 4, 2026Updated 4 months ago
- BMAD skills and workflows for OpenAI Codex (App, CLI, Web): intent-based execution, YAML project state, and reusable skill packs for plan…☆36May 19, 2026Updated 3 weeks ago
- Simulating qubits in JavaScript☆12Apr 4, 2016Updated 10 years ago
- 🚀 Use Firecracker and helpings of bash to boot Ubuntu virtual machines very fast 🔥☆22Jul 15, 2023Updated 2 years ago
- tldw☆12Jul 5, 2025Updated 11 months ago
- kubectl plugin to isolate a pod from the service.☆10Jul 5, 2020Updated 5 years ago
- Command-line toolkit for interactive SQL and data manipulation on CSV, Parquet, JSON, and Avro files. Powered by Apache Arrow and DataFus…☆15May 2, 2025Updated last year