A web-based calculator for estimating GPU memory requirements and maximum concurrent requests for self-hosted LLM inference.
☆46May 20, 2026Updated this week
Alternatives and similar repositories for selfhostllm
Users that are interested in selfhostllm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Lightweight agentic coding environment☆28Updated this week
- Code for the paper "Coding Agents with Multimodal Browsing are Generalist Problem Solvers"☆102Oct 27, 2025Updated 6 months ago
- Recursive Self-Aggregation evals on ARC-AGI☆36Jan 26, 2026Updated 4 months ago
- ☆19Aug 23, 2025Updated 9 months ago
- ☆40Mar 26, 2026Updated last month
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- vTPM with SGX protection☆11May 30, 2019Updated 6 years ago
- The official repo for "CodeScaler: Scaling Code LLM Training and Test-Time Inference via Execution-Free Reward Models"☆33Mar 26, 2026Updated 2 months ago
- libtpms / swtpm software emulation of a Trusted Platform Module (TPM 1.2 and TPM 2.0) compile script☆13Sep 16, 2020Updated 5 years ago
- Metadata Editor user and practice guide☆18May 8, 2026Updated 2 weeks ago
- Authenticated independently verifiable agent delegation.☆33Dec 17, 2025Updated 5 months ago
- Verify that any MCP server is running the intended and untampered code via hardware attestation.☆18Updated this week
- ZFS pool scrubber and monitor script☆12Feb 15, 2013Updated 13 years ago
- k8s CSI driver for FastCFS☆13Mar 17, 2024Updated 2 years ago
- ☆17Nov 26, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- SnapDocs - A Modern, Open-Source Document Workspace☆25Sep 7, 2025Updated 8 months ago
- ☆43Jan 16, 2026Updated 4 months ago
- Laravel Event CRUD with Full calendar☆12Jan 29, 2020Updated 6 years ago
- An intelligence layer grounding autonomous agents in verified, real-time knowledge at scale.☆74Mar 14, 2026Updated 2 months ago
- Saas Agency Management, CRM, Website Builder and Dashboard. Built with the latest Next.js and Typescript, this project creates a beautifu…☆18May 9, 2024Updated 2 years ago
- This is an example of Spatie Laravel Dashboard using Livewire and package components. This example has all settings extended to the `dash…☆22Nov 10, 2023Updated 2 years ago
- Copy My Writing is a command-line tool for generating content based on your personal writing style.☆11Oct 12, 2025Updated 7 months ago
- Production-ready Next.js SaaS starter with Auth.js, Drizzle, PostgreSQL, RBAC, admin UI, i18n, uploads and Docker.☆20May 18, 2026Updated last week
- this repository is used for web3.js priactice☆10Mar 24, 2022Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Spellbound - your multilingual AI-powered writing assistant☆13May 12, 2025Updated last year
- An MCP server implementation that integrates with SearXNG, providing privacy-focused meta search capabilities.☆31May 11, 2025Updated last year
- Socratic-Zero is a fully autonomous framework that generates high-quality training data for mathematical reasoning☆36Oct 26, 2025Updated 7 months ago
- A custom Huggingface trainer which supports logging auxiliary losses returned by your model☆15Jul 27, 2025Updated 9 months ago
- Mini-Projects using Cutting-Edge AI Frameworks☆15Apr 3, 2026Updated last month
- [ICML2025 Oral] LoRA-One: One-Step Full Gradient Could Suffice for Fine-Tuning Large Language Models, Provably and Efficiently☆31Oct 22, 2025Updated 7 months ago
- weavetui is a modern, robust, and modular Text User Interface (TUI) framework for Rust, built on top of ratatui and tokio☆24Mar 7, 2026Updated 2 months ago
- Evaluate how vLLM and SGLang perform when running a small LLM model on a mid-range NVIDIA GPU☆21May 10, 2026Updated 2 weeks ago
- Large DNNs training framework for consumer GPUs☆78May 18, 2026Updated last week
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Whim is a simple and secure app for sharing secret messages anonymously. The messages are encrypted and are vanished after being read. No…☆29Oct 22, 2025Updated 7 months ago
- Enemies for your LLM☆36Jan 20, 2026Updated 4 months ago
- ☆39Aug 4, 2025Updated 9 months ago
- 这是一套专为 Codex 适配的 Android 逆向分析 skill,支持在 Codex 会话中反编译 APK、XAPK、JAR、AAR,并结合 jadx、Fernflower/Vineflower 梳理 Manifest、包结构、网络层和调用链。它可辅助提取接口、U…☆112May 7, 2026Updated 2 weeks ago
- BMAD skills and workflows for OpenAI Codex (App, CLI, Web): intent-based execution, YAML project state, and reusable skill packs for plan…☆34May 19, 2026Updated last week
- Simulating qubits in JavaScript☆12Apr 4, 2016Updated 10 years ago
- 🚀 Use Firecracker and helpings of bash to boot Ubuntu virtual machines very fast 🔥☆22Jul 15, 2023Updated 2 years ago