Run llama.cpp on RunPod
☆26 · Sep 5, 2023 · Updated 2 years ago
Alternatives and similar repositories for llama-runpod
Users that are interested in llama-runpod are comparing it to the libraries listed below.
- DeepDip, a DRL Gym agent that plays no-press Diplomacy in BANDANA ☆13 · Jul 22, 2019 · Updated 6 years ago
- Quick hack job to allow use with SillyTavern. This works for me; some further updates are expected to expose more settings to SillyTavern ☆11 · May 30, 2024 · Updated last year
- ☆15 · Oct 26, 2023 · Updated 2 years ago
- Modal LLM llama.cpp-based model deployment as part of a series of Model as a Service (MaaS) ☆17 · Mar 23, 2026 · Updated last month
- A set of scripts using the FFmpeg library to process videos and audio ☆14 · Jul 2, 2021 · Updated 4 years ago
- ☆16 · Oct 4, 2024 · Updated last year
- ☆11 · Feb 20, 2025 · Updated last year
- A wannabe Ollama equivalent for Apple MLX models ☆86 · Mar 2, 2025 · Updated last year
- A Framework For Intelligence Farming ☆16 · Apr 3, 2025 · Updated last year
- Running Ollama with Runpod ☆64 · Jul 26, 2024 · Updated last year
- An MCP server implementation providing a standardized interface for LLMs to interact with the Atla API. ☆18 · Jul 21, 2025 · Updated 9 months ago
- Protocol for Augmented Memory of Project Artifacts (MCP compatible) - extended ☆24 · Jan 24, 2026 · Updated 3 months ago
- [ALPHA] Persist and recall information across any AI tooling, powered by SQLite + MCP + Local Embeddings ☆20 · Jun 12, 2025 · Updated 10 months ago
- CompChomper is a framework for measuring how LLMs perform at code completion. ☆21 · Apr 29, 2025 · Updated last year
- input aspect ratio, output dimensions ☆21 · Mar 13, 2026 · Updated last month
- MDX is just an extension of Markdown that allows you to import and write JSX in your markdown documents. ☆13 · Mar 16, 2023 · Updated 3 years ago
- A novel approach for transformer model introspection that enables saving, compressing, and manipulating internal thought states for advan… ☆31 · Mar 22, 2026 · Updated last month
- ☆22 · Sep 20, 2025 · Updated 7 months ago
- A streaming local chatbot ☆34 · Jul 3, 2025 · Updated 10 months ago
- Agentic BYOK Browser-Based Website Builder ☆44 · Updated this week
- ☆50 · Nov 17, 2025 · Updated 5 months ago
- Create embeddings with infinity as serverless endpoint ☆44 · Nov 21, 2025 · Updated 5 months ago
- A Field-Theoretic Approach to Unbounded Memory in Large Language Models ☆20 · Apr 15, 2025 · Updated last year
- Tools for the LLaMA language model ☆12 · Apr 4, 2023 · Updated 3 years ago
- Implementation ☆27 · Mar 22, 2025 · Updated last year
- ☆19 · Dec 9, 2023 · Updated 2 years ago
- RXJS wrapper for Gun Database ☆12 · Dec 10, 2017 · Updated 8 years ago
- Starting point to build your own custom serverless endpoint ☆132 · May 9, 2025 · Updated last year
- GitIngest VS Code Extension ☆23 · Mar 15, 2025 · Updated last year
- Generate a simple, elegant PDF/HTML/LaTeX resume from YAML ☆59 · Mar 12, 2026 · Updated last month
- A Swift container view controller to handle transitioning to a different child view controller. ☆20 · Apr 30, 2019 · Updated 7 years ago
- A containerized VS Code server environment with integrated Goose AI coding assistant. ☆31 · Mar 16, 2025 · Updated last year
- Ixmage component for Astro ☆13 · Jun 4, 2022 · Updated 3 years ago
- SPLAA is an AI assistant framework that utilizes voice recognition, text-to-speech, and tool-calling capabilities to provide a conversati… ☆29 · May 6, 2025 · Updated last year
- Host VMs, boot anywhere. ☆24 · Dec 28, 2025 · Updated 4 months ago
- ☆10 · Jul 15, 2022 · Updated 3 years ago
- A simple, observable code-writing agent builder in TypeScript. ☆33 · Apr 9, 2025 · Updated last year
- Nordic UART Service (NUS) console ☆13 · Aug 1, 2018 · Updated 7 years ago
- A deep dive into the MLX deep learning framework ☆23 · Jan 20, 2024 · Updated 2 years ago