Run llama.cpp on RunPod
☆26Sep 5, 2023Updated 2 years ago
Alternatives and similar repositories for llama-runpod
Users that are interested in llama-runpod are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- AI Coders search blindly. Be their guide.☆37Mar 3, 2026Updated 2 months ago
- Yet Another (LLM) Web UI, made with Gemini☆12Dec 25, 2024Updated last year
- ☆11Feb 20, 2025Updated last year
- A wannabe Ollama equivalent for Apple MlX models☆86Mar 2, 2025Updated last year
- Trace LLM calls (and others) and visualize them in WandB, as interactive SVG or using a streaming local webapp☆14Feb 18, 2025Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- PyHOP is a simple Hierarchical Task Network (HTN) planner written in Python; here is a C++ port of PyHop.☆15Jul 17, 2021Updated 4 years ago
- UE5 MediaPipe free plugin motion capture and facial☆13Feb 25, 2023Updated 3 years ago
- convert a saved pytorch model to gguf and generate as much corresponding ggml c code as possible☆15Dec 19, 2023Updated 2 years ago
- An MCP server implementation providing a standardized interface for LLMs to interact with the Atla API.☆18Jul 21, 2025Updated 10 months ago
- WorldSense benchmark for grounded reasoning in language models☆24Nov 28, 2023Updated 2 years ago
- One stop shop - Local-first RAG stack with intelligent polyglot-code/docs, remote code execution, local llama enrichment, progressive dis…☆34Feb 17, 2026Updated 3 months ago
- An OpenAI API compatible FastAPI server that sits on top of the Anemll repo. Tested with Open WebUI.☆21Jan 21, 2026Updated 4 months ago
- input aspect ratio, output dimensions☆21Mar 13, 2026Updated 2 months ago
- fast + parallel AlphaZero in PyTorch☆15Jan 21, 2024Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- A novel approach for transformer model introspection that enables saving, compressing, and manipulating internal thought states for advan…☆33Mar 22, 2026Updated 2 months ago
- A streaming local chatbot☆34Jul 3, 2025Updated 10 months ago
- An fully autonomous agent that accesses the browser and performs tasks.☆18Apr 25, 2025Updated last year
- Saas Landing Page is design inspire from https://uikit.to/saas-landing-pages/☆12Jun 18, 2022Updated 3 years ago
- Django-based visualiser for tournaments for the boardgame Diplomacy☆10Apr 29, 2026Updated last month
- Agentic BYOK Browser-Based Website Builder☆45Updated this week
- ☆51Nov 17, 2025Updated 6 months ago
- Poetry binary builds☆22May 27, 2024Updated 2 years ago
- Create embeddings with infinity as serverless endpoint☆45Nov 21, 2025Updated 6 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- A Field-Theoretic Approach to Unbounded Memory in Large Language Models☆20Apr 15, 2025Updated last year
- ☆21Jan 25, 2025Updated last year
- Ollama RAG Tutorials☆16Jul 2, 2024Updated last year
- Tools for the LLaMA language model☆12Apr 4, 2023Updated 3 years ago
- Implementation☆27Mar 22, 2025Updated last year
- A meta-framework for self-improving LLMs with transparent reasoning☆38Dec 10, 2025Updated 5 months ago
- A dataset of news headlines for detecting causalities☆14May 9, 2022Updated 4 years ago
- A Python reimplementation + extension of "Planning with Large Language Models for Code Generation" (https://arxiv.org/abs/2303.05510)☆17Dec 1, 2023Updated 2 years ago
- common go code☆14Feb 22, 2021Updated 5 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆21Sep 11, 2023Updated 2 years ago
- ☆19Dec 9, 2023Updated 2 years ago
- GitIngest VS Code Extension☆23Mar 15, 2025Updated last year
- Starting point to build your own custom serverless endpoint☆135May 9, 2025Updated last year
- Official implementation repository for the paper Towards General Conceptual Model Editing via Adversarial Representation Engineering.☆20Dec 6, 2024Updated last year
- SPLAA is an AI assistant framework that utilizes voice recognition, text-to-speech, and tool-calling capabilities to provide a conversati…☆29May 6, 2025Updated last year
- Host VMs, boot anywhere.☆24Dec 28, 2025Updated 5 months ago