Run llama.cpp on RunPod
☆27Sep 5, 2023Updated 2 years ago
Alternatives and similar repositories for llama-runpod
Users that are interested in llama-runpod are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- DeepDip, a DRL Gym agent that plays no-press Diplomacy in BANDANA☆13Jul 22, 2019Updated 6 years ago
- Quick hack job to allow use with Sillytavern. This works for me, some further updates are expected to expose more settings to sillytavern☆11May 30, 2024Updated 2 years ago
- FreeSWITCH Event Socket Protocol client implementation with Elixir☆12Apr 7, 2026Updated 2 months ago
- ☆16Oct 4, 2024Updated last year
- Yet Another (LLM) Web UI, made with Gemini☆12Dec 25, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆11Feb 20, 2025Updated last year
- A wannabe Ollama equivalent for Apple MlX models☆83Mar 2, 2025Updated last year
- Trace LLM calls (and others) and visualize them in WandB, as interactive SVG or using a streaming local webapp☆14Feb 18, 2025Updated last year
- PyHOP is a simple Hierarchical Task Network (HTN) planner written in Python; here is a C++ port of PyHop.☆15Jul 17, 2021Updated 4 years ago
- UE5 MediaPipe free plugin motion capture and facial☆13Feb 25, 2023Updated 3 years ago
- convert a saved pytorch model to gguf and generate as much corresponding ggml c code as possible☆15Dec 19, 2023Updated 2 years ago
- An MCP server implementation providing a standardized interface for LLMs to interact with the Atla API.☆18Jul 21, 2025Updated 11 months ago
- JAX implementation of the T5 model: Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer☆24Jun 10, 2023Updated 3 years ago
- Protocol for Augmented Memory of Project Artifacts (MCP compatible) - extended☆25Jan 24, 2026Updated 5 months ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- An OpenAI API compatible FastAPI server that sits on top of the Anemll repo. Tested with Open WebUI.☆21Jan 21, 2026Updated 5 months ago
- Pose Asset with visemes for Epic's MetaHuman face skeleton☆54Jul 20, 2022Updated 3 years ago
- input aspect ratio, output dimensions☆21Mar 13, 2026Updated 3 months ago
- Serve FHIR standard in json, yml, etc.☆13Mar 17, 2019Updated 7 years ago
- A novel approach for transformer model introspection that enables saving, compressing, and manipulating internal thought states for advan…☆33Mar 22, 2026Updated 3 months ago
- ☆22Sep 20, 2025Updated 9 months ago
- A streaming local chatbot☆34Jul 3, 2025Updated 11 months ago
- An fully autonomous agent that accesses the browser and performs tasks.☆18Apr 25, 2025Updated last year
- Saas Landing Page is design inspire from https://uikit.to/saas-landing-pages/☆12Jun 18, 2022Updated 4 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Run jobs with now.sh☆19Dec 8, 2022Updated 3 years ago
- This repository hosts public protocols for the IBC project.☆11Feb 24, 2026Updated 4 months ago
- The React Material UI interface for NiiVue☆13Nov 15, 2023Updated 2 years ago
- Agentic BYOK Browser-Based Website Builder☆48Jun 15, 2026Updated 2 weeks ago
- ☆52Nov 17, 2025Updated 7 months ago
- Poetry binary builds☆21May 27, 2024Updated 2 years ago
- Fork of the differential privacy module of TF/models/research☆14Oct 9, 2019Updated 6 years ago
- ☆21Jan 25, 2025Updated last year
- Tools for the LLaMA language model☆12Apr 4, 2023Updated 3 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Ollama RAG Tutorials☆16Jul 2, 2024Updated last year
- ☆10Dec 9, 2022Updated 3 years ago
- Application web pédagogique pour expliquer le calcul de la taxe d'habitation 2017☆17May 27, 2019Updated 7 years ago
- An Erlang/Elixir port for scripting application logic in Lua. Works with Lua and LuaJIT.☆27Aug 12, 2023Updated 2 years ago
- Implementation☆27Mar 22, 2025Updated last year
- ☆11Nov 27, 2013Updated 12 years ago
- A meta-framework for self-improving LLMs with transparent reasoning☆42Dec 10, 2025Updated 6 months ago