Run llama.cpp in a GPU accelerated Docker container
☆65Apr 3, 2026Updated 3 months ago
Alternatives and similar repositories for llama-cpp-docker
Users that are interested in llama-cpp-docker are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆22Feb 5, 2025Updated last year
- Degit: Decentralized version control with rewards☆12Mar 16, 2022Updated 4 years ago
- Enemies for your LLM☆38Jan 20, 2026Updated 5 months ago
- Test-Time Memory Framework: Control Hallucinations in Foundation Models☆11Nov 4, 2025Updated 8 months ago
- A C++ implementation of tinyllama inference on CPU.☆17Feb 28, 2024Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Code and instructions for deploying a smaller open-source Language Large Model (LLM) on AWS Lambda, using Python, Docker.☆10Jun 13, 2024Updated 2 years ago
- Publish local LLMs and LLM apps on the internet.☆27Aug 17, 2025Updated 10 months ago
- This repo has moved! See new URL in README or below☆30May 24, 2024Updated 2 years ago
- A lightweight, dependency-free, file-based NoSQL database for Python with simple collection/document APIs.☆19Mar 16, 2024Updated 2 years ago
- ☆19Jun 20, 2025Updated last year
- An MCP server implementation providing a standardized interface for LLMs to interact with the Atla API.☆17Jul 21, 2025Updated 11 months ago
- 🚀 LLM-I: Transform LLMs into natural interleaved multimodal creators! ✨ Tool-use framework supporting image search, generation, code ex…☆41Oct 20, 2025Updated 8 months ago
- resources, links for OCR & greek☆11Mar 8, 2021Updated 5 years ago
- ☆10May 31, 2026Updated last month
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- [ICLR 2025] Large (Vision) Language Models are Unsupervised In-Context Learners☆22Jun 6, 2025Updated last year
- ☆12Apr 29, 2024Updated 2 years ago
- ☆19Apr 22, 2026Updated 2 months ago
- ☆28Jun 2, 2026Updated last month
- A demonstration of the paper NER Retriever: Zero-Shot Named Entity Retrieval with Type-Aware Embeddings☆39Sep 13, 2025Updated 9 months ago
- MCP server for the Delinea Secret Server and Platform APIs☆46Jun 27, 2026Updated last week
- The benchmark for LLMs designed to tackle one of the most knowledge-intensive tasks in data science: writing feature engineering code, wh…☆22Oct 28, 2024Updated last year
- Measuring RAG solutions throughput and latency☆20Jul 23, 2024Updated last year
- verl: Volcano Engine Reinforcement Learning for LLMs☆42Jun 23, 2025Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Generative AI playground using Ollama, OpenAI API and JavaScript. Try AI models in your browser!☆26Jun 22, 2026Updated last week
- [ICLR 2026] Information Gain-based Policy Optimization: A Simple and Effective Approach for Multi-Turn Search Agents☆115Apr 23, 2026Updated 2 months ago
- ☆80Feb 18, 2026Updated 4 months ago
- Implementation of Paper: Long-term Forecasting with TiDE: Time-series Dense Encoder☆21Nov 1, 2024Updated last year
- LLMs represent numbers on a helix and manipulate that helix to do addition.☆31Feb 4, 2025Updated last year
- Configurable Pipecat speech control assistant with configurable tools and sub-agent support☆38Sep 5, 2025Updated 10 months ago
- Search movies using RAG and LLMs☆19Sep 4, 2024Updated last year
- This repository contains a Multimodal Retrieval-Augmented Generation (RAG) Pipeline that integrates images, audio, and text for advanced …☆27Jan 19, 2025Updated last year
- Run Qwen3.5-35B-A3B with llama.cpp and openclaw on NVIDIA DGX Spark (GB10)☆72Mar 1, 2026Updated 4 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆16Oct 19, 2024Updated last year
- AI-powered product search platform built on Google Cloud Platform (GCP). Leverages Spanner's hybrid search capabilities (vector similarit…☆26Apr 28, 2026Updated 2 months ago
- On-device real-time RAG App built using Jina Reader, Mediapipe, Gemma 2b IT LLM.☆15Apr 15, 2024Updated 2 years ago
- ☆16Aug 29, 2024Updated last year
- treelite runtime binding in Rust☆12Jun 12, 2025Updated last year
- [ICLR 2026] Durian: Dual Reference Image-Guided Portrait Animation with Attribute Transfer☆44Apr 13, 2026Updated 2 months ago
- Run fast LLM Inference using Llama.cpp in Python☆19Jan 3, 2024Updated 2 years ago