Run llama.cpp in a GPU accelerated Docker container
☆64 · Apr 3, 2026 · Updated last month
Alternatives and similar repositories for llama-cpp-docker
Users who are interested in llama-cpp-docker are comparing it to the repositories listed below.
- ☆22 · Feb 5, 2025 · Updated last year
- ☆12 · Oct 22, 2023 · Updated 2 years ago
- [Equim's personal fork. As it diverges more and more, changes are brought in with cherry-pick rather than merge] A ChatGPT demo web page built with Express and Vue3 · ☆10 · Jun 20, 2023 · Updated 2 years ago
- A QR code scanner that uses the back camera by default · ☆15 · Aug 10, 2025 · Updated 8 months ago
- Enemies for your LLM · ☆35 · Jan 20, 2026 · Updated 3 months ago
- Test-Time Memory Framework: Control Hallucinations in Foundation Models · ☆11 · Nov 4, 2025 · Updated 6 months ago
- File monitor for wafer maps, tester files, or just about anything. · ☆13 · Aug 15, 2018 · Updated 7 years ago
- ☆28 · Mar 30, 2026 · Updated last month
- This repo has moved! See new URL in README or below · ☆30 · May 24, 2024 · Updated last year
- imageC / EVAnalyzer2 - High throughput biological image processor · ☆10 · Apr 24, 2026 · Updated last week
- Node.js module providing inference APIs for large language models, with a simple CLI. · ☆23 · Dec 7, 2024 · Updated last year
- Course Materials for ML Course at Tsinghua · ☆28 · Dec 17, 2019 · Updated 6 years ago
- ☆19 · Jun 20, 2025 · Updated 10 months ago
- An MCP server implementation providing a standardized interface for LLMs to interact with the Atla API. · ☆18 · Jul 21, 2025 · Updated 9 months ago
- Build a photographic mosaic! Supports various RGB Euclidean color difference algorithms · ☆11 · Nov 4, 2020 · Updated 5 years ago
- ComfyUI-Direct3D‑S2 is now available in ComfyUI, Direct3D‑S2 - Gigascale 3D Generation Made Easy with Spatial Sparse Attention. Direct3D‑… · ☆17 · Jun 10, 2025 · Updated 10 months ago
- Simple Bayesian Belief Network inference library using approximate and exact methods for Java. · ☆19 · Nov 16, 2022 · Updated 3 years ago
- A GPU-accelerated image generation toolkit for building image tiles and MRF files on-demand from Earth science data · ☆15 · Oct 5, 2023 · Updated 2 years ago
- ☆10 · Nov 17, 2024 · Updated last year
- Resources and links for OCR and Greek · ☆10 · Mar 8, 2021 · Updated 5 years ago
- [ICLR 2025] Large (Vision) Language Models are Unsupervised In-Context Learners · ☆22 · Jun 6, 2025 · Updated 10 months ago
- Flexible and transparent Python Boruta implementation · ☆15 · Jun 8, 2025 · Updated 10 months ago
- Ask questions over your Notion database! A naive Retrieval-Augmented Generation (RAG) pipeline backed by LangChain and Streamlit · ☆39 · Dec 30, 2023 · Updated 2 years ago
- ☆21 · Jul 28, 2020 · Updated 5 years ago
- A port of the General Polygon Clipper · ☆17 · Sep 13, 2022 · Updated 3 years ago
- ☆12 · Apr 29, 2024 · Updated 2 years ago
- OpenAI and Gemini Pro proxy service · ☆33 · Apr 16, 2024 · Updated 2 years ago
- ☆18 · Apr 22, 2026 · Updated last week
- ☆28 · Nov 10, 2025 · Updated 5 months ago
- A fast minimalistic implementation of guided generation on Apple Silicon using Outlines and MLX · ☆59 · Feb 9, 2024 · Updated 2 years ago
- This fully reconfigurable action validates conformity with Azure Developer CLI template standards. · ☆22 · Apr 8, 2026 · Updated 3 weeks ago
- A forest of autonomous agents. · ☆20 · Jan 27, 2025 · Updated last year
- MCP server for the Delinea Secret Server and Platform APIs · ☆46 · Apr 25, 2026 · Updated last week
- TUI kanban board for orchestrating AI coding agents · ☆97 · Jan 28, 2026 · Updated 3 months ago
- [ICLR 2026] Information Gain-based Policy Optimization: A Simple and Effective Approach for Multi-Turn Search Agents · ☆61 · Apr 23, 2026 · Updated last week
- The benchmark for LLMs designed to tackle one of the most knowledge-intensive tasks in data science: writing feature engineering code, wh… · ☆21 · Oct 28, 2024 · Updated last year
- Measuring RAG solutions throughput and latency · ☆20 · Jul 23, 2024 · Updated last year
- See also APPL (https://github.com/appl-team/appl), which improves on this project. A Python package for writing Language Models prompts in a ne… · ☆41 · Oct 2, 2023 · Updated 2 years ago
- Quick Notebook Tutorials · ☆36 · Jul 17, 2025 · Updated 9 months ago