This project is based on llama.cpp and builds only the RPC server, along with the auxiliary utilities that run in RPC client mode. These are the components needed for distributed inference of Large Language Models (LLMs) and embedding models converted to the GGUF format.
☆24 · May 25, 2025 · Updated last year
Alternatives and similar repositories for docker-llama.cpp-rpc
Users interested in docker-llama.cpp-rpc are comparing it to the libraries listed below.
- whisper.cpp HTTP transcription server with OpenAI-like API in Docker ☆32 · Apr 5, 2026 · Updated 2 months ago
- ☆34 · Jan 25, 2026 · Updated 5 months ago
- Community guides and tips for xVASynth ☆17 · Jul 26, 2022 · Updated 3 years ago
- LLM inference in C/C++ ☆23 · Oct 4, 2024 · Updated last year
- deep learning based object detection using YOLOv3 with OpenCV ☆13 · Jul 21, 2020 · Updated 5 years ago
- Explore semantic caching to reduce your OpenAI/LLM API bill ☆11 · Jul 21, 2023 · Updated 2 years ago
- Graceful Shutdown Manager for Go ☆37 · Nov 22, 2024 · Updated last year
- openai-proxy-vercel ☆12 · Aug 11, 2023 · Updated 2 years ago
- Work with your business data using natural language ☆20 · Nov 20, 2024 · Updated last year
- Copilot with deepseek and more... ☆13 · Mar 7, 2025 · Updated last year
- Unified System Interface, framework, server, GUI and Remote API ☆15 · Jun 26, 2026 · Updated last week
- A PyTorch native library for large model training ☆29 · Apr 1, 2026 · Updated 3 months ago
- cursor logs with gpt-4o using litellm proxy ☆14 · Sep 9, 2025 · Updated 9 months ago
- A simple implementation of anti-spam bot for itmo opensource chat ☆11 · Sep 29, 2025 · Updated 9 months ago
- This project explores my adventures doing a deep dive of OpenAI embeddings with Neo4j during the Fixie AI + LLM Hackathon on Saturday, Se… ☆15 · Sep 19, 2023 · Updated 2 years ago
- rhasspy/piper Native Messaging host for TTS streaming ☆10 · Aug 17, 2025 · Updated 10 months ago
- ☆11 · Aug 5, 2024 · Updated last year
- ☆14 · Aug 22, 2024 · Updated last year
- Molecular Reinforcement Learning ☆14 · Mar 29, 2023 · Updated 3 years ago
- Modified Arena-Hard-Auto LLM evaluation toolkit with an emphasis on Russian language ☆47 · Mar 20, 2025 · Updated last year
- [python] First experience: Tetris game ☆11 · Apr 4, 2020 · Updated 6 years ago
- ☆14 · Mar 18, 2025 · Updated last year
- This is a simple example of how to run the android ADK feature on a basic Arduino Uno with USB Host Shield. ☆14 · May 24, 2011 · Updated 15 years ago
- ☆14 · Mar 6, 2024 · Updated 2 years ago
- This repository contains a ready-to-use boilerplate for quickly setting up and working with crewai. It provides essential configurations … ☆11 · Sep 11, 2024 · Updated last year
- A simple python script to follow stock market papers in your portfolio ☆12 · Jun 2, 2026 · Updated last month
- Build your own offline AI from any documents. Free. No coding. LoRA fine-tuning + RAG + GGUF export. ☆97 · Mar 21, 2026 · Updated 3 months ago
- ☆16 · Jan 23, 2025 · Updated last year
- A simple Python + Tkinter + Tesseract-based GUI image-to-text copypaste pad application ☆10 · Sep 14, 2023 · Updated 2 years ago
- Repository mirror of GitLab: https://gitlab.com/rosarior/awesome-django http://awesome-django.com ☆16 · Feb 22, 2018 · Updated 8 years ago
- Building AI Devops Assistant with Langchain, Postgres, and Ollama ☆13 · Jun 12, 2024 · Updated 2 years ago
- ☆22 · Oct 1, 2024 · Updated last year
- All-in-one Speech Transcription ☆11 · Jun 5, 2026 · Updated 3 weeks ago
- This project provides an API interface for AI chat functionality, utilizing the ai-chats.org service. It's built with FastAPI and can be … ☆16 · Jul 27, 2024 · Updated last year
- Python implementation and visualization of the Shazam Audio Search Algorithm. ☆10 · Oct 24, 2020 · Updated 5 years ago
- Pulsarego is a lightweight server monitoring tool written in Go, designed to continuously check the status of servers or web services and… ☆21 · Sep 20, 2024 · Updated last year
- Lucene Search Module for Magento ☆22 · Oct 10, 2010 · Updated 15 years ago
- Swift package for seamless audio recording and playback in iOS apps. ☆14 · Sep 27, 2024 · Updated last year
- Pure-PyTorch Parakeet TDT inference ☆48 · Mar 10, 2026 · Updated 3 months ago