EvilFreelancer / docker-llama.cpp-rpc
This project is based on llama.cpp and builds only the RPC server, along with the auxiliary utilities that run in RPC-client mode, which are needed for distributed inference of Large Language Models (LLMs) and embedding models converted to the GGUF format.
☆23Updated 8 months ago
Alternatives and similar repositories for docker-llama.cpp-rpc
Users who are interested in docker-llama.cpp-rpc are comparing it to the repositories listed below
- whisper.cpp HTTP transcription server with OpenAI-like API in Docker☆29Updated 6 months ago
- Dialoqbase Lite is a Chrome extension that offers a web-based UI and a side panel, Copilot, designed specifically for almost all AI provi…☆43Updated 9 months ago
- AI agent to automatically check grammar and spelling on documentation files☆95Updated 2 months ago
- OpenAPI-like API-server for voice generation (TTS) based on fish-speech-1.5 model.☆29Updated 8 months ago
- ☆31Updated last year
- A TypeScript library for building orchestrated framework-agnostic multi-agent AI systems☆33Updated 2 weeks ago
- Trim and timestamp audio, in the terminal☆14Updated last year
- An AI Vision Language Model System for extracting structured knowledge graph information(JSON) from images of process diagrams☆41Updated 10 months ago
- LLM Chat is an open-source serverless alternative to ChatGPT.☆36Updated last year
- A tool for an analysis of LLM generations.☆42Updated 3 months ago
- Hector RAG is a modular RAG framework built on PostgreSQL, offering advanced retrieval methods and fusion techniques for AI-driven applic…☆60Updated 11 months ago
- The hearth of The Pulsar App, fast, secure and shared inference with modern UI☆59Updated last year
- This project provides a Flask-based API for generating high-quality text-to-speech (TTS) audio using F5-TTS, a flexible and powerful TTS …☆14Updated 5 months ago
- Smart proxy for LLM APIs that enables model-specific parameter control, automatic mode switching (like Qwen3's /think and /no_think), and…☆50Updated 8 months ago
- fork of litellm that is open source☆21Updated 2 weeks ago
- LM Studio: RAG (Retrieval-Augmented Generation) Local LLM vs GPT-4☆21Updated 2 years ago
- A Multi-Agentic AI Assistant/Builder☆25Updated 2 weeks ago
- best llms in russian☆62Updated last year
- CLaMR: Contextualized Late-Interaction for Multimodal Content Retrieval☆23Updated 7 months ago
- Odin Runes, a java-based GPT client, facilitates interaction with your preferred GPT model right through your favorite text editor. There…☆86Updated last year
- From-scratch implementation of OpenAI's GPT-OSS model in Python. No Torch, No GPUs.☆108Updated 3 months ago
- A simple, interactive web tool to compare pricing and performance metrics of various AI models.☆16Updated last month
- ☆60Updated last month
- AI Coding assistant for large and complex codebases.☆157Updated 11 months ago
- My version of an LLM Websearch Agent using a local SearXNG server because SearXNG is great.☆39Updated last week
- Thin wrapper around OpenAI Whisper API with streaming support☆86Updated 2 months ago
- Dagger functions to import Hugging Face GGUF models into a local ollama instance and optionally push them to ollama.com.☆119Updated last year
- A tool for summarizing dialogues from videos or audio☆83Updated 2 years ago
- Download models from the Ollama library, without Ollama☆122Updated last year
- I’m trying to create something similar to Grammarly. Hail to open source!☆15Updated 8 months ago