EvilFreelancer / docker-llama.cpp-rpcLinks
Данный проект основан на llama.cpp и компилирует только RPC-сервер, а так же вспомогательные утилиты, работающие в режиме RPC-клиента, необходимые для реализации распределённого инференса конвертированных в GGUF формат Больших Языковых Моделей (БЯМ) и Эмб еддинговых Моделей.
☆23Updated 8 months ago
Alternatives and similar repositories for docker-llama.cpp-rpc
Users that are interested in docker-llama.cpp-rpc are comparing it to the libraries listed below
Sorting:
- whisper.cpp HTTP transcription server with OpenAI-like API in Docker☆28Updated 5 months ago
- Smart proxy for LLM APIs that enables model-specific parameter control, automatic mode switching (like Qwen3's /think and /no_think), and…☆50Updated 8 months ago
- Dialoqbase Lite is a Chrome extension that offers a web-based UI and a side panel, Copilot, designed specifically for almost all AI provi…☆43Updated 9 months ago
- This project provides a Flask-based API for generating high-quality text-to-speech (TTS) audio using F5-TTS, a flexible and powerful TTS …☆14Updated 5 months ago
- AI agent to automatically check grammar and spelling on documentation files☆94Updated 2 months ago
- A sleek, customizable interface for managing LLMs with responsive design and easy agent personalization.☆17Updated last year
- Talk to YouTube☆41Updated 2 years ago
- Hector RAG is a modular RAG framework built on PostgreSQL, offering advanced retrieval methods and fusion techniques for AI-driven applic…☆60Updated 11 months ago
- High-performance lightweight proxy and load balancer for LLM infrastructure. Intelligent routing, automatic failover and unified model di…☆131Updated last week
- Kroko ASR - Speech-to-text☆130Updated 3 months ago
- A QT GUI for large language models☆39Updated 2 years ago
- AI-augmented, conversational information retrieval and data exploration☆37Updated last year
- 🐝 Create powerful, collaborative AI applications.☆65Updated last year
- The hearth of The Pulsar App, fast, secure and shared inference with modern UI☆59Updated last year
- ☆31Updated last year
- Download models from the Ollama library, without Ollama☆121Updated last year
- Training and data processing code for Saiga☆54Updated 3 weeks ago
- ☆60Updated last month
- Telegram LLM bot backed by OpenAI, Whisper, Beam, LLaMA, Weaviate, MinIO and MongoDB☆109Updated 2 years ago
- ☆18Updated 8 months ago
- LLaMA Server combines the power of LLaMA C++ with the beauty of Chatbot UI.☆130Updated 2 years ago
- OpenAPI-like API-server for voice generation (TTS) based on fish-speech-1.5 model.☆29Updated 8 months ago
- Elasticsearch integration into LangChain☆72Updated last month
- a browser gui for nvidia smi☆20Updated 10 months ago
- A tool for an analysis of LLM generations.☆42Updated 3 months ago
- Complex RAG backend☆29Updated last year
- best llms in russian☆62Updated last year
- Useful tools and links for Llama 2.☆48Updated 2 years ago
- An AI Vision Language Model System for extracting structured knowledge graph information(JSON) from images of process diagrams☆38Updated 9 months ago
- LLM Chat is an open-source serverless alternative to ChatGPT.☆36Updated last year