This Open LLM Framework serves as a powerful and flexible tool for serving endpoints for embeddings and chat completions using SOTA open source language models. By leveraging models Transformers, this enables various natural language processing (NLP) tasks to be performed via simple HTTP endpoints similar to openai endpoints.
☆21Sep 4, 2024Updated last year
Alternatives and similar repositories for open-llm-server
Users that are interested in open-llm-server are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Local models support for Microsoft's graphrag using ollama (llama3, mistral, gemma2 phi3)- LLM & Embedding extraction☆1,103May 8, 2026Updated last month
- This repo is my settings for using the local LLM with graphrag & an UI to chat with the index result☆16Jul 24, 2024Updated last year
- A modular graph-based Retrieval-Augmented Generation (RAG) system☆20Sep 5, 2024Updated last year
- Raycast extension for usememos/memos.☆11Mar 8, 2025Updated last year
- Useful snippets for your chrome/firefox browsers☆16Apr 10, 2022Updated 4 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- A template to create your own literature survey engine☆14Updated this week