A toolkit for applying LLMs to sensitive, non-public data in offline or restricted environments
☆812 · Feb 13, 2026 · Updated 3 weeks ago
Alternatives and similar repositories for onprem
Users who are interested in onprem compare it to the libraries listed below.
- An LLM-based autonomous agent controlling real-world applications via RESTful APIs ☆1,390 · Jun 7, 2024 · Updated last year
- Seamlessly integrate LLMs as Python functions ☆2,389 · Nov 24, 2025 · Updated 3 months ago
- AI-managed code blocks in Python ⏪⏩ ☆467 · Oct 5, 2023 · Updated 2 years ago
- Turn expensive prompts into cheap fine-tuned models ☆2,784 · May 25, 2024 · Updated last year
- Toolkit for fine-tuning, ablating and unit-testing open-source LLMs. ☆870 · Oct 25, 2024 · Updated last year
- ☆607 · Mar 4, 2024 · Updated 2 years ago
- An extensible, easy-to-use, and portable diffusion web UI 👨🎨 ☆1,672 · Aug 18, 2023 · Updated 2 years ago
- RAG based tool for indexing and searching PDF text data using OpenAI API and FAISS (Facebook AI Similarity Search) index, designed for ra… ☆680 · Nov 2, 2025 · Updated 4 months ago
- Create and share easy-to-make, built-to-last, innovative, and customizable experiences ☆34 · Feb 21, 2024 · Updated 2 years ago
- Get 100% uptime, reliability from OpenAI. Handle Rate Limit, Timeout, API, Keys Errors ☆698 · Nov 20, 2023 · Updated 2 years ago
- Open-source tools for prompt testing and experimentation, with support for both LLMs (e.g. OpenAI, LLaMA) and vector databases (e.g. Chro… ☆3,022 · Feb 11, 2026 · Updated 3 weeks ago
- AutoChain: Build lightweight, extensible, and testable LLM Agents ☆1,870 · Dec 16, 2025 · Updated 2 months ago
- A real world full-stack application using LlamaIndex ☆2,593 · Mar 12, 2025 · Updated 11 months ago
- Run any open-source LLMs, such as DeepSeek and Llama, as OpenAI compatible API endpoint in the cloud. ☆12,148 · Updated this week
- ☆9,666 · Oct 16, 2025 · Updated 4 months ago
- Llama 2 Everywhere (L2E) ☆1,529 · Aug 27, 2025 · Updated 6 months ago
- prompt2model - Generate Deployable Models from Natural Language Instructions ☆2,009 · Dec 29, 2024 · Updated last year
- 🤖 Deploy a private ChatGPT alternative hosted within your VPC. 🔮 Connect it to your organization's knowledge base and use it as a corpo… ☆1,506 · Sep 11, 2023 · Updated 2 years ago
- ☆25 · Sep 19, 2023 · Updated 2 years ago
- Structured Outputs ☆13,488 · Updated this week
- Local voice chatbot for engaging conversations, powered by Ollama, Hugging Face Transformers, and Coqui TTS Toolkit ☆783 · Aug 12, 2024 · Updated last year
- A guidance language for controlling large language models. ☆21,327 · Feb 13, 2026 · Updated 3 weeks ago
- A RAG LLM co-pilot for browsing the web, powered by local LLMs ☆1,515 · Jan 26, 2025 · Updated last year
- Zep | Examples, Integrations, & More ☆4,121 · Updated this week
- A fast inference library for running LLMs locally on modern consumer-class GPUs ☆4,444 · Dec 9, 2025 · Updated 2 months ago
- Complex LLM Workflows from Simple JSON. ☆320 · Aug 11, 2023 · Updated 2 years ago
- Generative AutoML for Tabular Data ☆446 · Feb 3, 2025 · Updated last year
- Numbers every LLM developer should know ☆4,287 · Jan 16, 2024 · Updated 2 years ago
- Convert your ChatGPT export (ZIP) into clean Markdown text files with inline media, and generate data visualizations like word clouds and… ☆839 · Updated this week
- 💡 All-in-one AI framework for semantic search, LLM orchestration and language model workflows ☆12,247 · Feb 25, 2026 · Updated last week
- Radiantloom Email Assist 7B is an email-assistant large language model fine-tuned from Zephyr-7B-Beta, over a custom-curated dataset of 1… ☆14 · Jan 19, 2024 · Updated 2 years ago
- A Kurtosis package for Python data engineers, deploying a Jupyter notebook along with a configurable set of databases, and a visualizatio… ☆109 · Dec 4, 2023 · Updated 2 years ago
- Run any ML model from any programming language. ☆422 · Jan 15, 2024 · Updated 2 years ago
- Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls) ☆12,734 · Feb 9, 2026 · Updated 3 weeks ago
- Create API agents from OpenAPI Specs ☆186 · Nov 12, 2023 · Updated 2 years ago
- 🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading ☆9,971 · Sep 7, 2024 · Updated last year
- Code Indexer Loop is a Python library for indexing and retrieving source code files through an integrated vector database that's continuo… ☆176 · Apr 9, 2024 · Updated last year
- Superagent protects your AI applications against prompt injections, data leaks, and harmful outputs. Embed safety directly into your app … ☆6,422 · Feb 3, 2026 · Updated last month
- This project provides an API with user level access support to transcribe speech to text using a finetuned and processed Whisper ASR mode… ☆913 · Jan 2, 2026 · Updated 2 months ago