A toolkit for applying LLMs to sensitive, non-public data in offline or restricted environments
☆838 · Apr 22, 2026 · Updated 2 weeks ago
Alternatives and similar repositories for onprem
Users that are interested in onprem are comparing it to the libraries listed below.
- An LLM-based autonomous agent controlling real-world applications via RESTful APIs ☆1,394 · Jun 7, 2024 · Updated last year
- Seamlessly integrate LLMs as Python functions ☆2,404 · Mar 11, 2026 · Updated last month
- Turn expensive prompts into cheap fine-tuned models ☆2,795 · May 25, 2024 · Updated last year
- Toolkit for fine-tuning, ablating and unit-testing open-source LLMs. ☆871 · Oct 25, 2024 · Updated last year
- RAG based tool for indexing and searching PDF text data using OpenAI API and FAISS (Facebook AI Similarity Search) index, designed for ra… ☆677 · Nov 2, 2025 · Updated 6 months ago
- AI-managed code blocks in Python ⏪⏩ ☆466 · Oct 5, 2023 · Updated 2 years ago
- An extensible, easy-to-use, and portable diffusion web UI 👨🎨 ☆1,672 · Aug 18, 2023 · Updated 2 years ago
- ☆612 · Mar 4, 2024 · Updated 2 years ago
- A real world full-stack application using LlamaIndex ☆2,598 · Mar 12, 2025 · Updated last year
- A tool to analyze and debug neural networks in pytorch. Use a GUI to traverse the computation graph and view the data from many different… ☆298 · Dec 7, 2024 · Updated last year
- prompt2model - Generate Deployable Models from Natural Language Instructions ☆2,015 · Dec 29, 2024 · Updated last year
- Run any open-source LLMs, such as DeepSeek and Llama, as OpenAI compatible API endpoint in the cloud. ☆12,311 · Apr 27, 2026 · Updated last week
- Create and share easy-to-make, built-to-last, innovative, and customizable experiences ☆33 · Feb 21, 2024 · Updated 2 years ago
- AutoChain: Build lightweight, extensible, and testable LLM Agents ☆1,877 · Dec 16, 2025 · Updated 4 months ago
- Open-source tools for prompt testing and experimentation, with support for both LLMs (e.g. OpenAI, LLaMA) and vector databases (e.g. Chro… ☆3,039 · Feb 11, 2026 · Updated 2 months ago
- Interact with your documents using the power of GPT, 100% privately, no data leaks ☆57,207 · Feb 26, 2026 · Updated 2 months ago
- Llama 2 Everywhere (L2E) ☆1,527 · Aug 27, 2025 · Updated 8 months ago
- ☆9,657 · Oct 16, 2025 · Updated 6 months ago
- Local voice chatbot for engaging conversations, powered by Ollama, Hugging Face Transformers, and Coqui TTS Toolkit ☆785 · Aug 12, 2024 · Updated last year
- Create API agents from OpenAPI Specs ☆186 · Nov 12, 2023 · Updated 2 years ago
- A Kurtosis package for Python data engineers, deploying a Jupyter notebook along with a configurable set of databases, and a visualizatio… ☆109 · Dec 4, 2023 · Updated 2 years ago
- 🤖 Deploy a private ChatGPT alternative hosted within your VPC. 🔮 Connect it to your organization's knowledge base and use it as a corpo… ☆1,587 · Sep 11, 2023 · Updated 2 years ago
- This project provides an API with user level access support to transcribe speech to text using a finetuned and processed Whisper ASR mode… ☆913 · Apr 28, 2026 · Updated last week
- Zep | Examples, Integrations, & More ☆4,514 · Apr 9, 2026 · Updated last month
- Complex LLM Workflows from Simple JSON. ☆321 · Aug 11, 2023 · Updated 2 years ago
- A guidance language for controlling large language models. ☆21,420 · Apr 10, 2026 · Updated 3 weeks ago
- Superagent protects your AI applications against prompt injections, data leaks, and harmful outputs. Embed safety directly into your app … ☆6,581 · Apr 11, 2026 · Updated 3 weeks ago
- Convert your ChatGPT export (ZIP) into clean Markdown text files with inline media, and generate data visualizations like word clouds and… ☆854 · Apr 16, 2026 · Updated 3 weeks ago
- 💡 All-in-one AI framework for semantic search, LLM orchestration and language model workflows ☆12,453 · May 1, 2026 · Updated last week
- LLocalSearch is a completely locally running search aggregator using LLM Agents. The user can ask a question and the system will use a ch… ☆5,954 · Mar 24, 2026 · Updated last month
- A RAG LLM co-pilot for browsing the web, powered by local LLMs ☆1,511 · Jan 26, 2025 · Updated last year
- Structured Outputs ☆13,776 · Apr 16, 2026 · Updated 3 weeks ago
- Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls) ☆12,858 · Apr 13, 2026 · Updated 3 weeks ago
- A tiny library for coding with large language models. ☆1,233 · Jul 10, 2024 · Updated last year
- 🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading ☆10,109 · Sep 7, 2024 · Updated last year
- Talk to AI modes in terminal. Bard|GPT3.5|Llama2 ☆157 · Jan 25, 2024 · Updated 2 years ago
- A fast inference library for running LLMs locally on modern consumer-class GPUs ☆4,514 · Mar 4, 2026 · Updated 2 months ago
- Numbers every LLM developer should know ☆4,301 · Jan 16, 2024 · Updated 2 years ago
- A comprehensive guide to building RAG-based LLM applications for production. ☆1,856 · Aug 2, 2024 · Updated last year