A toolkit for applying LLMs to sensitive, non-public data in offline or restricted environments
☆839May 8, 2026Updated 3 weeks ago
Alternatives and similar repositories for onprem
Users that are interested in onprem are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- An LLM-based autonomous agent controlling real-world applications via RESTful APIs☆1,397Jun 7, 2024Updated last year
- Seamlessly integrate LLMs as Python functions☆2,408Mar 11, 2026Updated 2 months ago
- Turn expensive prompts into cheap fine-tuned models☆2,807May 25, 2024Updated 2 years ago
- Toolkit for fine-tuning, ablating and unit-testing open-source LLMs.☆872May 4, 2026Updated 3 weeks ago
- RAG based tool for indexing and searching PDF text data using OpenAI API and FAISS (Facebook AI Similarity Search) index, designed for ra…☆677Nov 2, 2025Updated 6 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- AI-managed code blocks in Python ⏪⏩☆466Oct 5, 2023Updated 2 years ago
- An extensible, easy-to-use, and portable diffusion web UI 👨🎨☆1,670Aug 18, 2023Updated 2 years ago
- ☆616Mar 4, 2024Updated 2 years ago
- A real world full-stack application using LlamaIndex☆2,599Mar 12, 2025Updated last year
- A tool to analyze and debug neural networks in pytorch. Use a GUI to traverse the computation graph and view the data from many different…☆298Dec 7, 2024Updated last year
- prompt2model - Generate Deployable Models from Natural Language Instructions☆2,015Dec 29, 2024Updated last year
- Run any open-source LLMs, such as DeepSeek and Llama, as OpenAI compatible API endpoint in the cloud.☆12,329May 18, 2026Updated last week
- Create and share easy-to-make, built-to-last, innovative, and customizable experiences☆33Feb 21, 2024Updated 2 years ago
- AutoChain: Build lightweight, extensible, and testable LLM Agents☆1,875Dec 16, 2025Updated 5 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Open-source tools for prompt testing and experimentation, with support for both LLMs (e.g. OpenAI, LLaMA) and vector databases (e.g. Chro…☆3,035Feb 11, 2026Updated 3 months ago
- Interact with your documents using the power of GPT, 100% privately, no data leaks☆57,217Feb 26, 2026Updated 3 months ago
- Llama 2 Everywhere (L2E)☆1,526Aug 27, 2025Updated 9 months ago
- ☆9,664Oct 16, 2025Updated 7 months ago
- Local voice chatbot for engaging conversations, powered by Ollama, Hugging Face Transformers, and Coqui TTS Toolkit☆784Aug 12, 2024Updated last year
- Create API agents from OpenAPI Specs☆186Nov 12, 2023Updated 2 years ago
- A Kurtosis package for Python data engineers, deploying a Jupyter notebook along with a configurable set of databases, and a visualizatio…☆109Dec 4, 2023Updated 2 years ago
- 🤖 Deploy a private ChatGPT alternative hosted within your VPC. 🔮 Connect it to your organization's knowledge base and use it as a corpo…☆1,588Sep 11, 2023Updated 2 years ago
- This project provides an API with user level access support to transcribe speech to text using a finetuned and processed Whisper ASR mode…☆916Apr 28, 2026Updated last month
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Zep | Examples, Integrations, & More☆4,597Apr 9, 2026Updated last month
- Complex LLM Workflows from Simple JSON.☆322Aug 11, 2023Updated 2 years ago
- A guidance language for controlling large language models.☆21,473May 21, 2026Updated last week
- Superagent protects your AI applications against prompt injections, data leaks, and harmful outputs. Embed safety directly into your app …☆6,613Apr 11, 2026Updated last month
- Convert your ChatGPT export (ZIP) into clean Markdown text files with inline media, and generate data visualizations like word clouds and…☆857May 11, 2026Updated 2 weeks ago
- 💡 All-in-one AI framework for semantic search, LLM orchestration and language model workflows☆12,607May 22, 2026Updated last week
- LLocalSearch is a completely locally running search aggregator using LLM Agents. The user can ask a question and the system will use a ch…☆5,955Mar 24, 2026Updated 2 months ago
- A RAG LLM co-pilot for browsing the web, powered by local LLMs☆1,513Jan 26, 2025Updated last year
- Structured Outputs☆13,891May 18, 2026Updated last week
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)☆12,870Apr 13, 2026Updated last month
- A tiny library for coding with large language models.☆1,234Jul 10, 2024Updated last year
- 🌸 Run LLMs at home, BitTorrent-style. Fine-tuning and inference up to 10x faster than offloading☆10,149Sep 7, 2024Updated last year
- Talk to AI modes in terminal. Bard|GPT3.5|Llama2☆156Jan 25, 2024Updated 2 years ago
- A fast inference library for running LLMs locally on modern consumer-class GPUs☆4,531Mar 4, 2026Updated 2 months ago
- Numbers every LLM developer should know☆4,300Jan 16, 2024Updated 2 years ago
- A comprehensive guide to building RAG-based LLM applications for production.☆1,854Aug 2, 2024Updated last year