amaiya / onprem
A tool for running on-premises large language models with non-public data
☆705 · Updated this week
Alternatives and similar repositories for onprem:
Users who are interested in onprem are comparing it to the libraries listed below:
- Agents Capable of Self-Editing Their Prompts / Python Code ☆753 · Updated 10 months ago
- Marsha is a functional, higher-level, English-based programming language that gets compiled into tested Python software by an LLM ☆469 · Updated last year
- A FastAPI service for semantic text search using precomputed embeddings and advanced similarity measures, with built-in support for vario… ☆990 · Updated 3 months ago
- AI-managed code blocks in Python ⏪⏩ ☆469 · Updated last year
- Large language model evaluation and workflow framework from Phase AI. ☆453 · Updated last week
- A fast and minimal framework for building agent-integrated systems ☆419 · Updated 6 months ago
- Instruct-tune LLaMA on consumer hardware ☆362 · Updated last year
- A minimal Python package for storing and retrieving text using chunking, embeddings, and vector search. ☆669 · Updated 3 months ago
- Structured and typehinted GPT responses in Python ☆735 · Updated 6 months ago
- This project provides an API with user level access support to transcribe speech to text using a finetuned and processed Whisper ASR mode… ☆878 · Updated last year
- ☆737 · Updated 9 months ago
- A script to effortlessly extract your entire ChatGPT data export from JSON files to nicely-formatted markdown files. ☆720 · Updated 6 months ago
- A tiny nearest-neighbor embedding database built with SQLite and Pytorch. (In development!) ☆771 · Updated last year
- clean & curate your data with LLMs. ☆473 · Updated 7 months ago
- Neum AI is a best-in-class framework to manage the creation and synchronization of vector embeddings at large scale. ☆850 · Updated last year
- Build robust LLM applications with true composability 🔗 ☆414 · Updated last year
- fast vector database made in numpy ☆750 · Updated 9 months ago
- This project collects GPU benchmarks from various cloud providers and compares them to fixed per token costs. Use our tool for efficient … ☆216 · Updated last month
- Count and truncate text based on tokens ☆288 · Updated 8 months ago
- Stateful load balancer custom-tailored for llama.cpp 🏓🦙 ☆676 · Updated last week
- Demo of twilio ☆270 · Updated 11 months ago
- Prompt engineering for developers ☆676 · Updated 11 months ago
- Promptr is a CLI tool that applies plain language instructions to the filesystem. Instructions can utilize a liquidjs based templating sy… ☆916 · Updated last month
- LLMFlows - Simple, Explicit and Transparent LLM Apps ☆679 · Updated this week
- Complex LLM Workflows from Simple JSON. ☆290 · Updated last year
- Wanderlust OpenAI example using Solara ☆215 · Updated last year
- Turn expensive prompts into cheap fine-tuned models ☆2,528 · Updated 8 months ago
- A reactive runtime for building durable AI agents ☆1,295 · Updated 3 weeks ago
- Build a chatbot or Q&A bot of your website's content ☆528 · Updated last year
- A series of top performing Text to SQL LLMs ☆866 · Updated 11 months ago