preset-io / promptimizeView external linksLinks
Promptimize is a prompt engineering evaluation and testing toolkit.
☆492Jan 19, 2026Updated 3 weeks ago
Alternatives and similar repositories for promptimize
Users that are interested in promptimize are comparing it to the libraries listed below
Sorting:
- Open-source tools for prompt testing and experimentation, with support for both LLMs (e.g. OpenAI, LLaMA) and vector databases (e.g. Chro…☆3,003Updated this week
- Test your prompts, agents, and RAGs. AI Red teaming, pentesting, and vulnerability scanning for LLMs. Compare performance of GPT, Claude,…☆10,339Feb 8, 2026Updated last week
- An open-source visual programming environment for battle-testing prompts to LLMs.☆2,922Jan 2, 2026Updated last month
- Modular Python framework for AI agents and workflows with chain-of-thought reasoning, tools, and memory.☆2,466Feb 3, 2026Updated last week
- an ambient intelligence library☆6,071Feb 6, 2026Updated last week
- Adding guardrails to large language models.☆6,399Updated this week
- A guidance language for controlling large language models.☆21,270Feb 6, 2026Updated last week
- structured outputs for llms☆12,357Updated this week
- 🍰 PromptLayer - Maintain a log of your prompts and OpenAI API requests. Track, debug, and replay old completions.☆735Updated this week
- KMD is a collection of conversational exchanges between patients and doctors on various medical topics. It aims to capture the intricaci…☆24Nov 15, 2023Updated 2 years ago
- DSPy: The framework for programming—not prompting—language models☆32,156Updated this week
- A language for constraint-guided and efficient LLM programming.☆4,148May 22, 2025Updated 8 months ago
- 🐢 Open-Source Evaluation & Testing library for LLM Agents☆5,111Feb 6, 2026Updated last week
- Structured Outputs☆13,403Feb 6, 2026Updated last week
- A Github API client to extract events and actions, and load into a database☆28Oct 22, 2021Updated 4 years ago
- 🦾 Take control of your AI agents☆1,387Aug 22, 2025Updated 5 months ago
- A central, open resource for data and tools related to chain-of-thought reasoning in large language models. Developed @ Samwald research …☆1,013Dec 16, 2024Updated last year
- The LLM Evaluation Framework☆13,613Updated this week
- λprompt - A functional programming interface for building AI systems☆380Jan 18, 2024Updated 2 years ago
- Continuous Integration for LLM powered applications☆254Aug 11, 2023Updated 2 years ago
- ☆9,644Oct 16, 2025Updated 3 months ago
- Supercharge Your LLM Application Evaluations 🚀☆12,605Jan 31, 2026Updated 2 weeks ago
- Hosted embedding platform to discover, evaluate, and retrieve embeddings☆73Sep 21, 2023Updated 2 years ago
- Prompt Engineering | Prompt Versioning | Use GPT or other prompt based models to get structured output. Join our discord for Prompt-Engin…☆4,561Feb 12, 2025Updated last year
- Evaluation and Tracking for LLM Experiments and AI Agents☆3,082Updated this week
- Python client library for improving your LLM app accuracy☆97Feb 11, 2025Updated last year
- Superfast AI decision making and intelligent processing of multi-modal data.☆3,286Nov 18, 2025Updated 2 months ago
- Experimental LLM agent/toolkit with direct Vim access using neovim/pynvim☆75Sep 30, 2024Updated last year
- AdalFlow: The library to build & auto-optimize LLM applications.☆4,024Updated this week
- Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)☆12,717Updated this week
- Semantic cache for LLMs. Fully integrated with LangChain and llama_index.☆7,928Jul 11, 2025Updated 7 months ago
- A unified evaluation framework for large language models☆2,775Jan 22, 2026Updated 3 weeks ago
- Build Conversational AI in minutes ⚡️☆11,558Feb 3, 2026Updated last week
- A tool for evaluating LLMs☆428May 10, 2024Updated last year
- Simple orchestration for EC2 spot containers☆19Sep 27, 2024Updated last year
- An awesome & curated list of best LLMOps tools for developers☆5,605Feb 3, 2026Updated last week
- dstack is an open-source control plane for running development, training, and inference jobs on GPUs—across hyperscalers, neoclouds, or o…☆2,031Updated this week
- Chat Markup Language conversation library☆55Jan 3, 2024Updated 2 years ago
- [ACL'24] Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator"☆54Feb 23, 2024Updated last year