Promptimize is a prompt engineering evaluation and testing toolkit.
☆494Mar 16, 2026Updated last month
Alternatives and similar repositories for promptimize
Users that are interested in promptimize are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Open-source tools for prompt testing and experimentation, with support for both LLMs (e.g. OpenAI, LLaMA) and vector databases (e.g. Chro…☆3,032Feb 11, 2026Updated 2 months ago
- Test your prompts, agents, and RAGs. Red teaming/pentesting/vulnerability scanning for AI. Compare performance of GPT, Claude, Gemini, Ll…☆20,196Updated this week
- An open-source visual programming environment for battle-testing prompts to LLMs.☆2,975Apr 6, 2026Updated last week
- Adding guardrails to large language models.☆6,675Apr 3, 2026Updated 2 weeks ago
- A guidance language for controlling large language models.☆21,397Apr 10, 2026Updated last week
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Modular Python framework for AI agents and workflows with chain-of-thought reasoning, tools, and memory.☆2,511Updated this week
- structured outputs for llms☆12,749Updated this week
- an ambient intelligence library☆6,138Updated this week
- DSPy: The framework for programming—not prompting—language models☆33,649Apr 13, 2026Updated last week
- A tool for evaluating LLMs☆429Mar 15, 2026Updated last month
- Structured Outputs☆13,657Mar 26, 2026Updated 3 weeks ago
- Continuous Integration for LLM powered applications☆257Aug 11, 2023Updated 2 years ago
- 🦾 Take control of your AI agents☆1,391Aug 22, 2025Updated 7 months ago
- λprompt - A functional programming interface for building AI systems☆377Jan 18, 2024Updated 2 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- 🍰 PromptLayer - Maintain a log of your prompts and OpenAI API requests. Track, debug, and replay old completions.☆751Updated this week
- ☆9,656Oct 16, 2025Updated 6 months ago
- A central, open resource for data and tools related to chain-of-thought reasoning in large language models. Developed @ Samwald research …☆1,013Dec 16, 2024Updated last year
- Semantic cache for LLMs. Fully integrated with LangChain and llama_index.☆7,990Jul 11, 2025Updated 9 months ago
- 🐢 Open-Source Evaluation & Testing library for LLM Agents☆5,273Updated this week
- A language for constraint-guided and efficient LLM programming.☆4,164May 22, 2025Updated 10 months ago
- A Github API client to extract events and actions, and load into a database☆28Oct 22, 2021Updated 4 years ago
- Prompt Engineering | Prompt Versioning | Use GPT or other prompt based models to get structured output. Join our discord for Prompt-Engin…☆4,582Mar 27, 2026Updated 3 weeks ago
- Test suite for LLM prompts☆221Apr 18, 2023Updated 3 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Supercharge Your LLM Application Evaluations 🚀☆13,415Feb 24, 2026Updated last month
- dbt starter code for enterprise Snowflake usage data artifacts☆21Sep 7, 2022Updated 3 years ago
- The LLM Evaluation Framework☆14,728Apr 9, 2026Updated last week
- Scalable and efficient data transformation framework - backwards compatible with dbt.☆3,029Apr 8, 2026Updated last week
- Waffer-thin FlaskGPT on Vercel.☆12Jun 1, 2023Updated 2 years ago
- Build Conversational AI in minutes ⚡️☆11,967Apr 9, 2026Updated last week
- Evaluation and Tracking for LLM Experiments and AI Agents☆3,257Updated this week
- Gorilla: Training and Evaluating LLMs for Function Calls (Tool Calls)☆12,838Apr 13, 2026Updated last week
- Control plane for agents and engineers to provision compute and run training and inference across NVIDIA, AMD, TPU, and Tenstorrent GPUs—…☆2,090Updated this week
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- AdalFlow: The library to build & auto-optimize LLM applications.☆4,105Mar 15, 2026Updated last month
- A unified evaluation framework for large language models☆2,798Feb 20, 2026Updated last month
- Dynamic, resilient AI orchestration. Coordinate data, models, and compute as you build AI workflows. Flyte 2 now available locally: https…☆6,941Updated this week
- Awesome things you can do with ChatGPT + Code Interpreter combo 🔥☆1,013Dec 10, 2023Updated 2 years ago
- Superfast AI decision making and intelligent processing of multi-modal data.☆3,434Mar 12, 2026Updated last month
- 📦 Serverless and local-first Open Data Platform☆310Jan 22, 2026Updated 2 months ago
- Developer-friendly OSS embedded retrieval library for multimodal AI. Search More; Manage Less.☆9,945Updated this week