preset-io / promptimizeLinks

Promptimize is a prompt engineering evaluation and testing toolkit.

☆474

Alternatives and similar repositories for promptimize

Users that are interested in promptimize are comparing it to the libraries listed below

Sorting:

rogeriochaves / langstream
Build robust LLM applications with true composability 🔗
☆418Updated last year
MagnivOrg / prompt-layer-library
🍰 PromptLayer - Maintain a log of your prompts and OpenAI API requests. Track, debug, and replay old completions.
☆649Updated last week
arthur-ai / bench
A tool for evaluating LLMs
☆424Updated last year
dgarnitz / vectorflow
VectorFlow is a high volume vector embedding pipeline that ingests raw data, transforms it into vectors and writes it to a vector DB of y…
☆698Updated last year
deepset-ai / prompthub
☆173Updated last year
shroominic / funcchain
⛓️ build cognitive systems, pythonic
☆339Updated 8 months ago
TonicAI / tonic_validate
Metrics to evaluate the quality of responses of your Retrieval Augmented Generation (RAG) applications.
☆315Updated 3 weeks ago
langchain-ai / langchain-benchmarks
🦜💯 Flex those feathers!
☆253Updated 9 months ago
BlackHC / llm-strategy
Directly Connecting Python to LLMs via Strongly-Typed Functions, Dataclasses, Interfaces & Generic Types
☆399Updated 5 months ago
astronomer / ask-astro
An end-to-end LLM reference implementation providing a Q&A interface for Airflow and Astronomer
☆255Updated 3 weeks ago
ju-bezdek / langchain-decorators
syntactic sugar 🍭 for langchain
☆235Updated 2 months ago
approximatelabs / lambdaprompt
λprompt - A functional programming interface for building AI systems
☆380Updated last year
finic-ai / doctran
☆508Updated 11 months ago
tigerlab-ai / tiger
Open Source LLM toolkit to build trustworthy LLM applications. TigerArmor (AI safety), TigerRAG (embedding, RAG), TigerTune (fine-tuning)
☆398Updated last year
microsoft / prompt-engine-py
A utility library for creating and maintaining prompts for Large Language Models
☆234Updated 2 years ago
anthropics / anthropic-tools
☆314Updated 9 months ago
langchain-ai / auto-evaluator
☆772Updated last month
run-llama / ai-engineer-workshop
☆185Updated last year
athina-ai / athina-evals
Python SDK for running evaluations on LLM generated responses
☆291Updated 2 months ago
amoffat / HeimdaLLM
Constrain LLM output
☆112Updated last year
relari-ai / continuous-eval
Data-Driven Evaluation for LLM-Powered Applications
☆501Updated 6 months ago
BerriAI / reliableGPT
Get 100% uptime, reliability from OpenAI. Handle Rate Limit, Timeout, API, Keys Errors
☆669Updated last year
stoyan-stoyanov / llmflows
LLMFlows - Simple, Explicit and Transparent LLM Apps
☆699Updated 5 months ago
felixbrock / lemon-agent
Plan-Validate-Solve (PVS) Agent for accurate, reliable and reproducable workflow automation
☆342Updated last year
totalhack / zillion
Make sense of it all. Semantic data modeling and analytics with a sprinkle of AI. https://totalhack.github.io/zillion/
☆203Updated 2 months ago
PrefectHQ / langchain-prefect
Tools for using Langchain with Prefect
☆104Updated 2 years ago
whyhow-ai / rule-based-retrieval
The Rule-based Retrieval package is a Python package that enables you to create and manage Retrieval Augmented Generation (RAG) applicati…
☆245Updated 10 months ago
jerpint / buster
☆206Updated last year
ChrisPappalardo / eparse
Excel spreadsheet crawler and table parser for data extraction and querying
☆150Updated 5 months ago
pinecone-io / canopy
Retrieval Augmented Generation (RAG) framework and context engine powered by Pinecone
☆1,020Updated 8 months ago