dair-ai / llm-evaluator
Example for Logging LLM Evaluator Prompt Responses
☆15 · Updated last year
Alternatives and similar repositories for llm-evaluator:
Users who are interested in llm-evaluator are comparing it to the libraries listed below:
- ☆12 · Updated last year
- Streamlit app for recommending eval functions using prompt diffs ☆27 · Updated last year
- Prompt Engineering for Large Language Models - Notebooks, Demos, Exercises, and Projects ☆22 · Updated last year
- ChatBot App built using LangChain and Lightning AI ☆18 · Updated last year
- ☆11 · Updated 9 months ago
- The Swarm Ecosystem ☆19 · Updated 6 months ago
- Demos of some issues with LangChain. ☆31 · Updated last year
- ☆18 · Updated last year
- Transform unstructured documents into actionable, structured data with enterprise-grade precision and reliability, ready for large-scale … ☆17 · Updated this week
- ☆30 · Updated last year
- A swarm of LLM agents that will help you test, document, and productionize your code! ☆13 · Updated this week
- Uses a Gradio interface to stream coding related responses from local and cloud based large language models. Pulls context from GitHub Re… ☆18 · Updated 5 months ago
- Radiantloom Email Assist 7B is an email-assistant large language model fine-tuned from Zephyr-7B-Beta, over a custom-curated dataset of 1… ☆14 · Updated last year
- DSPy program/pipeline inspector widget for Jupyter/VSCode Notebooks. ☆31 · Updated last year
- Unstract's interface to LLMs, Embeddings and VectorDBs. ☆18 · Updated 6 months ago
- ☆29 · Updated last year
- ☆20 · Updated last year
- ☆1 · Updated 7 months ago
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre… ☆17 · Updated 4 months ago
- A python command-line tool to download & manage MLX AI models from Hugging Face. ☆17 · Updated 5 months ago
- Exploration using DSPy to optimize modules to maximize performance on the OpenToM dataset ☆14 · Updated 11 months ago
- a simple create-llama template using llama-index v0.10 and integrated with Ollama ☆10 · Updated 9 months ago
- 🐜🔧 A minimalistic tool to fine-tune your LLMs ☆17 · Updated last year
- ☆16 · Updated last week
- The original BabyAGI, updated with LiteLLM and no vector database reliance (csv instead) ☆21 · Updated 4 months ago
- Writing Blog Posts with Generative Feedback Loops! ☆47 · Updated 10 months ago
- Simple playground chat app that interacts with OpenAI's functions with memory and custom tools. ☆18 · Updated last year
- Dataset Viber is your chill repo for data collection, annotation and vibe checks. ☆44 · Updated 5 months ago