ai-evals-course / judgyLinks
Python package for estimating a CIs for metrics evaluated by LLM-as-Judges.
☆75Updated 6 months ago
Alternatives and similar repositories for judgy
Users that are interested in judgy are comparing it to the libraries listed below
Sorting:
- Easiest way to give context to LLMs; Attachments has the ambition to be the general funnel for any files to be transformed into images+te…☆323Updated 3 months ago
- DSPydantic: Auto-Optimize Your Pydantic Models with DSPy☆67Updated this week
- Get a markdown version of any webpage with a keyboard shortcut.☆67Updated 9 months ago
- ☆84Updated last year
- Simple UI for debugging correlations of text embeddings☆302Updated 6 months ago
- Extract structured data from any content using LLMs.☆58Updated last week
- ☆87Updated last month
- ☆83Updated 3 months ago
- Plug-and-play, zero-shot document AI.☆119Updated this week
- How to build the best search, one step at a time!☆225Updated 2 weeks ago
- A Lightweight Library for AI Observability☆252Updated 9 months ago
- Dynamic Metadata based RAG Framework☆78Updated this week
- dspy-cli is a tool for creating, developing, testing, and deploying DSPy programs as HTTP APIs.☆55Updated last week
- LLM prompt language based on Jinja. Banks provides tools and functions to build prompts text and chat messages from generic blueprints. I…☆119Updated 4 months ago
- ☆36Updated 7 months ago
- A comprehensive 0-to-1 guide for building self-improving LLM applications with DSPy framework☆195Updated 2 months ago
- Kura is a simple reproduction of the CLIO paper which uses language models to label user behaviour before clustering them based on embedd…☆372Updated 3 months ago
- ☆124Updated 3 months ago
- Train embedding and reranker models for retrieval tasks on Apple Silicon with MLX☆168Updated 2 months ago
- Context Engineering Course with DSPy☆204Updated 4 months ago
- This repo is the central repo for all the RAG Evaluation reference material and partner workshop☆77Updated 7 months ago
- ☆80Updated last year
- ☆53Updated 7 months ago
- Minimal example of MCP for parsing llms.txt☆40Updated 8 months ago
- Code for evaluating with Flow-Judge-v0.1 - an open-source, lightweight (3.8B) language model optimized for LLM system evaluations. Crafte…☆78Updated last year
- Low latency, High Accuracy, Custom Query routers for Humans and Agents. Built by Prithivi Da☆118Updated 8 months ago
- A small library of LLM judges☆306Updated 4 months ago
- Deep Research for your internal data☆349Updated 6 months ago
- Named Entity Recognition using Claude Citations☆79Updated 6 months ago
- ☆65Updated 4 months ago