ai-evals-course / judgyLinks
Python package for estimating a CIs for metrics evaluated by LLM-as-Judges.
☆75Updated 7 months ago
Alternatives and similar repositories for judgy
Users that are interested in judgy are comparing it to the libraries listed below
Sorting:
- Easiest way to give context to LLMs; Attachments has the ambition to be the general funnel for any files to be transformed into images+te…☆327Updated 3 months ago
- ☆84Updated last year
- ☆85Updated 3 months ago
- ☆87Updated 2 months ago
- ☆67Updated 4 months ago
- A comprehensive 0-to-1 guide for building self-improving LLM applications with DSPy framework☆197Updated 2 months ago
- Get a markdown version of any webpage with a keyboard shortcut.☆67Updated 10 months ago
- How to build the best search, one step at a time!☆229Updated last month
- Simple UI for debugging correlations of text embeddings☆306Updated 7 months ago
- ☆36Updated 7 months ago
- DSPydantic: Auto-Optimize Your Pydantic Models with DSPy☆225Updated 2 weeks ago
- ☆79Updated last year
- dspy-cli is a tool for creating, developing, testing, and deploying DSPy programs as HTTP APIs.☆63Updated this week
- A Lightweight Library for AI Observability☆253Updated 10 months ago
- ☆55Updated 8 months ago
- Plug-and-play document AI with zero-shot models.☆120Updated this week
- Context Engineering Course with DSPy☆207Updated 5 months ago
- A small library of LLM judges☆311Updated 5 months ago
- Using various instructor clients evaluating the quality and capabilities of extractions and reasoning.☆52Updated last year
- Minimal agent runtime built with DSPy modules and a thin Python loop. Includes CLI, FastAPI server, and eval harness with OpenAI/Ollama s…☆65Updated last week
- Dynamic Metadata based RAG Framework☆78Updated 3 weeks ago
- Low latency, High Accuracy, Custom Query routers for Humans and Agents. Built by Prithivi Da☆119Updated 9 months ago
- ☆36Updated 10 months ago
- A Chrome extension that saves conversations with Claude to GitHubGists or your clipboard.☆89Updated last year
- A collection of Compound Retrieval Systems implemented with DSPy and Weaviate.☆92Updated 2 months ago
- ☆162Updated 2 weeks ago
- ☆262Updated last month
- SCIPE is a powerful tool for evaluating and diagnosing LLM (Large Language Model) graphs or chains.☆25Updated last year
- ☆80Updated last year
- Deep Research for your internal data☆351Updated 6 months ago