aws-samples / evaluating-large-language-models-using-llm-as-a-judge
☆19 Updated 8 months ago
Alternatives and similar repositories for evaluating-large-language-models-using-llm-as-a-judge
Users who are interested in evaluating-large-language-models-using-llm-as-a-judge are comparing it to the repositories listed below
- A method for steering LLMs to better follow instructions ☆50 Updated last month
- A simple Streamlit application to visualize document chunks and queries in embedding space ☆13 Updated 5 months ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute… ☆49 Updated last year
- Codebase accompanying the Summary of a Haystack paper. ☆79 Updated 11 months ago
- ☆50 Updated 11 months ago
- ☆76 Updated 8 months ago
- ☆43 Updated last year
- ☆20 Updated 11 months ago
- ☆80 Updated last year
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models ☆111 Updated 5 months ago
- Writing Blog Posts with Generative Feedback Loops! ☆50 Updated last year
- A curated list of materials on AI guardrails ☆40 Updated 3 months ago
- ☆40 Updated 9 months ago
- Streamlit app for recommending eval functions using prompt diffs ☆29 Updated last year
- Official Repo for CRMArena and CRMArena-Pro ☆114 Updated 2 months ago
- ☆56 Updated 2 months ago
- Dynamic Metadata based RAG Framework ☆75 Updated last year
- Creating Generative AI Apps which work ☆17 Updated 5 months ago
- Lightweight wrapper for the independent implementation of SPLADE++ models for search & retrieval pipelines. Models and Library created by… ☆32 Updated last year
- A deep-dive into HyDE for Advanced LLM RAG + Introducing AutoHyDE, a semi-supervised framework to improve the effectiveness, covera… ☆32 Updated last year
- Verifiers for LLM Reinforcement Learning ☆72 Updated 5 months ago
- ☆50 Updated 4 months ago
- ☆145 Updated last year
- ☆24 Updated 9 months ago
- ☆48 Updated last year
- ☆37 Updated 7 months ago
- ☆22 Updated 9 months ago
- Explore the use of DSPy for extracting features from PDFs ☆45 Updated last year
- Question Answering Generative AI application with Large Language Models (LLMs) and Amazon OpenSearch Service ☆26 Updated 9 months ago
- Unstructured Data Connectors for Haystack 2.0 ☆17 Updated last year