aws-samples / evaluating-large-language-models-using-llm-as-a-judge
☆20 Updated 9 months ago
Alternatives and similar repositories for evaluating-large-language-models-using-llm-as-a-judge
Users who are interested in evaluating-large-language-models-using-llm-as-a-judge are comparing it to the libraries listed below.
- A simple Streamlit application to visualize document chunks and queries in embedding space ☆13 Updated 6 months ago
- ☆55 Updated 4 months ago
- Streamlit app for recommending eval functions using prompt diffs ☆29 Updated last year
- ☆43 Updated last year
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute… ☆50 Updated last year
- Official Repo for CRMArena and CRMArena-Pro ☆121 Updated 4 months ago
- Reference architecture for LLM-based applications on Google Cloud Platform with Redis Enterprise as a high-performance data layer. ☆38 Updated 6 months ago
- A method for steering LLMs to better follow instructions ☆56 Updated 2 months ago
- ☆49 Updated 5 months ago
- ☆146 Updated last year
- ☆45 Updated last year
- ☆40 Updated 10 months ago
- ☆24 Updated 10 months ago
- ☆79 Updated 9 months ago
- Writing Blog Posts with Generative Feedback Loops! ☆50 Updated last year
- ☆20 Updated last year
- ☆53 Updated last year
- ☆50 Updated last year
- ☆80 Updated last year
- Codebase accompanying the Summary of a Haystack paper. ☆79 Updated last year
- Generate Tools and Toolkits from any Python SDK -- no extra code required ☆53 Updated 11 months ago
- Question Answering Generative AI application with Large Language Models (LLMs) and Amazon OpenSearch Service ☆27 Updated 10 months ago
- Generative AI with Amazon Bedrock, published by Packt ☆26 Updated last year
- Supervised instruction finetuning for LLM with HF trainer and Deepspeed ☆36 Updated 2 years ago
- Creating Generative AI Apps which work ☆17 Updated 6 months ago
- Dynamic Metadata based RAG Framework ☆76 Updated last year
- This repository contains the source code for running llamaindex tutorials from https://howaibuildthis.substack.com/ ☆41 Updated last year
- Build Enterprise RAG (Retrieval Augmented Generation) Pipelines to tackle various Generative AI use cases with LLMs by simply plugging co… ☆114 Updated last year
- Complete example of how to build an Agentic RAG architecture with Redis, Amazon Bedrock, and LlamaIndex. ☆100 Updated 10 months ago
- ☆37 Updated last year