aws-samples / evaluating-large-language-models-using-llm-as-a-judgeLinks
β22Updated last year
Alternatives and similar repositories for evaluating-large-language-models-using-llm-as-a-judge
Users that are interested in evaluating-large-language-models-using-llm-as-a-judge are comparing it to the libraries listed below
Sorting:
- A simple Streamlit application to visualize document chunks and queries in embedding space πΊοΈπβ13Updated 9 months ago
- β43Updated last year
- A method for steering llms to better follow instructionsβ76Updated 5 months ago
- β56Updated 7 months ago
- β21Updated last year
- β52Updated 8 months ago
- β24Updated last year
- β82Updated 2 months ago
- β80Updated last year
- Writing Blog Posts with Generative Feedback Loops!β50Updated last year
- β55Updated last year
- β47Updated last year
- Lite weight wrapper for the independent implementation of SPLADE++ models for search & retrieval pipelines. Models and Library created byβ¦β34Updated last year
- β147Updated last year
- Codebase accompanying the Summary of a Haystack paper.β80Updated last year
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absoluteβ¦β51Updated last year
- β23Updated 2 months ago
- Question Answering Generative AI application with Large Language Models (LLMs) and Amazon OpenSearch Serviceβ28Updated last year
- Streamlit app for recommending eval functions using prompt diffsβ30Updated 2 years ago
- Encountering 14 different Naive RAG fails and using KG to solve itβ20Updated last month
- Generate Tools and Toolkits from any Python SDK -- no extra code requiredβ54Updated last year
- Creating Generative AI Apps which workβ17Updated 9 months ago
- Dynamic Metadata based RAG Frameworkβ78Updated last month
- This repo is the central repo for all the RAG Evaluation reference material and partner workshopβ79Updated 9 months ago
- β54Updated 2 weeks ago
- Generative AI with Amazon Bedrock, published by Packtβ27Updated last year
- Adding NeMo Guardrails to a LlamaIndex RAG pipelineβ41Updated last year
- This repository demonstrates the construction of a state-of-the-art multimodal search engine, leveraging Amazon Titan Embeddings, Amazon β¦β54Updated 4 months ago
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)β75Updated last year
- β39Updated last year