aws-samples / evaluating-large-language-models-using-llm-as-a-judgeLinks
☆19Updated 6 months ago
Alternatives and similar repositories for evaluating-large-language-models-using-llm-as-a-judge
Users that are interested in evaluating-large-language-models-using-llm-as-a-judge are comparing it to the libraries listed below
Sorting:
- ☆20Updated 9 months ago
- ☆40Updated 7 months ago
- ☆77Updated last year
- ☆47Updated 9 months ago
- A framework for simulating e-commerce data and interactions that can be used to build recommendation systems☆10Updated last year
- Writing Blog Posts with Generative Feedback Loops!☆49Updated last year
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆49Updated last year
- ☆76Updated 6 months ago
- ☆48Updated last year
- Codebase accompanying the Summary of a Haystack paper.☆79Updated 9 months ago
- Fullstack chatbot application☆11Updated last week
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models☆108Updated 3 months ago
- ☆40Updated last year
- ☆94Updated 3 months ago
- Lite weight wrapper for the independent implementation of SPLADE++ models for search & retrieval pipelines. Models and Library created by…☆31Updated 10 months ago
- A method for steering llms to better follow instructions☆47Updated last week
- Official Repo for CRMArena and CRMArena-Pro☆101Updated 3 weeks ago
- ☆56Updated 3 weeks ago
- Dynamic Metadata based RAG Framework☆75Updated 11 months ago
- ☆48Updated 5 months ago
- Analysis code for paper "SciArena: An Open Evaluation Platform for Foundation Models in Scientific Literature Tasks"☆42Updated 2 weeks ago
- Supervised instruction finetuning for LLM with HF trainer and Deepspeed☆35Updated 2 years ago
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)☆75Updated 8 months ago
- A curated list of materials on AI guardails☆39Updated last month
- ☆23Updated 5 months ago
- ☆20Updated 3 months ago
- "Syntriever: How to Train Your Retriever with Synthetic Data from LLMs" the Nations of the Americas Chapter of the Association for Comput…☆26Updated 4 months ago
- Multimodal AI workloads: batch inference, model training and online serving.☆22Updated 3 weeks ago
- Complete example of how to build an Agentic RAG architecture with Redis, Amazon Bedrock, and LlamaIndex.☆95Updated 7 months ago
- A simple Streamlit application to visualize document chunks and queries in embedding space 🗺️🔍☆13Updated 3 months ago