aws-samples / evaluating-large-language-models-using-llm-as-a-judgeLinks
☆20Updated 10 months ago
Alternatives and similar repositories for evaluating-large-language-models-using-llm-as-a-judge
Users that are interested in evaluating-large-language-models-using-llm-as-a-judge are comparing it to the libraries listed below
Sorting:
- Question Answering Generative AI application with Large Language Models (LLMs) and Amazon OpenSearch Service☆27Updated 11 months ago
- ☆45Updated last year
- ☆24Updated 11 months ago
- ☆53Updated last year
- A simple Streamlit application to visualize document chunks and queries in embedding space 🗺️🔍☆13Updated 7 months ago
- ☆49Updated 6 months ago
- ☆21Updated last year
- Generative AI with Amazon Bedrock, published by Packt☆26Updated last year
- ☆55Updated 4 months ago
- ☆80Updated last year
- Streamlit app for recommending eval functions using prompt diffs☆30Updated last year
- Writing Blog Posts with Generative Feedback Loops!☆50Updated last year
- ☆80Updated last week
- ☆22Updated 11 months ago
- Supervised instruction finetuning for LLM with HF trainer and Deepspeed☆36Updated 2 years ago
- A method for steering llms to better follow instructions☆58Updated 3 months ago
- Large Language Model Hosting Container☆90Updated last month
- ☆43Updated last year
- ☆51Updated last year
- Code Repository for Blog - How to Productionize Large Language Models (LLMs)☆12Updated last year
- LangChain, Llama2-Chat, and zero- and few-shot prompting are used to generate synthetic datasets for IR and RAG system evaluation☆37Updated last year
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆51Updated last year
- ☆26Updated last year
- ☆146Updated last year
- Experimental Code for StructuredRAG: JSON Response Formatting with Large Language Models☆113Updated 7 months ago
- Creating Generative AI Apps which work☆17Updated 7 months ago
- Context is Key: Combining Embedding-based Retrieval with LLMs for Comprehensive Knowledge Enrichment☆31Updated 2 years ago
- ☆12Updated 2 years ago
- The Journey of RAG: From Notebook to Microservices☆25Updated last year
- Adding NeMo Guardrails to a LlamaIndex RAG pipeline☆41Updated last year