aws-samples / evaluating-large-language-models-using-llm-as-a-judge
☆22 · Jan 13, 2025 · Updated last year
Alternatives and similar repositories for evaluating-large-language-models-using-llm-as-a-judge
Users that are interested in evaluating-large-language-models-using-llm-as-a-judge are comparing it to the libraries listed below
- ☆11 · Aug 21, 2023 · Updated 2 years ago
- Usage examples for byte-genie API ☆12 · Apr 27, 2024 · Updated last year
- Exploring limitations of LLM-as-a-judge ☆20 · Aug 17, 2024 · Updated last year
- ☆20 · Mar 22, 2024 · Updated last year
- ☆24 · Dec 12, 2024 · Updated last year
- ☆14 · Jun 16, 2020 · Updated 5 years ago
- Question Answering Generative AI application with Large Language Models (LLMs) and Amazon OpenSearch Service ☆28 · Dec 3, 2024 · Updated last year
- This repository contains several example sub-projects related to data modeling using Redis with Redis OM for Python ☆14 · Mar 2, 2022 · Updated 3 years ago
- ☆26 · May 13, 2025 · Updated 9 months ago
- Source code for GreaTer ICLR 2025 - Gradient Over Reasoning makes Smaller Language Models Strong Prompt Optimizers ☆34 · Apr 18, 2025 · Updated 9 months ago
- Oak National Academy's AI Auto Eval tools provide LLM as a judge evaluation on lesson plans and resources ☆17 · Nov 4, 2025 · Updated 3 months ago
- ☆10 · Nov 12, 2024 · Updated last year
- About NOSH ChartingSystem is an electronic health record system designed exclusively for doctors and patients. ☆10 · Mar 30, 2025 · Updated 10 months ago
- Personal Identifiable Information (PII) entity detection and performance enhancement with synthetic data generation ☆36 · Sep 4, 2024 · Updated last year
- This repository contains a pipeline for fine-tuning Large Language Models (LLMs) for Text-to-SQL conversion using General Reward Proximal… ☆44 · Apr 16, 2025 · Updated 9 months ago
- This is the base code for the Manufacturing Vision for amd64 architecture. ☆13 · Nov 11, 2022 · Updated 3 years ago
- This project showcases engaging interactions between two AI chatbots. ☆10 · Jan 10, 2024 · Updated 2 years ago
- 🥪💾 A sample of data from the `jaffle-shop-generator` that powers the Jaffle Shop spanning one year. ☆14 · Jan 23, 2025 · Updated last year
- ☆10 · Mar 31, 2025 · Updated 10 months ago
- This repository will contain the presentation and python jupyter notebooks for my DataHack Summit 2025 conference talk, Building Effectiv… ☆74 · Aug 25, 2025 · Updated 5 months ago
- Deploying a custom pytorch model to AWS Sagemaker using terraform and FastAPI ☆10 · Nov 10, 2023 · Updated 2 years ago
- Indexing framework designed for the automated creation of structured knowledge bases in Azure AI Search ☆14 · Jun 18, 2025 · Updated 7 months ago
- AWS Quick Start for VMware Tanzu Application Platform ☆12 · Nov 1, 2023 · Updated 2 years ago
- This is a repo for the LinkedIn Learning course AI Pair Programming with GitHub Copilot ☆12 · Sep 6, 2022 · Updated 3 years ago
- An IOT based mobile application to monitor the vitals such as ECG, Body Temperature, Blood Pressure using an ESP32 DevKit and React Nativ… ☆11 · Nov 14, 2024 · Updated last year
- ☆12 · Jun 17, 2023 · Updated 2 years ago
- ☆47 · Oct 29, 2024 · Updated last year
- GitHub Copilot Adoption Plan - Workshops - Labs ☆18 · Sep 18, 2025 · Updated 4 months ago
- Official repository for "DYPLOC: Dynamic Planning of Content Using Mixed Language Models for Opinion Text Generation" ☆10 · May 20, 2022 · Updated 3 years ago
- ☆12 · Aug 15, 2023 · Updated 2 years ago
- Residual Quantization Autoencoder, used for interpreting LLMs ☆13 · Jan 1, 2025 · Updated last year
- Simple web applications for the AGL platform. ☆12 · Jun 17, 2022 · Updated 3 years ago
- My small contribution to this new agentic world ☆14 · Aug 22, 2025 · Updated 5 months ago
- Lelylan API proxy ☆13 · Dec 12, 2016 · Updated 9 years ago
- Data generator for Amazon MSK ☆18 · May 7, 2024 · Updated last year
- Project examples ☆12 · Feb 15, 2016 · Updated 10 years ago
- This repository contains code example in how to write search queries with OpenSearch Python client ☆10 · Sep 20, 2023 · Updated 2 years ago
- An implementation of Compositional Attention: Disentangling Search and Retrieval by MILA ☆14 · Jun 1, 2022 · Updated 3 years ago
- Official code for the paper "Does CLIP's Generalization Performance Mainly Stem from High Train-Test Similarity?" (ICLR 2024) ☆10 · Aug 26, 2024 · Updated last year