aws-samples / evaluating-large-language-models-using-llm-as-a-judge
☆13Updated 2 months ago
Alternatives and similar repositories for evaluating-large-language-models-using-llm-as-a-judge:
Users that are interested in evaluating-large-language-models-using-llm-as-a-judge are comparing it to the libraries listed below
- ☆18Updated 5 months ago
- Writing Blog Posts with Generative Feedback Loops!☆47Updated last year
- ☆1Updated 8 months ago
- Streamlit app for recommending eval functions using prompt diffs☆27Updated last year
- A framework for simulating e-commerce data and interactions that can be used to build recommendation systems☆10Updated last year
- 💙 Unstructured Data Connectors for Haystack 2.0☆16Updated last year
- Dynamic Metadata based RAG Framework☆72Updated 7 months ago
- ☆41Updated 3 months ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆49Updated 8 months ago
- Check for data drift between two OpenAI multi-turn chat jsonl files.☆38Updated 11 months ago
- QLLM: A powerful CLI for seamless interaction with multiple Large Language Models. Simplify AI workflows, streamline development, and unl…☆33Updated 3 weeks ago
- Uses a Gradio interface to stream coding related responses from local and cloud based large language models. Pulls context from GitHub Re…☆20Updated last week
- The official evaluation suite and dynamic data release for MixEval.☆11Updated 5 months ago
- ☆30Updated 8 months ago
- ☆38Updated last week
- Supervised instruction finetuning for LLM with HF trainer and Deepspeed☆34Updated last year
- Complete example of how to build an Agentic RAG architecture with Redis, Amazon Bedrock, and LlamaIndex.☆91Updated 3 months ago
- Experimentation on google's gemma model☆16Updated last year
- Creating Generative AI Apps which work☆16Updated 8 months ago
- ☆30Updated 2 months ago
- Unleash the full potential of exascale LLMs on consumer-class GPUs, proven by extensive benchmarks, with no long-term adjustments and min…☆25Updated 4 months ago
- Build Agentic workflows with function calling using open LLMs☆26Updated 2 weeks ago
- Tools for merging pretrained large language models.☆19Updated 9 months ago
- ☆15Updated 5 months ago
- AI_Powered_Dev_Search_Engine☆12Updated last year