philschmid / evaluate-llms
Includes examples on how to evaluate LLMs
☆23 Updated 7 months ago
Alternatives and similar repositories for evaluate-llms
Users who are interested in evaluate-llms are comparing it to the repositories listed below.
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute… ☆49 Updated 10 months ago
- Codebase accompanying the Summary of a Haystack paper. ☆78 Updated 8 months ago
- ☆45 Updated last year
- ☆77 Updated last year
- Mistral + Haystack: build RAG pipelines that rock 🤘 ☆105 Updated last year
- Large Language Model (LLM) powered evaluator for Retrieval Augmented Generation (RAG) pipelines. ☆27 Updated last year
- ☆15 Updated last year
- Fine-tune an LLM to perform batch inference and online serving. ☆111 Updated last week
- ☆89 Updated last year
- An index of all of our weekly concepts + code events for aspiring AI Engineers and Business Leaders!! ☆72 Updated 2 weeks ago
- Simple examples using Argilla tools to build AI ☆53 Updated 6 months ago
- ☆72 Updated last year
- A RAG that can scale 🧑🏻‍💻 ☆11 Updated last year
- ☆23 Updated last year
- My Gen AI research ☆11 Updated last year
- Sample notebooks and prompts for LLM evaluation ☆131 Updated this week
- A semantic research engine to get relevant papers based on a user query. Application frontend with Chainlit Copilot. Observability with L… ☆82 Updated last year
- ☆29 Updated last year
- Low latency, High Accuracy, Custom Query routers for Humans and Agents. Built by Prithivi Da ☆105 Updated 2 months ago
- ☆143 Updated 10 months ago
- Set of scripts to finetune LLMs ☆37 Updated last year
- Testing speed and accuracy of RAG with, and without Cross Encoder Reranker. ☆48 Updated last year
- GenAI Experimentation ☆57 Updated last month
- This repo is the central repo for all the RAG Evaluation reference material and partner workshop ☆64 Updated last month
- Repository containing awesome resources regarding Hugging Face tooling. ☆47 Updated last year
- ☆19 Updated 7 months ago
- RAG example using DSPy, Gradio, FastAPI ☆80 Updated last year
- A Hands-on Practical Guide to LlamaIndex ☆33 Updated 7 months ago
- Automatic Prompt Optimization ☆36 Updated last year
- Using LlamaIndex with Ray for productionizing LLM applications ☆71 Updated last year