aiverify-foundation / LLM-Evals-CatalogueLinks
This repository stems from our paper, “Cataloguing LLM Evaluations”, and serves as a living, collaborative catalogue of LLM evaluation frameworks, benchmarks and papers.
☆17Updated last year
Alternatives and similar repositories for LLM-Evals-Catalogue
Users that are interested in LLM-Evals-Catalogue are comparing it to the libraries listed below
Sorting:
- A comprehensive guide to LLM evaluation methods designed to assist in identifying the most suitable evaluation techniques for various use…☆123Updated last week
- Sample notebooks and prompts for LLM evaluation☆135Updated last month
- Initiative to evaluate and rank the most popular LLMs across common task types based on their propensity to hallucinate.☆111Updated 10 months ago
- ☆20Updated last year
- ☆71Updated 8 months ago
- ☆145Updated 11 months ago
- ARAGOG- Advanced RAG Output Grading. Exploring and comparing various Retrieval-Augmented Generation (RAG) techniques on AI research paper…☆107Updated last year
- What, Why and How of LLMs.☆75Updated last year
- ☆37Updated last year
- FrugalGPT: better quality and lower cost for LLM applications☆223Updated 5 months ago
- Notebooks and articles related to LLMs☆26Updated last year
- LangFair is a Python library for conducting use-case level LLM bias and fairness assessments☆219Updated this week
- A library for evaluating Retrieval-Augmented Generation (RAG) systems (The traditional ways).☆37Updated 11 months ago
- RAGArch is a Streamlit-based application that empowers users to experiment with various components and parameters of Retrieval-Augmented …☆85Updated last year
- Fiddler Auditor is a tool to evaluate language models.☆183Updated last year
- Repository to demonstrate Chain of Table reasoning with multiple tables powered by LangGraph☆144Updated last year
- A semantic research engine to get relevant papers based on a user query. Application frontend with Chainlit Copilot. Observability with L…☆83Updated last year
- DSPY on action with OpenSource LLMs.☆72Updated last year
- An index of all of our weekly concepts + code events for aspiring AI Engineers and Business Leaders!!☆75Updated last week
- Automated knowledge graph creation SDK☆122Updated 7 months ago
- 🦜💯 Flex those feathers!☆252Updated 8 months ago
- Mistral + Haystack: build RAG pipelines that rock 🤘☆105Updated last year
- Benchmark various LLM Structured Output frameworks: Instructor, Mirascope, Langchain, LlamaIndex, Fructose, Marvin, Outlines, etc on task…☆173Updated 9 months ago
- This repository implements the chain of verification paper by Meta AI☆171Updated last year
- Model, Code & Data for the EMNLP'23 paper "Making Large Language Models Better Data Creators"☆135Updated last year
- This software contains an agent based on LangGraph & LangChain for solving general requests in the Whatsapp channel of this medical clini…☆203Updated 9 months ago
- Learn to build and customize multi-agent systems using the AutoGen. The course teaches you to implement complex AI applications through a…☆93Updated last year
- ☆87Updated 2 months ago
- Build Enterprise RAG (Retriver Augmented Generation) Pipelines to tackle various Generative AI use cases with LLM's by simply plugging co…☆112Updated 11 months ago
- Lean implementation of various multi-agent LLM methods, including Iteration of Thought (IoT)☆115Updated 5 months ago