tcapelle / mistral_wandbLinks
A full fledged mistral+wandb
β13Updated 11 months ago
Alternatives and similar repositories for mistral_wandb
Users that are interested in mistral_wandb are comparing it to the libraries listed below
Sorting:
- Codebase accompanying the Summary of a Haystack paper.β79Updated 10 months ago
- This is the reproduction repository for my π€ Hugging Face blog post on synthetic dataβ68Updated last year
- A comprehensive guide to LLM evaluation methods designed to assist in identifying the most suitable evaluation techniques for various useβ¦β132Updated last week
- A small library of LLM judgesβ251Updated last week
- Includes examples on how to evaluate LLMsβ23Updated 9 months ago
- β145Updated last year
- β80Updated last year
- A framework for standardizing evaluations of large foundation models, beyond single-score reporting and rankings.β166Updated last week
- Sample notebooks and prompts for LLM evaluationβ138Updated 2 months ago
- Doing simple retrieval from LLM models at various context lengths to measure accuracyβ102Updated last year
- β72Updated last year
- Let Me Speak Freely? A Study on the Impact of Format Restrictions on Performance of Large Language Modelsβ22Updated 2 months ago
- The repository contains generative AI analytics platform application code.β26Updated 3 months ago
- Learning to route instances for Human vs AI Feedback (ACL 2025 Main)β23Updated 3 weeks ago
- AIDE: the Machine Learning CodeGen Agentβ24Updated 10 months ago
- Retrieval Augmented Generation Generalized Evaluation Datasetβ54Updated 3 weeks ago
- π§ Compare how Agent systems perform on several benchmarks. ππβ100Updated last week
- Lightweight demos for finetuning LLMs. Powered by π€ transformers and open-source datasets.β78Updated 9 months ago
- β23Updated 2 years ago
- Testing speed and accuracy of RAG with, and without Cross Encoder Reranker.β48Updated last year
- ReBase: Training Task Experts through Retrieval Based Distillationβ29Updated 6 months ago
- ARAGOG- Advanced RAG Output Grading. Exploring and comparing various Retrieval-Augmented Generation (RAG) techniques on AI research paperβ¦β107Updated last year
- β41Updated last year
- RAGElo is a set of tools that helps you selecting the best RAG-based LLM agents by using an Elo rankerβ114Updated last month
- This repo is the central repo for all the RAG Evaluation reference material and partner workshopβ73Updated 3 months ago
- β88Updated last year
- β48Updated last year
- Lite weight wrapper for the independent implementation of SPLADE++ models for search & retrieval pipelines. Models and Library created byβ¦β32Updated 11 months ago
- Exploring limitations of LLM-as-a-judgeβ19Updated 11 months ago
- β30Updated 2 years ago