aymeric-roucher / benchmark_agents
☆27Updated last year
Alternatives and similar repositories for benchmark_agents
Users that are interested in benchmark_agents are comparing it to the libraries listed below
Sorting:
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆49Updated 10 months ago
- ARAGOG- Advanced RAG Output Grading. Exploring and comparing various Retrieval-Augmented Generation (RAG) techniques on AI research paper…☆103Updated last year
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets.☆76Updated 6 months ago
- Using open source LLMs to build synthetic datasets for direct preference optimization☆61Updated last year
- ☆16Updated last year
- ☆52Updated 3 months ago
- ☆84Updated last year
- Build Enterprise RAG (Retriver Augmented Generation) Pipelines to tackle various Generative AI use cases with LLM's by simply plugging co…☆109Updated 9 months ago
- SCREWS: A Modular Framework for Reasoning with Revisions☆27Updated last year
- Collection of resources for RL and Reasoning☆25Updated 3 months ago
- Reward Model framework for LLM RLHF☆61Updated last year
- ☆20Updated 3 years ago
- Testing speed and accuracy of RAG with, and without Cross Encoder Reranker.☆48Updated last year
- Small and Efficient Mathematical Reasoning LLMs☆71Updated last year
- Building a chatbot powered with a RAG pipeline to read,summarize and quote the most relevant papers related to the user query.☆167Updated last year
- ☆48Updated 6 months ago
- EMNLP 2024 "Re-reading improves reasoning in large language models". Simply repeating the question to get bidirectional understanding for…☆25Updated 5 months ago
- Doing simple retrieval from LLM models at various context lengths to measure accuracy☆99Updated last year
- ☆77Updated 11 months ago
- Resources related to EACL 2023 paper "SwitchPrompt: Learning Domain-Specific Gated Soft Prompts for Classification in Low-Resource Domain…☆52Updated last year
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆66Updated 6 months ago
- A fast, local, and secure approach for training LLMs for coding tasks using GRPO with WebAssembly and interpreter feedback.☆23Updated last month
- ☆24Updated last year
- Codebase accompanying the Summary of a Haystack paper.☆78Updated 7 months ago
- ☆75Updated last year
- Some simple scripts that I use day-to-day when working with LLMs and Huggingface Hub☆162Updated last year
- ☆20Updated last year
- minimal LLM scripts for 24GB VRAM GPUs. training, inference, whatever☆38Updated last month
- Examples of RAG using LangChain with local LLMs - Mixtral 8x7B, Llama 2, Mistral 7B, Orca 2, Phi-2, Neural 7B☆38Updated last year
- ☆20Updated last year