dmahan93 / lm-evaluation-harness
A framework for few-shot evaluation of autoregressive language models.
☆16Updated last year
Alternatives and similar repositories for lm-evaluation-harness
Users who are interested in lm-evaluation-harness are comparing it to the libraries listed below.
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆49Updated last year
- Notus is a collection of fine-tuned LLMs using SFT, DPO, SFT+DPO, and/or any other RLHF techniques, while always keeping a data-first app…☆168Updated last year
- ☆23Updated 2 years ago
- Using multiple LLMs for ensemble Forecasting☆16Updated last year
- Comparing retrieval abilities from GPT4-Turbo and a RAG system on a toy example for various context lengths☆35Updated last year
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆66Updated 9 months ago
- ☆88Updated last year
- Explore the use of DSPy for extracting features from PDFs 🔎☆45Updated last year
- ☆37Updated 2 years ago
- HuggingChat-like UI in Gradio☆71Updated 2 years ago
- ☆33Updated last year
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets.☆78Updated 9 months ago
- Finetune Falcon, LLaMA, MPT, and RedPajama on consumer hardware using PEFT LoRA☆104Updated 2 months ago
- ☆49Updated 6 months ago
- LLM-Training-API: Including Embeddings & ReRankers, mergekit, LaserRMT☆27Updated last year
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.☆33Updated 3 months ago
- 🎨 Imagine what Picasso could have done with AI. Self-host your StableDiffusion API.☆50Updated 2 years ago
- ☆47Updated last year
- Using open source LLMs to build synthetic datasets for direct preference optimization☆65Updated last year
- ☆33Updated 2 years ago
- Writing Blog Posts with Generative Feedback Loops!☆50Updated last year
- Tools for formatting large language model prompts.☆13Updated last year
- Doing simple retrieval from LLM models at various context lengths to measure accuracy☆102Updated last year
- A collection of notebooks for the Hugging Face blog series (https://huggingface.co/blog).☆45Updated last year
- ☆30Updated last year
- Zeus LLM Trainer is a rewrite of Stanford Alpaca aiming to be the trainer for all Large Language Models☆70Updated last year
- Small and Efficient Mathematical Reasoning LLMs☆71Updated last year
- 💙 Unstructured Data Connectors for Haystack 2.0☆17Updated last year
- QLoRA with Enhanced Multi GPU Support☆37Updated 2 years ago
- A data-centric AI package for ML/AI. Get the best high-quality data for the best results. Discord: https://discord.gg/t6ADqBKrdZ☆63Updated last year