dmahan93 / lm-evaluation-harness
A framework for few-shot evaluation of autoregressive language models.
☆16Updated last year
Alternatives and similar repositories for lm-evaluation-harness
Users that are interested in lm-evaluation-harness are comparing it to the libraries listed below
- ☆23Updated last year
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆49Updated 10 months ago
- Using multiple LLMs for ensemble Forecasting☆16Updated last year
- ☆37Updated 2 years ago
- Writing Blog Posts with Generative Feedback Loops!☆48Updated last year
- ☆20Updated last year
- ☆43Updated 3 months ago
- Using open source LLMs to build synthetic datasets for direct preference optimization☆63Updated last year
- ☆49Updated 6 months ago
- ☆11Updated last year
- Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format☆27Updated last year
- ☆17Updated last week
- Explore the use of DSPy for extracting features from PDFs 🔎☆40Updated last year
- ☆31Updated last year
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models☆22Updated 6 months ago
- Supervised instruction finetuning for LLM with HF trainer and Deepspeed☆35Updated last year
- Evaluating LLMs with CommonGen-Lite☆90Updated last year
- Example for Logging LLM Evaluator Prompt Responses☆15Updated last year
- LLM-Training-API: Including Embeddings & ReRankers, mergekit, LaserRMT☆27Updated last year
- ☆48Updated last year
- ☆30Updated 10 months ago
- DSPy program/pipeline inspector widget for Jupyter/VSCode Notebooks.☆35Updated last year
- ☆19Updated 7 months ago
- Using modal.com to process FineWeb-edu data☆20Updated last month
- Set of scripts to finetune LLMs☆37Updated last year
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.☆33Updated 3 weeks ago
- ☆57Updated 8 months ago
- Truly flash implementation of DeBERTa disentangled attention mechanism.☆55Updated 2 weeks ago
- Tools for merging pretrained large language models.☆19Updated 11 months ago
- Hugging Face Inference Toolkit used to serve transformers, sentence-transformers, and diffusers models.☆72Updated last week