bjoernpl / lm-evaluation-harness-de
A framework for few-shot evaluation of autoregressive language models.
☆13Updated last year
Alternatives and similar repositories for lm-evaluation-harness-de
Users who are interested in lm-evaluation-harness-de are comparing it to the libraries listed below.
- A repository containing the code for translating popular LLM benchmarks to German.☆25Updated last year
- ☆121Updated 2 months ago
- Manage scalable open LLM inference endpoints in Slurm clusters☆258Updated 10 months ago
- A collection of datasets for language model pretraining including scripts for downloading, preprocessing, and sampling.☆59Updated 10 months ago
- Let's build better datasets, together!☆259Updated 5 months ago
- The Batched API provides a flexible and efficient way to process multiple requests in a batch, with a primary focus on dynamic batching o…☆136Updated 2 weeks ago
- experiments with inference on llama☆104Updated last year
- Spherical Merge Pytorch/HF format Language Models with minimal feature loss.☆124Updated last year
- Comprehensive analysis of the differences in performance of QLoRA, LoRA, and full fine-tunes.☆82Updated last year
- Code for the MTEB Arena☆19Updated 8 months ago
- EvolKit is an innovative framework designed to automatically enhance the complexity of instructions used for fine-tuning Large Language M…☆221Updated 7 months ago
- This is our own implementation of 'Layer Selective Rank Reduction'☆239Updated last year
- Using open source LLMs to build synthetic datasets for direct preference optimization☆63Updated last year
- German Alpaca Dataset (Cleaned + Translated)☆24Updated 2 years ago
- A set of scripts and notebooks on LLM finetuning and dataset creation☆111Updated 8 months ago
- OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding.☆172Updated 4 months ago
- ☆38Updated 10 months ago
- ☆95Updated 5 months ago
- ☆76Updated last year
- ☆118Updated 9 months ago
- This repository contains the code for dataset curation and finetuning of instruct variant of the Bilingual OpenHathi model. The resultin…☆23Updated last year
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets.☆77Updated 7 months ago
- [Data + code] ExpertQA : Expert-Curated Questions and Attributed Answers☆128Updated last year
- Generalist and Lightweight Model for Text Classification☆128Updated 2 weeks ago
- BABILong is a benchmark for LLM evaluation using the needle-in-a-haystack approach.☆202Updated last month
- Fully fine-tune large models like Mistral, Llama-2-13B, or Qwen-14B completely for free☆231Updated 7 months ago
- Toolkit for attaching, training, saving and loading of new heads for transformer models☆279Updated 3 months ago
- ☆66Updated last year
- Simple UI for debugging correlations of text embeddings☆256Updated last week
- A stable, fast and easy-to-use inference library with a focus on a sync-to-async API☆45Updated 8 months ago