OpenGPTX / lm-evaluation-harnessLinks
A framework for few-shot evaluation of autoregressive language models.
☆12Updated 5 months ago
Alternatives and similar repositories for lm-evaluation-harness
Users that are interested in lm-evaluation-harness are comparing it to the libraries listed below
Sorting:
- ☆20Updated 3 weeks ago
- GoldFinch and other hybrid transformer components☆45Updated last year
- Triton Implementation of HyperAttention Algorithm☆48Updated 2 years ago
- Entailment self-training☆25Updated 2 years ago
- ☆26Updated 2 years ago
- ☆55Updated last year
- A repository for research on medium sized language models.☆77Updated last year
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆62Updated last year
- Advanced Reasoning Benchmark Dataset for LLMs☆47Updated 2 years ago
- An open source replication of the stawberry method that leverages Monte Carlo Search with PPO and or DPO☆29Updated last month
- MatFormer repo☆67Updated last year
- Implementation of the LDP module block in PyTorch and Zeta from the paper: "MobileVLM: A Fast, Strong and Open Vision Language Assistant …☆15Updated last year
- Source code for Activated LoRA☆23Updated last month
- One Initialization to Rule them All: Fine-tuning via Explained Variance Adaptation☆46Updated 2 months ago
- ☆29Updated last week
- Tasks and tutorials using Graphore's IPU with Hugging Face. Originally at https://github.com/gradient-ai/Graphcore-HuggingFace☆16Updated last year
- Minimum Description Length probing for neural network representations☆20Updated 11 months ago
- UQ: Assessing Language Models on Unsolved Questions☆29Updated 4 months ago
- Fork of Flame repo for training of some new stuff in development☆19Updated last week
- Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification☆11Updated 2 years ago
- A testbed for agents and environments that can automatically improve models through data generation.☆27Updated 10 months ago
- An EXA-Scale repository of Multi-Modality AI resources from papers and models, to foundational libraries!☆40Updated last year
- Training hybrid models for dummies.☆29Updated 2 months ago
- Aioli: A unified optimization framework for language model data mixing☆32Updated 11 months ago
- Open Implementations of LLM Analyses☆107Updated last year
- Open sourced predictions, execution logs, trajectories, and results from model inference + evaluation runs on the SWE-bench task.☆15Updated last year
- Official repository for the paper "Approximating Two-Layer Feedforward Networks for Efficient Transformers"☆38Updated 6 months ago
- ☆17Updated last year
- ☆71Updated last year
- Q-Probe: A Lightweight Approach to Reward Maximization for Language Models☆41Updated last year