CarperAI / OpenELM
Evolution Through Large Models
☆711Updated last year
Alternatives and similar repositories for OpenELM:
Users that are interested in OpenELM are comparing it to the libraries listed below
- A repository for research on medium sized language models.☆491Updated last month
- A library with extensible implementations of DPO, KTO, PPO, ORPO, and other human-aware loss functions (HALOs).☆804Updated last week
- [ACL2023] We introduce LLM-Blender, an innovative ensembling framework to attain consistently superior performance by leveraging the dive…☆913Updated 4 months ago
- A simulation framework for RLHF and alternatives. Develop your RLHF method without collecting human data.☆794Updated 7 months ago
- [NeurIPS 2023] MeZO: Fine-Tuning Language Models with Just Forward Passes. https://arxiv.org/abs/2305.17333☆1,083Updated last year
- Used for adaptive human in the loop evaluation of language and embedding models.☆306Updated last year
- The hub for EleutherAI's work on interpretability and learning dynamics☆2,379Updated 2 months ago
- Inference code for Persimmon-8B☆416Updated last year
- Data and tools for generating and inspecting OLMo pre-training data.☆1,086Updated this week
- LaMini-LM: A Diverse Herd of Distilled Models from Large-Scale Instructions☆819Updated last year
- Implementation of the training framework proposed in Self-Rewarding Language Model, from MetaAI☆1,361Updated 10 months ago
- Tools for understanding how transformer predictions are built layer-by-layer☆474Updated 8 months ago
- Human preference data for "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback"☆1,682Updated last year
- ☆412Updated last year
- 800,000 step-level correctness labels on LLM solutions to MATH problems☆1,907Updated last year
- Fine-tune mistral-7B on 3090s, a100s, h100s☆705Updated last year
- Simple next-token-prediction for RLHF☆222Updated last year
- This repository contains code and tooling for the Abacus.AI LLM Context Expansion project. Also included are evaluation scripts and bench…☆584Updated last year
- Ungreedy subword tokenizer and vocabulary trainer for Python, Go & Javascript☆566Updated 7 months ago
- Code for Quiet-STaR☆713Updated 6 months ago
- ☆864Updated last year
- PaL: Program-Aided Language Models (ICML 2023)☆481Updated last year
- Keeping language models honest by directly eliciting knowledge encoded in their activations.☆195Updated last week
- Inference code for Mistral and Mixtral hacked up into original Llama implementation☆371Updated last year
- Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Fl…☆2,456Updated 6 months ago
- Extend existing LLMs way beyond the original training length with constant memory usage, without retraining☆687Updated 10 months ago
- An open collection of methodologies to help with successful training of large language models.☆472Updated last year
- Code for the paper "Rethinking Benchmark and Contamination for Language Models with Rephrased Samples"☆296Updated last year
- ☆229Updated 2 years ago
- Minimalistic large language model 3D-parallelism training☆1,483Updated this week