CarperAI / OpenELM
Evolution Through Large Models
☆696Updated last year
Related projects ⓘ
Alternatives and complementary repositories for OpenELM
- Human preference data for "Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback"☆1,625Updated last year
- A simulation framework for RLHF and alternatives. Develop your RLHF method without collecting human data.☆783Updated 4 months ago
- A repository for research on medium sized language models.☆480Updated this week
- Dromedary: towards helpful, ethical and reliable LLMs.☆1,128Updated last year
- [ICML 2024] Official repository for "Language Agent Tree Search Unifies Reasoning Acting and Planning in Language Models"☆685Updated 3 months ago
- 800,000 step-level correctness labels on LLM solutions to MATH problems☆1,669Updated last year
- Implementation of the training framework proposed in Self-Rewarding Language Model, from MetaAI☆1,338Updated 7 months ago
- A library with extensible implementations of DPO, KTO, PPO, ORPO, and other human-aware loss functions (HALOs).☆745Updated this week
- [ACL2023] We introduce LLM-Blender, an innovative ensembling framework to attain consistently superior performance by leveraging the dive…☆886Updated last month
- Used for adaptive human in the loop evaluation of language and embedding models.☆304Updated last year
- ☆860Updated 11 months ago
- ☆508Updated 9 months ago
- Code for fine-tuning Platypus fam LLMs using LoRA☆623Updated 9 months ago
- Code for Quiet-STaR☆654Updated 3 months ago
- ICML 2024: Improving Factuality and Reasoning in Language Models through Multiagent Debate☆355Updated last year
- ☆1,080Updated 10 months ago
- DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models☆840Updated 7 months ago
- The hub for EleutherAI's work on interpretability and learning dynamics☆2,285Updated 3 weeks ago
- Implementation of the specific Transformer architecture from PaLM - Scaling Language Modeling with Pathways☆821Updated 2 years ago
- Extend existing LLMs way beyond the original training length with constant memory usage, without retraining☆677Updated 7 months ago
- Official repository of Evolutionary Optimization of Model Merging Recipes☆1,231Updated 7 months ago
- [NeurIPS '23 Spotlight] Thought Cloning: Learning to Think while Acting by Imitating Human Thinking☆252Updated 4 months ago
- Convolutions for Sequence Modeling☆869Updated 5 months ago
- Ask Me Anything language model prompting☆539Updated last year
- Inference code for Persimmon-8B☆416Updated last year
- ☆453Updated this week
- ☆344Updated last year
- [NeurIPS 2022] 🛒WebShop: Towards Scalable Real-World Web Interaction with Grounded Language Agents☆277Updated 2 months ago