lucidrains / self-rewarding-lm-pytorch
Implementation of the training framework proposed in Self-Rewarding Language Model, from MetaAI
☆1,359Updated 9 months ago
Alternatives and similar repositories for self-rewarding-lm-pytorch:
Users that are interested in self-rewarding-lm-pytorch are comparing it to the libraries listed below
- The official implementation of Self-Play Fine-Tuning (SPIN)☆1,105Updated 8 months ago
- YaRN: Efficient Context Window Extension of Large Language Models☆1,402Updated 9 months ago
- Codebase for Merging Language Models (ICML 2024)☆793Updated 8 months ago
- A family of open-sourced Mixture-of-Experts (MoE) Large Language Models☆1,427Updated 10 months ago
- Reaching LLaMA2 Performance with 0.1M Dollars☆967Updated 6 months ago
- Code for Quiet-STaR