fsndzomga / open_source_lrmLinks
☆10Updated last year
Alternatives and similar repositories for open_source_lrm
Users that are interested in open_source_lrm are comparing it to the libraries listed below
Sorting:
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna☆59Updated 3 months ago
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models☆24Updated last year
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.☆34Updated last year
- Latent Large Language Models☆19Updated last year
- ☆56Updated last year
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆61Updated last year
- Project code for training LLMs to write better unit tests + code☆21Updated 8 months ago
- Simple GRPO scripts and configurations.☆59Updated last year
- Zeus LLM Trainer is a rewrite of Stanford Alpaca aiming to be the trainer for all Large Language Models☆70Updated 2 years ago
- A framework for pitting LLMs against each other in an evolving library of games ⚔☆35Updated 9 months ago
- ☆53Updated last year
- ☆40Updated last year
- ☆105Updated last year
- ☆27Updated last year
- alternative way to calculating self attention☆18Updated last year
- Comparing retrieval abilities from GPT4-Turbo and a RAG system on a toy example for various context lengths☆35Updated 2 years ago
- Example code using the DSPy framework.☆20Updated last year
- ☆45Updated 2 years ago
- Small, simple agent task environments for training and evaluation☆19Updated last year
- Python library to use Pleias-RAG models☆68Updated 9 months ago
- GoldFinch and other hybrid transformer components☆45Updated last year
- ☆31Updated last year
- Official repository for the paper "Approximating Two-Layer Feedforward Networks for Efficient Transformers"☆38Updated 8 months ago
- ☆32Updated last year
- ☆41Updated last year
- Official repo for Learning to Reason for Long-Form Story Generation☆74Updated 9 months ago
- Luth is a state-of-the-art series of fine-tuned LLMs for French☆41Updated 3 months ago
- ☆92Updated 2 weeks ago
- ☆25Updated 9 months ago
- Writing Blog Posts with Generative Feedback Loops!☆50Updated last year