fsndzomga / open_source_lrmLinks

☆10

Alternatives and similar repositories for open_source_lrm

Users that are interested in open_source_lrm are comparing it to the libraries listed below

Sorting:

catid / lllm
Latent Large Language Models
☆18Updated 11 months ago
s-smits / grpo-optuna
Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna
☆55Updated 6 months ago
rosmineb / unit_test_rl
Project code for training LLMs to write better unit tests + code
☆21Updated 2 months ago
arcee-ai / DAM
☆53Updated 9 months ago
brendanhogan / completion_tree_view
☆13Updated 3 months ago
ahstat / episodic-memory-benchmark
Synthetic data generation and benchmark implementation for "Episodic Memories Generation and Evaluation Benchmark for Large Language Mode…
☆49Updated 3 months ago
xjdr-alt / muzero_sketch
☆38Updated last year
phunterlau / paper_without_code
LLM reads a paper and produce a working prototype
☆58Updated 3 months ago
leloykun / modded-nanogpt
NanoGPT (124M) quality in 2.67B tokens
☆28Updated last month
minosvasilias / simple_grpo
Simple GRPO scripts and configurations.
☆59Updated 6 months ago
joshuacnf / Ctrl-G
☆88Updated 7 months ago
xjdr-alt / llmri
look how they massacred my boy
☆63Updated 9 months ago
ElleLeonne / Lightning-ReLoRA
A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.
☆33Updated last year
matthelmer / DSPy-examples
Example code using the DSPy framework.
☆19Updated last year
matthewrenze / jhu-concise-cot
The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models
☆22Updated 8 months ago
automix-llm / automix
Mixing Language Models with Self-Verification and Meta-Verification
☆105Updated 7 months ago
kubernetes-bad / reward-composer
Lego for GRPO
☆28Updated 2 months ago
SebastianBodza / EnsembleForecasting
Using multiple LLMs for ensemble Forecasting
☆16Updated last year
AnswerDotAI / ModernBERT-Instruct-mini-cookbook
☆49Updated 5 months ago
ZeroSumEval / ZeroSumEval
A framework for pitting LLMs against each other in an evolving library of games ⚔
☆32Updated 3 months ago
Alex-Gurung / ReasoningNCP
Official repo for Learning to Reason for Long-Form Story Generation
☆68Updated 3 months ago
EduardTalianu / EntropixLab
entropix style sampling + GUI
☆26Updated 9 months ago
weaviate-tutorials / Hurricane
Writing Blog Posts with Generative Feedback Loops!
☆50Updated last year
joey00072 / Attention-as-graph
alternative way to calculating self attention
☆18Updated last year
CERC-AAI / Robin
☆63Updated 10 months ago
lab-v2 / langdiversity
Elevate your language models with insightful diversity metrics.
☆11Updated last year
ContextualAI / CLAIR_and_APO
Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment
☆60Updated 11 months ago
allenai / infinigram-api
☆73Updated 2 weeks ago
Birch-san / booru-embed
[WIP] Transformer to embed Danbooru labelsets
☆13Updated last year
kumar-shridhar / Screws
SCREWS: A Modular Framework for Reasoning with Revisions
☆27Updated last year