attentionmech / trunkLinks

LLM trunk in 2d

☆10

Alternatives and similar repositories for trunk

Users that are interested in trunk are comparing it to the libraries listed below

Sorting:

attentionmech / dex
Pokedex for LLMs
☆13Updated 2 months ago
Zyphra / transformers_zamba2
☆48Updated 4 months ago
kubernetes-bad / reward-composer
Lego for GRPO
☆28Updated last month
ZihanWang314 / coeCheck
☆18Updated 3 months ago
AtakanTekparmak / agento
Very minimal (and stateless) agent framework
☆44Updated 5 months ago
attentionmech / tensorlens
aesthetic tensor visualiser
☆24Updated 2 months ago
kyegomez / OpenStrawberry
An open source replication of the stawberry method that leverages Monte Carlo Search with PPO and or DPO
☆29Updated this week
Think-a-Tron / evolve
open source alpha evolve
☆64Updated last month
menloresearch / deep-research
☆43Updated this week
attentionmech / smolbox
smolbox of recipies
☆28Updated 2 months ago
OpenPipe / rl-experiments
OpenPipe Reinforcement Learning Experiments
☆25Updated 3 months ago
devvrit / matformer
MatFormer repo
☆31Updated 6 months ago
facebookresearch / ZeroSumEval
A framework for pitting LLMs against each other in an evolving library of games ⚔
☆31Updated 2 months ago
tokenbender / avataRL
rl from zero pretrain, can it be done? we'll see.
☆56Updated this week
ritabratamaiti / AnyModal
AnyModal is a Flexible Multimodal Language Model Framework for PyTorch
☆96Updated 6 months ago
VatsaDev / NanoPoor
NanoGPT-speedrunning for the poor T4 enjoyers
☆66Updated 2 months ago
brendanhogan / picoDeepResearch
☆63Updated last month
joey00072 / Attention-as-graph
alternative way to calculating self attention
☆18Updated last year
KindXiaoming / physics_of_skill_learning
We study toy models of skill learning.
☆29Updated 5 months ago
arcee-ai / DAM
☆51Updated 7 months ago
ElleLeonne / Lightning-ReLoRA
A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.
☆33Updated last year
SebastianBodza / EnsembleForecasting
Using multiple LLMs for ensemble Forecasting
☆16Updated last year
EduardTalianu / EntropixLab
entropix style sampling + GUI
☆26Updated 7 months ago
fal-ai-community / llmdifftracker
Lightweight package that tracks and summarizes code changes using LLMs (Large Language Models)
☆34Updated 4 months ago
axolotl-ai-cloud / axolotl-cookbook
☆34Updated 3 months ago
UpstageAI / evalverse-IFEval
Submodule of evalverse forked from [google-research/instruction_following_eval](https://github.com/google-research/google-research/tree/m…
☆14Updated last year
Pleias / Quest-Best-Tokens
An introduction to LLM Sampling
☆78Updated 6 months ago
mkurman / grpo-llm-evaluator
Fine-tunes a student LLM using teacher feedback for improved reasoning and answer quality. Implements GRPO with teacher-provided evaluati…
☆45Updated last month
ibm-granite / granite-embedding-models
☆29Updated this week
facebookresearch / matrix
Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera…
☆71Updated this week