attentionmech / trunk
LLM trunk in 2d
☆10Updated 2 months ago
Alternatives and similar repositories for trunk
Users that are interested in trunk are comparing it to the libraries listed below
- Pokedex for LLMs☆13Updated 2 months ago
- ☆48Updated 4 months ago
- Lego for GRPO☆28Updated last month
- ☆18Updated 3 months ago
- Very minimal (and stateless) agent framework☆44Updated 5 months ago
- aesthetic tensor visualiser☆24Updated 2 months ago
- An open source replication of the strawberry method that leverages Monte Carlo Search with PPO and/or DPO☆29Updated this week
- open source alpha evolve☆64Updated last month
- ☆43Updated this week
- smolbox of recipes☆28Updated 2 months ago
- OpenPipe Reinforcement Learning Experiments☆25Updated 3 months ago
- MatFormer repo☆31Updated 6 months ago
- A framework for pitting LLMs against each other in an evolving library of games ⚔☆31Updated 2 months ago
- rl from zero pretrain, can it be done? we'll see.☆56Updated this week
- AnyModal is a Flexible Multimodal Language Model Framework for PyTorch☆96Updated 6 months ago
- NanoGPT-speedrunning for the poor T4 enjoyers☆66Updated 2 months ago
- ☆63Updated last month
- alternative way of calculating self attention☆18Updated last year
- We study toy models of skill learning.☆29Updated 5 months ago
- ☆51Updated 7 months ago
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's PyTorch Lightning suite.☆33Updated last year
- Using multiple LLMs for ensemble Forecasting☆16Updated last year
- entropix style sampling + GUI☆26Updated 7 months ago
- Lightweight package that tracks and summarizes code changes using LLMs (Large Language Models)☆34Updated 4 months ago
- ☆34Updated 3 months ago
- Submodule of evalverse forked from [google-research/instruction_following_eval](https://github.com/google-research/google-research/tree/m…☆14Updated last year
- An introduction to LLM Sampling☆78Updated 6 months ago
- Fine-tunes a student LLM using teacher feedback for improved reasoning and answer quality. Implements GRPO with teacher-provided evaluati…☆45Updated last month
- ☆29Updated this week
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera…☆71Updated this week