Zyphra / zcookbook

Training hybrid models for dummies.

☆27

Alternatives and similar repositories for zcookbook

Users who are interested in zcookbook are comparing it to the libraries listed below.

s-smits / grpo-optuna
Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna
☆58 Updated last week
kyegomez / OpenStrawberry
An open source replication of the strawberry method that leverages Monte Carlo Search with PPO and/or DPO
☆29 Updated this week
catid / lllm
Latent Large Language Models
☆19 Updated last year
xjdr-alt / muzero_sketch
☆40 Updated last year
joey00072 / Attention-as-graph
Alternative way of calculating self-attention
☆18 Updated last year
brendanhogan / completion_tree_view
☆14 Updated 5 months ago
zaydzuhri / flame
Fork of Flame repo for training of some new stuff in development
☆18 Updated 2 weeks ago
Birch-san / booru-embed
[WIP] Transformer to embed Danbooru labelsets
☆13 Updated last year
arcee-ai / DAM
☆55 Updated 11 months ago
joey00072 / ohara
Collection of autoregressive model implementations
☆86 Updated 6 months ago
Zyphra / Zyda_processing
☆39 Updated last year
kaiokendev / cutoff-len-is-context-len
Demonstration that fine-tuning a RoPE model on longer sequences than the pre-trained model was trained on adapts the model's context limit
☆62 Updated 2 years ago
okarthikb / state-space-models
☆28 Updated last year
argilla-io / distilabel-spin-dibt
Repository containing the SPIN experiments on the DIBT 10k ranked prompts
☆24 Updated last year
recursal / GoldFinch-paper
GoldFinch and other hybrid transformer components
☆45 Updated last year
kyutai-labs / dactory
☆43 Updated last week
official-elinas / zeus-llm-trainer
Zeus LLM Trainer is a rewrite of Stanford Alpaca aiming to be the trainer for all Large Language Models
☆69 Updated 2 years ago
fsndzomga / open_source_lrm
☆10 Updated last year
RiddleHe / llm-interp
A collection of lightweight interpretability scripts to understand how LLMs think
☆59 Updated last week
xjdr-alt / llmri
look how they massacred my boy
☆63 Updated last year
TRI-ML / linear_open_lm
A repository for research on medium-sized language models.
☆78 Updated last year
Algomancer / The-Daily-Train
Training Models Daily
☆16 Updated last year
kubernetes-bad / reward-composer
Lego for GRPO
☆30 Updated 4 months ago
RWKV / ZeroCoT
https://x.com/BlinkDL_AI/status/1884768989743882276
☆28 Updated 5 months ago
minosvasilias / simple_grpo
Simple GRPO scripts and configurations.
☆59 Updated 8 months ago
foundation-model-stack / bamba
Train, tune, and infer Bamba model
☆134 Updated 4 months ago
SonicCodes / subcloning
Implementation of https://arxiv.org/pdf/2312.09299
☆21 Updated last year
facebookresearch / matrix
Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera…
☆97 Updated this week
CERC-AAI / Robin
☆63 Updated last year
Alex-Gurung / ReasoningNCP
Official repo for Learning to Reason for Long-Form Story Generation
☆72 Updated 6 months ago