catid / lllmLinks

Latent Large Language Models

☆19

Alternatives and similar repositories for lllm

Users that are interested in lllm are comparing it to the libraries listed below

Sorting:

s-smits / grpo-optuna
Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna
☆58Updated last week
xjdr-alt / muzero_sketch
☆40Updated last year
arcee-ai / DAM
☆55Updated 11 months ago
euclaise / supertrainer2000
☆50Updated last year
minosvasilias / simple_grpo
Simple GRPO scripts and configurations.
☆59Updated 8 months ago
official-elinas / zeus-llm-trainer
Zeus LLM Trainer is a rewrite of Stanford Alpaca aiming to be the trainer for all Large Language Models
☆69Updated 2 years ago
xjdr-alt / llmri
look how they massacred my boy
☆63Updated last year
brendanhogan / completion_tree_view
☆14Updated 6 months ago
Birch-san / booru-embed
[WIP] Transformer to embed Danbooru labelsets
☆13Updated last year
EleutherAI / improved-t5
Experiments for efforts to train a new and improved t5
☆75Updated last year
recursal / GoldFinch-paper
GoldFinch and other hybrid transformer components
☆45Updated last year
ElleLeonne / Lightning-ReLoRA
A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.
☆35Updated last year
matthewrenze / jhu-concise-cot
The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models
☆22Updated 11 months ago
CERC-AAI / Robin
☆63Updated last year
evanatyourservice / llm-jax
Train a SmolLM-style llm on fineweb-edu in JAX/Flax with an assortment of optimizers.
☆18Updated 3 months ago
Alex-Gurung / ReasoningNCP
Official repo for Learning to Reason for Long-Form Story Generation
☆72Updated 6 months ago
kubernetes-bad / reward-composer
Lego for GRPO
☆30Updated 4 months ago
Zyphra / zcookbook
Training hybrid models for dummies.
☆27Updated last week
rosmineb / unit_test_rl
Project code for training LLMs to write better unit tests + code
☆21Updated 5 months ago
argilla-io / distilabel-spin-dibt
Repository containing the SPIN experiments on the DIBT 10k ranked prompts
☆24Updated last year
joey00072 / Attention-as-graph
alternative way to calculating self attention
☆18Updated last year
kaiokendev / cutoff-len-is-context-len
Demonstration that finetuning RoPE model on larger sequences than the pre-trained model adapts the model context limit
☆62Updated 2 years ago
enjalot / latent-data-modal
Using modal.com to process FineWeb-edu data
☆20Updated 6 months ago
RobertCsordas / moe
Official repository for the paper "Approximating Two-Layer Feedforward Networks for Efficient Transformers"
☆38Updated 4 months ago
AnswerDotAI / ModernBERT-Instruct-mini-cookbook
☆49Updated 8 months ago
Zyphra / Zyda_processing
☆39Updated last year
nisten / grokadamw
new optimizer
☆20Updated last year
joey00072 / ohara
Collection of autoregressive model implementation
☆86Updated 6 months ago
NolanoOrg / SpectraSuite
☆51Updated last year
ContextualAI / CLAIR_and_APO
Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment
☆60Updated last year