AlxSp / t-jepaLinks

☆11

Alternatives and similar repositories for t-jepa

Users that are interested in t-jepa are comparing it to the libraries listed below

Sorting:

kubernetes-bad / reward-composer
Lego for GRPO
☆28Updated last month
evanatyourservice / llm-jax
Train a SmolLM-style llm on fineweb-edu in JAX/Flax with an assortment of optimizers.
☆17Updated 3 months ago
RWKV / ZeroCoT
https://x.com/BlinkDL_AI/status/1884768989743882276
☆28Updated last month
okarthikb / state-space-models
☆27Updated 11 months ago
Birch-san / booru-embed
[WIP] Transformer to embed Danbooru labelsets
☆13Updated last year
nisten / grokadamw
new optimizer
☆20Updated 10 months ago
axolotl-ai-cloud / axolotl-cookbook
☆34Updated 3 months ago
joey00072 / ohara
Collection of autoregressive model implementation
☆85Updated 2 months ago
Algomancer / The-Daily-Train
Training Models Daily
☆17Updated last year
euclaise / supertrainer2000
☆49Updated last year
catid / lllm
Latent Large Language Models
☆18Updated 10 months ago
joey00072 / Attention-as-graph
alternative way to calculating self attention
☆18Updated last year
xjdr-alt / muzero_sketch
☆38Updated 11 months ago
VatsaDev / NanoPoor
NanoGPT-speedrunning for the poor T4 enjoyers
☆66Updated 2 months ago
sfcompute / tinynarrations
A synthetic story narration dataset to study small audio LMs.
☆32Updated last year
OpenPipe / rl-experiments
OpenPipe Reinforcement Learning Experiments
☆25Updated 3 months ago
EleutherAI / training-jacobian
☆23Updated 6 months ago
JoeLi12345 / nGPT
an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)
☆101Updated 3 months ago
recursal / GoldFinch-paper
GoldFinch and other hybrid transformer components
☆45Updated 11 months ago
SkunkworksAI / CodeFusion
☆15Updated last year
xjdr-alt / llmri
look how they massacred my boy
☆63Updated 8 months ago
kyegomez / OpenStrawberry
An open source replication of the stawberry method that leverages Monte Carlo Search with PPO and or DPO
☆29Updated this week
catid / spectral_ssm
Implementation of Spectral State Space Models
☆16Updated last year
fal-ai-community / llmdifftracker
Lightweight package that tracks and summarizes code changes using LLMs (Large Language Models)
☆34Updated 4 months ago
s-smits / grpo-optuna
Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna
☆53Updated 4 months ago
The-Inscrutable-X / TACQ
Official Repository for Task-Circuit Quantization
☆20Updated 3 weeks ago
ethansmith2000 / TransformerExperiments
☆19Updated last month
JD-P / RetroInstruct
Synthetic data derived by templating, few shot prompting, transformations on public domain corpora, and monte carlo tree search.
☆32Updated 4 months ago
tensoic / Cerule
Cerule - A Tiny Mighty Vision Model
☆66Updated 9 months ago
huggingface / peft-pytorch-conference
Code for the examples presented in the talk "Training a Llama in your backyard: fine-tuning very large models on consumer hardware" given…
☆14Updated last year