fal-ai-community / llmdifftrackerLinks

Lightweight package that tracks and summarizes code changes using LLMs (Large Language Models)

☆33

Alternatives and similar repositories for llmdifftracker

Users that are interested in llmdifftracker are comparing it to the libraries listed below

Sorting:

fal-ai / diffusion-speedrun
Focused on fast experimentation and simplicity
☆76Updated 7 months ago
VatsaDev / NanoPoor
NanoGPT-speedrunning for the poor T4 enjoyers
☆69Updated 3 months ago
kubernetes-bad / reward-composer
Lego for GRPO
☆28Updated 2 months ago
JoeLi12345 / nGPT
an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)
☆103Updated 5 months ago
cloneofsimo / infinite-fractal-stream
☆30Updated 10 months ago
SonicCodes / lucid-v1
realtime latent world model inference demo
☆47Updated 8 months ago
evanatyourservice / llm-jax
Train a SmolLM-style llm on fineweb-edu in JAX/Flax with an assortment of optimizers.
☆19Updated 2 weeks ago
main-horse / hnet
H-Net Dynamic Hierarchical Architecture
☆65Updated 2 weeks ago
lucidrains / grokfast-pytorch
Explorations into the proposal from the paper "Grokfast, Accelerated Grokking by Amplifying Slow Gradients"
☆101Updated 7 months ago
gau-nernst / kokoro
https://hf.co/hexgrad/Kokoro-82M
☆14Updated 5 months ago
evanatyourservice / kron_torch
An implementation of PSGD Kron second-order optimizer for PyTorch
☆94Updated 2 weeks ago
RWKV / ZeroCoT
https://x.com/BlinkDL_AI/status/1884768989743882276
☆28Updated 3 months ago
SonicCodes / subcloning
implementation of https://arxiv.org/pdf/2312.09299
☆21Updated last year
fal-ai-community / nano-mdm
Tiny re-implementation of MDM in style of LLaDA and nano-gpt speedrun
☆55Updated 4 months ago
bloc97 / DeMo
DeMo: Decoupled Momentum Optimization
☆190Updated 8 months ago
joey00072 / ohara
Collection of autoregressive model implementation
☆86Updated 3 months ago
cloneofsimo / project_RF
☆24Updated last year
rimads / avey-dpa
Code for the paper Don't Pay Attention
☆48Updated last month
xjdr-alt / muzero_sketch
☆38Updated last year
idiap / sigma-gpt
σ-GPT: A New Approach to Autoregressive Models
☆67Updated 11 months ago
cloneofsimo / min-max-gpt
Minimal (400 LOC) implementation Maximum (multi-node, FSDP) GPT training
☆130Updated last year
joey00072 / Multi-Head-Latent-Attention-MLA-
working implimention of deepseek MLA
☆42Updated 7 months ago
SwayStar123 / reimei
☆24Updated 3 months ago
BlinkDL / modded-nanogpt-rwkv
RWKV-7: Surpassing GPT
☆94Updated 8 months ago
kyegomez / OpenStrawberry
An open source replication of the stawberry method that leverages Monte Carlo Search with PPO and or DPO
☆31Updated last week
sayakpaul / simple-image-recaptioning
Recaption large (Web)Datasets with vllm and save the artifacts.
☆52Updated 8 months ago
s-smits / grpo-optuna
Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna
☆55Updated 6 months ago
euclaise / supertrainer2000
☆49Updated last year
ethansmith2000 / TransformerExperiments
☆19Updated 2 months ago
kyleliang919 / Super_Muon
☆60Updated 4 months ago