Think-a-Tron / evolveLinks

open source alpha evolve

☆66

Alternatives and similar repositories for evolve

Users that are interested in evolve are comparing it to the libraries listed below

Sorting:

lucidrains / mind-evolution
Implementation of Mind Evolution, Evolving Deeper LLM Thinking, from Deepmind
☆57Updated 4 months ago
JoeLi12345 / nGPT
an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)
☆107Updated 7 months ago
bluorion-com / ZClip
Official implementation of the paper: "ZClip: Adaptive Spike Mitigation for LLM Pre-Training".
☆136Updated 2 weeks ago
lucidrains / grokfast-pytorch
Explorations into the proposal from the paper "Grokfast, Accelerated Grokking by Amplifying Slow Gradients"
☆102Updated 10 months ago
bloc97 / DeMo
DeMo: Decoupled Momentum Optimization
☆194Updated 10 months ago
tanaymeh / mamba-train
A single repo with all scripts and utils to train / fine-tune the Mamba model with or without FIM
☆59Updated last year
CLAIRE-Labo / EvoTune
Efficiently discovering algorithms via LLMs with evolutionary search and reinforcement learning.
☆116Updated this week
BlinkDL / modded-nanogpt-rwkv
RWKV-7: Surpassing GPT
☆98Updated 11 months ago
joey00072 / ohara
Collection of autoregressive model implementation
☆86Updated 6 months ago
kyleliang919 / Super_Muon
☆65Updated 7 months ago
gkamradt / SnakeBench
☆93Updated 4 months ago
s-smits / grpo-optuna
Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna
☆58Updated last week
joey00072 / Multi-Head-Latent-Attention-MLA-
working implimention of deepseek MLA
☆44Updated 9 months ago
QuixiAI / grokadamw
☆136Updated last year
BorealisAI / neuzip
Official repository for the paper "NeuZip: Memory-Efficient Training and Inference with Dynamic Compression of Neural Networks". This rep…
☆59Updated 11 months ago
lucidrains / llama-qrlhf
Implementation of the Llama architecture with RLHF + Q-learning
☆167Updated 8 months ago
lucidrains / transformer-directed-evolution
Explorations into whether a transformer with RL can direct a genetic algorithm to converge faster
☆71Updated 5 months ago
foundation-model-stack / bamba
Train, tune, and infer Bamba model
☆135Updated 4 months ago
microsoft / ArchScale
Simple & Scalable Pretraining for Neural Architecture Research
☆297Updated 2 months ago
xjdr-alt / llmri
look how they massacred my boy
☆63Updated last year
RWKV / ZeroCoT
https://x.com/BlinkDL_AI/status/1884768989743882276
☆28Updated 5 months ago
axolotl-ai-cloud / axolotl-cookbook
☆36Updated 2 months ago
rom1504 / generic-mcp-client-chat
Generic MCP Client to use any MCP tool in a chat
☆43Updated 5 months ago
OpenPipe / deductive-reasoning
Train your own SOTA deductive reasoning model
☆109Updated 7 months ago
ShadeAlsha / ICon
ICLR 2025 - official implementation for "I-Con: A Unifying Framework for Representation Learning"
☆117Updated 4 months ago
jfpuget / ARC-AGI-Challenge-2024
☆56Updated 11 months ago
KindXiaoming / grow-crystals
Getting crystal-like representations with harmonic loss
☆192Updated 6 months ago
VatsaDev / NanoPoor
NanoGPT-speedrunning for the poor T4 enjoyers
☆72Updated 6 months ago
facebookresearch / matrix
Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera…
☆99Updated this week
casper-hansen / OpenCoconut
OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding.
☆172Updated 9 months ago