CLAIRE-Labo / EvoTune
Efficiently discovering algorithms via LLMs with evolutionary search and reinforcement learning.
☆98 Updated last month
Alternatives and similar repositories for EvoTune
Users who are interested in EvoTune are comparing it to the repositories listed below.
- Open-source AlphaEvolve ☆62 Updated 2 weeks ago
- Tiny re-implementation of MDM in the style of LLaDA and the nano-gpt speedrun ☆52 Updated 2 months ago
- Implementation of Mind Evolution, Evolving Deeper LLM Thinking, from DeepMind ☆50 Updated this week
- Explorations into the recently proposed Taylor Series Linear Attention ☆99 Updated 9 months ago
- PyTorch implementation of the PEER block from the paper, Mixture of A Million Experts, by Xu Owen He at DeepMind ☆126 Updated 9 months ago
- Explorations into whether a transformer with RL can direct a genetic algorithm to converge faster ☆70 Updated 2 weeks ago
- Explorations into the proposal from the paper "Grokfast, Accelerated Grokking by Amplifying Slow Gradients" ☆100 Updated 5 months ago
- The simplest, fastest repository for training/finetuning medium-sized GPTs. ☆128 Updated 3 weeks ago
- Minimal (400 LOC) implementation, maximum (multi-node, FSDP) GPT training ☆127 Updated last year
- Official implementation of the paper: "ZClip: Adaptive Spike Mitigation for LLM Pre-Training". ☆125 Updated this week
- [ICLR 2025] Code for the paper "Beyond Autoregression: Discrete Diffusion for Complex Reasoning and Planning" ☆60 Updated 3 months ago
- Implementation of Infini-Transformer in PyTorch ☆111 Updated 5 months ago
- Implementation of a multimodal diffusion transformer in PyTorch ☆102 Updated 11 months ago
- Implementation of the new SOTA for model-based RL, from the paper "Improving Transformer World Models for Data-Efficient RL", in PyTorch ☆121 Updated last month
- Focused on fast experimentation and simplicity ☆73 Updated 5 months ago
- ☆80 Updated last year
- Griffin MQA + Hawk Linear RNN Hybrid ☆86 Updated last year
- ☆79 Updated 9 months ago
- Collection of autoregressive model implementations ☆85 Updated last month
- ☆95 Updated 4 months ago
- Implementation of 🥥 Coconut, Chain of Continuous Thought, in PyTorch ☆170 Updated 5 months ago
- Exploration into the proposed "Self Reasoning Tokens" by Felipe Bonetto ☆56 Updated last year
- A MAD laboratory to improve AI architecture designs 🧪 ☆116 Updated 5 months ago
- ☆78 Updated 11 months ago
- Supporting PyTorch FSDP for optimizers ☆79 Updated 5 months ago
- PyTorch implementation of Evolutionary Policy Optimization, from Wang et al. of the Robotics Institute at Carnegie Mellon University ☆78 Updated last week
- ☆44 Updated last year
- Attempt to make multiple residual streams from ByteDance's Hyper-Connections paper accessible to the public ☆83 Updated 3 months ago
- Normalized Transformer (nGPT) ☆181 Updated 6 months ago
- One Initialization to Rule them All: Fine-tuning via Explained Variance Adaptation ☆40 Updated 7 months ago