lucidrains / HRMLinks
Exploration into the proposed architecture from Sapient Intelligence of Singapore πΈπ¬
β73Updated 4 months ago
Alternatives and similar repositories for HRM
Users that are interested in HRM are comparing it to the libraries listed below
Sorting:
- PyTorch implementation of models from the Zamba2 series.β186Updated 11 months ago
- β107Updated 5 months ago
- The simplest, fastest repository for training/finetuning medium-sized xLSTMs.β41Updated last year
- Code to train and evaluate Neural Attention Memory Models to obtain universally-applicable memory systems for transformers.β344Updated last year
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokensβ150Updated this week
- RWKV-7: Surpassing GPTβ102Updated last year
- DeMo: Decoupled Momentum Optimizationβ198Updated last year
- EvaByte: Efficient Byte-level Language Models at Scaleβ112Updated 8 months ago
- NanoGPT-speedrunning for the poor T4 enjoyersβ73Updated 8 months ago
- Implementation of mamba with rustβ89Updated last year
- Explorations into the proposal from the paper "Grokfast, Accelerated Grokking by Amplifying Slow Gradients"β103Updated last year
- OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding.β174Updated 11 months ago
- Train your own SOTA deductive reasoning modelβ107Updated 9 months ago
- an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)β109Updated 9 months ago
- H-Net Dynamic Hierarchical Architectureβ80Updated 3 months ago
- β131Updated last year
- Code for ExploreTomβ89Updated 6 months ago
- This repo contains the source code for the paper "Evolution Strategies at Scale: LLM Fine-Tuning Beyond Reinforcement Learning"β279Updated last month
- β137Updated last year
- Alice in Wonderland code base for experiments and raw experiments dataβ131Updated 3 months ago
- Jax Codebase for Evolutionary Strategies at the Hyperscaleβ205Updated last week
- Simple GRPO scripts and configurations.β59Updated 10 months ago
- look how they massacred my boyβ63Updated last year
- Testing LLM reasoning abilities with family relationship quizzes.β63Updated 11 months ago
- open source alpha evolveβ67Updated 7 months ago
- The Automated LLM Speedrunning Benchmark measures how well LLM agents can reproduce previous innovations and discover new ones in languagβ¦β122Updated 2 months ago
- β34Updated 3 weeks ago
- Swarming algorithms like PSO, Ant Colony, Sakana, and more in PyTorch πβ135Updated 2 months ago
- Micro Llama is a small Llama based model with 300M parameters trained from scratch with $500 budgetβ163Updated 4 months ago
- 1.58-bit LLaMa modelβ83Updated last year