lucidrains / HRMLinks
Exploration into the proposed architecture from Sapient Intelligence of Singapore πΈπ¬
β38Updated this week
Alternatives and similar repositories for HRM
Users that are interested in HRM are comparing it to the libraries listed below
Sorting:
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokensβ140Updated 5 months ago
- look how they massacred my boyβ63Updated 9 months ago
- β134Updated 11 months ago
- RWKV-7: Surpassing GPTβ94Updated 8 months ago
- an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)β103Updated 5 months ago
- Train your own SOTA deductive reasoning modelβ103Updated 5 months ago
- smolLM with Entropix sampler on pytorchβ150Updated 9 months ago
- OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding.β173Updated 6 months ago
- EvaByte: Efficient Byte-level Language Models at Scaleβ103Updated 3 months ago
- Lego for GRPOβ28Updated 2 months ago
- PyTorch implementation of models from the Zamba2 series.β184Updated 6 months ago
- Automated Capability Discovery via Foundation Model Self-Explorationβ59Updated 5 months ago
- Guaranteed Structured Output from any Language Model via Hierarchical State Machinesβ142Updated 2 months ago
- The simplest, fastest repository for training/finetuning medium-sized xLSTMs.β41Updated last year
- Plotting (entropy, varentropy) for small LMsβ98Updated 2 months ago
- Train an adapter for any embedding model in under a minuteβ109Updated 4 months ago
- entropix style sampling + GUIβ26Updated 9 months ago
- Low-Rank adapter extraction for fine-tuned transformers modelsβ175Updated last year
- Clue inspired puzzles for testing LLM deduction abilitiesβ40Updated 4 months ago
- Simple GRPO scripts and configurations.β59Updated 6 months ago
- NanoGPT-speedrunning for the poor T4 enjoyersβ69Updated 3 months ago
- Memoria is a human-inspired memory architecture for neural networks.β75Updated 9 months ago
- Official repo for Learning to Reason for Long-Form Story Generationβ68Updated 3 months ago
- Implementation of mamba with rustβ88Updated last year
- Testing LLM reasoning abilities with family relationship quizzes.β63Updated 6 months ago
- MLX port for xjdr's entropix sampler (mimics jax implementation)β62Updated 9 months ago
- Alice in Wonderland code base for experiments and raw experiments dataβ131Updated last week
- β27Updated last year
- Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optunaβ55Updated 6 months ago
- Swarming algorithms like PSO, Ant Colony, Sakana, and more in PyTorch πβ129Updated last week