SamsungSAILMontreal / TinyRecursiveModelsLinks
☆999Updated this week
Alternatives and similar repositories for TinyRecursiveModels
Users that are interested in TinyRecursiveModels are comparing it to the libraries listed below
Sorting:
- Plotting (entropy, varentropy) for small LMs☆98Updated 4 months ago
- The State Of The Art, intelligence☆152Updated last month
- SoTA Approach for ARC-AGI 2☆97Updated 3 weeks ago
- Fast parallel LLM inference for MLX☆220Updated last year
- Metadspy: The framework for specifying—not programming—language models☆88Updated 3 months ago
- look how they massacred my boy☆63Updated 11 months ago
- A graph visualization of attention☆57Updated 4 months ago
- A simple MLX implementation for pretraining LLMs on Apple Silicon.☆83Updated last month
- ☆166Updated 9 months ago
- smol models are fun too☆93Updated 10 months ago
- Code to train and evaluate Neural Attention Memory Models to obtain universally-applicable memory systems for transformers.☆322Updated 11 months ago
- A comprehensive repository of reasoning tasks for LLMs (and beyond)☆449Updated last year
- ☆269Updated 4 months ago
- smolLM with Entropix sampler on pytorch☆150Updated 11 months ago
- A tree-based prefix cache library that allows rapid creation of looms: hierarchal branching pathways of LLM generations.☆72Updated 7 months ago
- An easy-to-understand framework for LLM samplers that rewind and revise generated tokens☆145Updated 7 months ago
- explore token trajectory trees on instruct and base models☆133Updated 4 months ago
- The Prime Intellect CLI provides a powerful command-line interface for managing GPU resources across various providers☆97Updated this week
- noise_step: Training in 1.58b With No Gradient Memory☆221Updated 9 months ago
- The history files when recording human interaction while solving ARC tasks☆116Updated this week
- ComplexTensor: Machine Learning By Bridging Classical and Quantum Computation☆77Updated 10 months ago
- ☆22Updated 4 months ago
- Claude Deep Research config for Claude Code.☆220Updated 6 months ago
- ☆123Updated last year
- MLX port for xjdr's entropix sampler (mimics jax implementation)☆62Updated 11 months ago
- ☆60Updated 2 months ago
- Implementation of the board game Codenames, re-imagined as a collaborative game between LLM agents☆107Updated 7 months ago
- A framework for optimizing DSPy programs with RL☆185Updated 2 weeks ago
- ☆164Updated last week
- Aidan Bench attempts to measure <big_model_smell> in LLMs.☆312Updated 3 months ago