MLX implementation of Hierarchical Reasoning Model (HRM) - Adaptive computation for complex reasoning tasks
☆28 Aug 27, 2025 Updated 7 months ago
Alternatives and similar repositories for hrm-mlx
Users that are interested in hrm-mlx are comparing it to the libraries listed below.
- [ICML'25] Breaking Silos: Adaptive Model Fusion Unlocks Better Time Series Forecasting | Sample-level adaptive multi-model ensembling for time series forecasting ☆26 May 22, 2025 Updated 10 months ago
- Official PyTorch implementation of CD-MOE ☆12 Mar 18, 2026 Updated last week
- Notebook examples using mirdata ☆12 Dec 5, 2023 Updated 2 years ago
- Bitcoin Conversions is a Twitter bot which replies to a Twitter mention with a value in bitcoin and a currency, with the actual value in the… ☆10 Jan 23, 2015 Updated 11 years ago
- Clustered Compositional Embeddings ☆11 Oct 25, 2023 Updated 2 years ago
- Residual vector quantization for KV cache compression in large language models ☆12 Oct 22, 2024 Updated last year
- A PyTorch native platform for training generative AI models ☆16 Nov 18, 2025 Updated 4 months ago
- Learning Accurate Decision Trees with Bandit Feedback via Quantized Gradient Descent ☆16 Sep 8, 2022 Updated 3 years ago
- ☆23 Aug 1, 2025 Updated 7 months ago
- ☆13 Oct 29, 2021 Updated 4 years ago
- ☆16 Dec 9, 2023 Updated 2 years ago
- [COLM 2025: 1st Workshop on the Application of LLM Explainability to Reasoning and Planning] Latent Chain-of-Thought? Decoding the Depth-… ☆17 Oct 4, 2025 Updated 5 months ago
- ☆21 Oct 22, 2025 Updated 5 months ago
- Official repo for BWLer: Barycentric Weight Layer ☆30 Mar 20, 2026 Updated last week
- ☆10 Oct 28, 2024 Updated last year
- Code accompanying the paper "A contrastive rule for meta-learning" ☆13 Oct 31, 2024 Updated last year
- sigma-MoE layer ☆21 Jan 5, 2024 Updated 2 years ago
- Mixture of Lora Experts ☆10 Apr 7, 2024 Updated last year
- Our EMNLP 2022 paper on VIP-Based Prompting for Parameter-Efficient Learning ☆10 Oct 22, 2022 Updated 3 years ago
- ☆37 Jan 13, 2026 Updated 2 months ago
- Official implementation of CytoSAE: Interpretable Cell Embeddings for Hematology ☆23 Jul 17, 2025 Updated 8 months ago
- ☆17 Mar 4, 2026 Updated 3 weeks ago
- ☆51 Sep 26, 2025 Updated 6 months ago
- Differentiable Clustering with Perturbed Random Forests, NeurIPS2023 ☆13 Oct 16, 2023 Updated 2 years ago
- ☆11 Sep 7, 2024 Updated last year
- Model LEGO: Creating Models Like Disassembling and Assembling Building Blocks ☆17 Jan 15, 2025 Updated last year
- The official repository for our paper "The Dual Form of Neural Networks Revisited: Connecting Test Time Predictions to Training Patterns … ☆16 Jun 11, 2025 Updated 9 months ago
- Transformer Doctor: Diagnosing and Treating Vision Transformers ☆11 Jan 15, 2025 Updated last year
- ☆18 Mar 11, 2025 Updated last year
- PyTorch implementation of StableMask (ICML'24) ☆15 Jun 27, 2024 Updated last year
- CUDA implementation of Wavelet KAN. ☆16 Jun 8, 2024 Updated last year
- [NIPS 2025] Mixing Expert Knowledge: Bring Human Thoughts Back to The Game of Go. Our model is originally named InternThinker-Go, and cal… ☆23 Jan 26, 2026 Updated 2 months ago
- ☆11 Jul 30, 2016 Updated 9 years ago
- This repository contains code for the paper "Learning Decision Trees as Amortized Structure Inference" ☆16 Mar 25, 2025 Updated last year
- [EMNLP 24] Source code for paper 'AdaZeta: Adaptive Zeroth-Order Tensor-Train Adaption for Memory-Efficient Large Language Models Fine-Tu… ☆13 Dec 15, 2024 Updated last year
- ☆16 Mar 1, 2025 Updated last year
- A Web App for the game of Go/Baduk/Weiqi. Based on Plotly Dash and GoTextProtocol engines. ☆12 Apr 10, 2025 Updated 11 months ago
- [ACL 2025 Findings] Implicit Reasoning in Transformers is Reasoning through Shortcuts ☆17 Mar 11, 2025 Updated last year
- X-ANFIS: An Extensible and Cross-Learning ANFIS Framework for Machine Learning Tasks ☆17 Jun 7, 2025 Updated 9 months ago