SamsungSAILMontreal / ninoLinks
Code for "Accelerating Training with Neuron Interaction and Nowcasting Networks" [ICLR 2025]
☆26Updated 2 months ago
Alternatives and similar repositories for nino
Users that are interested in nino are comparing it to the libraries listed below
Sorting:
- ☆91Updated last year
- ☆82Updated last year
- ☆29Updated 2 months ago
- Lottery Ticket Adaptation☆40Updated last year
- A repository for research on medium sized language models.☆77Updated last year
- Python package for generating datasets to evaluate reasoning and retrieval of large language models☆18Updated 3 months ago
- Unofficial Implementation of Selective Attention Transformer☆20Updated last year
- Official implementation of Regularized Policy Gradient (RPG) (https://arxiv.org/abs/2505.17508)☆63Updated this week
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆62Updated last year
- Code for the paper Don't Pay Attention☆50Updated 3 months ago
- Efficiently discovering algorithms via LLMs with evolutionary search and reinforcement learning.☆121Updated last month
- Implementation of Mind Evolution, Evolving Deeper LLM Thinking, from Deepmind☆57Updated 7 months ago
- ☆33Updated last year
- Official Project Page for Monadic Context Engineering (https://arxiv.org/abs/2512.22431)☆15Updated last week
- Fork of Flame repo for training of some new stuff in development☆19Updated last week
- ☆56Updated last year
- ☆62Updated last year
- ☆40Updated last year
- Explorations into the proposal from the paper "Grokfast, Accelerated Grokking by Amplifying Slow Gradients"☆103Updated last year
- ☆152Updated 3 months ago
- UQ: Assessing Language Models on Unsolved Questions☆29Updated 4 months ago
- open source alpha evolve☆68Updated 7 months ago
- JAX Scalify: end-to-end scaled arithmetics☆17Updated last year
- ☆34Updated last year
- Implementation of SOAR☆46Updated 3 months ago
- https://x.com/BlinkDL_AI/status/1884768989743882276☆28Updated 8 months ago
- ☆43Updated last year
- Pytorch implementation of the PEER block from the paper, Mixture of A Million Experts, by Xu Owen He at Deepmind☆132Updated 2 months ago
- KV Cache Steering for Inducing Reasoning in Small Language Models☆44Updated 5 months ago
- Synthetic data generation and benchmark implementation for "Episodic Memories Generation and Evaluation Benchmark for Large Language Mode…☆62Updated 3 months ago