SamsungSAILMontreal / ninoLinks

Code for "Accelerating Training with Neuron Interaction and Nowcasting Networks" [ICLR 2025]

☆25

Alternatives and similar repositories for nino

Users that are interested in nino are comparing it to the libraries listed below

Sorting:

OpenMOSS / Lorsa
☆29Updated 2 weeks ago
epfml / DenseFormer
☆82Updated last year
RobertCsordas / moeut
☆88Updated last year
ContextualAI / CLAIR_and_APO
Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment
☆60Updated last year
ml-jku / EVA
One Initialization to Rule them All: Fine-tuning via Explained Variance Adaptation
☆45Updated last month
CLAIRE-Labo / EvoTune
Efficiently discovering algorithms via LLMs with evolutionary search and reinforcement learning.
☆119Updated 3 weeks ago
hyperevolnet / Terminator
The official repository for HyperZ⋅Z⋅W Operator Connects Slow-Fast Networks for Full Context Interaction.
☆42Updated 7 months ago
lucidrains / PEER-pytorch
Pytorch implementation of the PEER block from the paper, Mixture of A Million Experts, by Xu Owen He at Deepmind
☆131Updated 3 weeks ago
kilian-group / phantom-wiki
Python package for generating datasets to evaluate reasoning and retrieval of large language models
☆19Updated 2 months ago
GenRobo / MatMamba
Code and pretrained models for the paper: "MatMamba: A Matryoshka State Space Model"
☆61Updated last year
zaydzuhri / flame
Fork of Flame repo for training of some new stuff in development
☆19Updated this week
convergence-ai / lm2
Official repo of paper LM2
☆46Updated 9 months ago
kiddyboots216 / lottery-ticket-adaptation
Lottery Ticket Adaptation
☆40Updated last year
dayal-kalra / low-memory-adam
☆13Updated 8 months ago
rimads / avey-dpa
Code for the paper Don't Pay Attention
☆50Updated last month
katiekang1998 / reasoning_generalization
☆33Updated 10 months ago
rbalestr-lab / llm-jepa
☆130Updated last month
google-deepmind / asyncdiloco
☆47Updated last year
aszala / EnvGen
Official Code Repository for EnvGen: Generating and Adapting Environments via LLMs for Training Embodied Agents (COLM 2024)
☆38Updated last year
idiap / sigma-gpt
σ-GPT: A New Approach to Autoregressive Models
☆69Updated last year
uq-project / UQ
UQ: Assessing Language Models on Unsolved Questions
☆28Updated 2 months ago
shangshang-wang / Resa
Resa: Transparent Reasoning Models via SAEs
☆44Updated 2 months ago
The-Inscrutable-X / TACQ
Official Repository for Task-Circuit Quantization
☆24Updated 5 months ago
TRI-ML / linear_open_lm
A repository for research on medium sized language models.
☆78Updated last year
apoorvkh / academic-pretraining
$100K or 100 Days: Trade-offs when Pre-Training with Academic Resources
☆147Updated last month
ml-jku / hopfield-boosting
☆33Updated last year
arcee-ai / DAM
☆55Updated last year
recursal / GoldFinch-paper
GoldFinch and other hybrid transformer components
☆45Updated last year
RWKV / ZeroCoT
https://x.com/BlinkDL_AI/status/1884768989743882276
☆28Updated 6 months ago
lucidrains / grokfast-pytorch
Explorations into the proposal from the paper "Grokfast, Accelerated Grokking by Amplifying Slow Gradients"
☆103Updated 11 months ago