Deep Networks Grok All the Time and Here is Why
☆38May 18, 2024Updated last year
Alternatives and similar repositories for grok-adversarial
Users that are interested in grok-adversarial are comparing it to the libraries listed below
Sorting:
- Exact method for visualizing partitions of a Deep Neural Network, CVPR 2023 Highlight☆110Feb 12, 2025Updated last year
- Relational Features for Planning☆14Feb 18, 2026Updated 2 weeks ago
- Modular optimization library for PyTorch (work-in-progress).☆13Feb 4, 2026Updated last month
- Official repository of paper "RNNs Are Not Transformers (Yet): The Key Bottleneck on In-context Retrieval"☆27Apr 17, 2024Updated last year
- Various math art notebooks☆15Mar 9, 2025Updated last year
- ☆15Oct 26, 2021Updated 4 years ago
- The official repository for our paper "The Dual Form of Neural Networks Revisited: Connecting Test Time Predictions to Training Patterns …☆16Jun 11, 2025Updated 8 months ago
- MarketGPT: Developing a Pre-trained transformer (GPT) for Modeling Financial Time Series☆17Sep 5, 2025Updated 6 months ago
- Code for the paper "Cottention: Linear Transformers With Cosine Attention"☆20Nov 15, 2025Updated 3 months ago
- Graph Learning for Planning☆25Feb 22, 2026Updated 2 weeks ago
- [EMNLP 2024] Tree of Problems: Improving structured problem solving with compositionality☆19Mar 4, 2025Updated last year
- A novel approach for transformer model introspection that enables saving, compressing, and manipulating internal thought states for advan…☆30Updated this week
- ☆23Mar 18, 2024Updated last year
- ☆46Jun 11, 2025Updated 8 months ago
- Codebase from our first release.☆47Feb 17, 2026Updated 2 weeks ago
- ☆27Feb 1, 2023Updated 3 years ago
- Lightweight package that tracks and summarizes code changes using LLMs (Large Language Models)☆34Feb 27, 2025Updated last year
- [ICLR 2025] Implementation of "FACTS: A Factored State-Space Framework For World Modelling"☆29Jun 2, 2025Updated 9 months ago
- ☆29Sep 30, 2025Updated 5 months ago
- Official repository for the paper "Grokfast: Accelerated Grokking by Amplifying Slow Gradients"☆578Jun 28, 2024Updated last year
- Official Code Repository for the paper "Key-value memory in the brain"☆31Feb 25, 2025Updated last year
- ☆31Mar 23, 2024Updated last year
- The official repo of continuous speculative decoding☆31Mar 28, 2025Updated 11 months ago
- A n body simulation of our solar system completed in python☆11Dec 6, 2021Updated 4 years ago
- Official PyTorch Implementation for Vision-Language Models Create Cross-Modal Task Representations, ICML 2025☆33May 1, 2025Updated 10 months ago
- A tool to paste Excel ranges to Reddit☆11Sep 20, 2025Updated 5 months ago
- Plan✕ is a platform for creating and publishing digital planning services☆17Updated this week
- Big Data Analysis of Tinder done at Universitat Rovira i Virgili and Universitat Politècnica de Catalunya · BarcelonaTech☆13Jan 3, 2023Updated 3 years ago
- An EinSum system in JAX☆18Updated this week
- ☆146Sep 12, 2025Updated 5 months ago
- Multiprocessing in python☆10Aug 20, 2021Updated 4 years ago
- The first OpenSource Mafia Bot!☆10Oct 5, 2023Updated 2 years ago
- This repository contains the Parasol processor, which enables next-generation privacy preserving applications. Users can run arbitrary co…☆11Feb 25, 2026Updated last week
- [NeurIPS 2024] Official implementation of "NeuralClothSim: Neural Deformation Fields Meet the Thin Shell Theory"☆42Oct 29, 2024Updated last year
- Program to plot a Ramachandran plot of all dihedral angles from a given PDB file. Background is empirically generated from the peptides …☆12Feb 25, 2025Updated last year
- [ICML 2024] Official Repository for the paper "Transformers Get Stable: An End-to-End Signal Propagation Theory for Language Models"☆10Jul 19, 2024Updated last year
- PyTorch library for Active Fine-Tuning☆97Sep 27, 2025Updated 5 months ago
- Linear Attention Sequence Parallelism (LASP)☆89Jun 4, 2024Updated last year
- ☆24Feb 18, 2026Updated 2 weeks ago