Deep Networks Grok All the Time and Here is Why
☆38May 18, 2024Updated last year
Alternatives and similar repositories for grok-adversarial
Users that are interested in grok-adversarial are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Implementation of Kronecker Attention in Pytorch☆19Sep 12, 2020Updated 5 years ago
- Exact method for visualizing partitions of a Deep Neural Network, CVPR 2023 Highlight☆112Feb 12, 2025Updated last year
- Various math art notebooks☆15Mar 9, 2025Updated last year
- Official repository of paper "RNNs Are Not Transformers (Yet): The Key Bottleneck on In-context Retrieval"☆27Apr 17, 2024Updated last year
- Sketching-based matrix computations for numpy arrays☆17Oct 29, 2019Updated 6 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- The official repository for our paper "The Dual Form of Neural Networks Revisited: Connecting Test Time Predictions to Training Patterns …☆16Jun 11, 2025Updated 9 months ago
- An EinSum system in JAX☆18Mar 6, 2026Updated 3 weeks ago
- Graph Learning for Planning☆25Feb 22, 2026Updated last month
- code for EMNLP 2024 paper: How do Large Language Models Learn In-Context? Query and Key Matrices of In-Context Heads are Two Towers for M…☆13Nov 17, 2024Updated last year
- A powerful white-box adversarial attack that exploits knowledge about the geometry of neural networks to find minimal adversarial perturb…☆12Aug 5, 2020Updated 5 years ago
- Spectral Attention Autoregressive Model (SAAM)☆16Oct 27, 2022Updated 3 years ago
- Official codebase for the "A Neuro-Symbolic Benchmark Suite for Concept Quality and Reasoning Shortcuts" benchmark paper.☆11Feb 3, 2025Updated last year
- ☆18Dec 2, 2024Updated last year
- Official repository for the paper "Grokfast: Accelerated Grokking by Amplifying Slow Gradients"☆578Jun 28, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Train a bidirectional or normal LSTM recurrent neural network to generate text on a free GPU using any dataset. Just upload your text fil…☆12Jan 29, 2019Updated 7 years ago
- Research Papers on Efficient Neural Fields from EffL Group☆16Apr 21, 2025Updated 11 months ago
- ☆12Mar 7, 2024Updated 2 years ago
- Automatically take good care of your preemptible TPUs☆37May 15, 2023Updated 2 years ago
- ☆46Jun 11, 2025Updated 9 months ago
- RevBiFPN: The Fully Reversible Bidirectional Feature Pyramid Network☆15Oct 18, 2022Updated 3 years ago
- ☆24Sep 25, 2024Updated last year
- Why Do We Need Weight Decay in Modern Deep Learning? [NeurIPS 2024]☆71Sep 25, 2024Updated last year
- Accelerated First Order Parallel Associative Scan☆196Jan 7, 2026Updated 2 months ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- ☆21Mar 10, 2026Updated 2 weeks ago
- Jax implementation of the AdaHessian optimizer☆20Mar 11, 2021Updated 5 years ago
- Official PyTorch implementation of CD-MOE☆12Mar 18, 2026Updated last week
- A Python wrapper around the Game Boy Advance emulator mGBA with built-in support for gymnasium environments.☆25Jul 28, 2025Updated 8 months ago
- Mixed integer programming for computing lipschitz constants of ReLU Networks☆17Feb 10, 2023Updated 3 years ago
- Official Repository for ICML 2023 paper "Can Neural Network Memorization Be Localized?"☆21Oct 26, 2023Updated 2 years ago
- The official repo of continuous speculative decoding☆32Mar 28, 2025Updated last year
- Official Code Repository for the paper "Key-value memory in the brain"☆31Feb 25, 2025Updated last year
- [NeurIPS 2024] Official implementation of "NeuralClothSim: Neural Deformation Fields Meet the Thin Shell Theory"☆42Oct 29, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- ☆24Dec 11, 2024Updated last year
- Code and Data for Paper: PanoGen: Text-Conditioned Panoramic Environment Generation for Vision-and-Language Navigation☆80May 31, 2023Updated 2 years ago
- collection of example documents for use within cocalc's library☆17Sep 11, 2025Updated 6 months ago
- H-Net Dynamic Hierarchical Architecture☆81Sep 11, 2025Updated 6 months ago
- Code for "What really matters in matrix-whitening optimizers?"☆23Oct 31, 2025Updated 4 months ago
- A collection of optimizers, some arcane others well known, for Flax.☆29Aug 6, 2021Updated 4 years ago
- Bayesian optimization with Standard Gaussian Processes on high dimensional benchmarks☆22Jun 29, 2025Updated 9 months ago