SuReLI / NeurOpsLinks
Implementations of growing and pruning in neural networks
☆22Updated 2 years ago
Alternatives and similar repositories for NeurOps
Users that are interested in NeurOps are comparing it to the libraries listed below
Sorting:
- Minimum Description Length probing for neural network representations☆20Updated last year
- JAX/Flax implementation of the Hyena Hierarchy☆34Updated 2 years ago
- Engineering the state of RNN language models (Mamba, RWKV, etc.)☆32Updated last year
- PyTorch implementation for "Long Horizon Temperature Scaling", ICML 2023☆20Updated 2 years ago
- An implementation of (Induced) Set Attention Block, from the Set Transformers paper☆67Updated 3 years ago
- Code Release for "Broken Neural Scaling Laws" (BNSL) paper☆59Updated 2 years ago
- Recursive Leasting Squares (RLS) with Neural Network for fast learning☆59Updated 2 years ago
- Code for minimum-entropy coupling.☆32Updated last month
- AdaCat☆49Updated 3 years ago
- Official repository for the paper "Can You Learn an Algorithm? Generalizing from Easy to Hard Problems with Recurrent Networks"☆61Updated 3 years ago
- Quantification of Uncertainty with Adversarial Models☆29Updated 2 years ago
- Source-to-Source Debuggable Derivatives in Pure Python☆15Updated 2 years ago
- A python library for highly configurable transformers - easing model architecture search and experimentation.☆49Updated 4 years ago
- Usable implementation of Emerging Symbol Binding Network (ESBN), in Pytorch☆25Updated 5 years ago
- Standalone Product Key Memory module in Pytorch - for augmenting Transformer models☆87Updated 3 months ago
- Google Research☆46Updated 3 years ago
- Embedding Recycling for Language models☆38Updated 2 years ago
- Implementation of a holodeck, written in Pytorch☆18Updated 2 years ago
- Sequence Modeling with Multiresolution Convolutional Memory (ICML 2023)☆127Updated 2 years ago
- Layerwise Batch Entropy Regularization☆24Updated 3 years ago
- Implementation of a Transformer using ReLA (Rectified Linear Attention) from https://arxiv.org/abs/2104.07012☆49Updated 3 years ago
- ☆31Updated 2 weeks ago
- An annotated implementation of the Hyena Hierarchy paper☆34Updated 2 years ago
- Explorations into adversarial losses on top of autoregressive loss for language modeling☆41Updated last month
- Implementation of Gradient Agreement Filtering, from Chaubard et al. of Stanford, but for single machine microbatches, in Pytorch☆25Updated last year
- Experiments on GPT-3's ability to fit numerical models in-context.☆14Updated 3 years ago
- Tensorflow implementation and notebooks for Implicit Maximum Likelihood Estimation☆67Updated 3 years ago
- Implementation of Metaformer, but in an autoregressive manner☆26Updated 3 years ago
- Code for "Counterfactual Token Generation in Large Language Models", Arxiv 2024.☆32Updated last year
- My explorations into editing the knowledge and memories of an attention network☆35Updated 3 years ago