Augmented RNN memory via live SGD
☆42Apr 25, 2017Updated 8 years ago
Alternatives and similar repositories for sgdstore
Users that are interested in sgdstore are comparing it to the libraries listed below
Sorting:
- lanmt ebm☆12Jun 19, 2020Updated 5 years ago
- A fusion of a linear layer and a cross entropy loss, written for pytorch in triton.☆75Aug 2, 2024Updated last year
- Code for the paper "Stack Attention: Improving the Ability of Transformers to Model Hierarchical Patterns"☆18Mar 15, 2024Updated 2 years ago
- Fine-Tuning Pre-trained Transformers into Decaying Fast Weights☆19Oct 9, 2022Updated 3 years ago
- Combining SOAP and MUON☆19Feb 11, 2025Updated last year
- Easily serialize dataclasses to and from tensors (PyTorch, NumPy)☆18Apr 10, 2021Updated 4 years ago
- Capsule networks can defend against adversarial attacks using reconstruction error☆13May 24, 2018Updated 7 years ago
- ☆17Oct 31, 2023Updated 2 years ago
- PyTorch Implementation of NeurIPS 2020 paper "Learning Sparse Prototypes for Text Generation"☆22Jul 8, 2021Updated 4 years ago
- Curse-of-memory phenomenon of RNNs in sequence modelling☆19May 8, 2025Updated 10 months ago
- Official code for "Accelerating Feedforward Computation via Parallel Nonlinear Equation Solving", ICML 2021☆29Sep 25, 2021Updated 4 years ago
- CUDA 12.2 HMM demos☆20Jul 26, 2024Updated last year
- Experiments on the impact of depth in transformers and SSMs.☆41Oct 23, 2025Updated 4 months ago
- Tex template for SPEIT engineer internship report.☆12Nov 1, 2019Updated 6 years ago
- Towards an implementation of hierarchical temporal memory and the cortical learning algorithm by Jeff Hawkins and Dileep George of Nument…☆12Mar 15, 2017Updated 9 years ago
- A CLOS implementation of an in memory hypergraph database and semantic networks.☆11Feb 16, 2021Updated 5 years ago
- Marimekko and bar mekko graphics in R☆10Jun 7, 2025Updated 9 months ago
- The official repository for our paper "The Neural Data Router: Adaptive Control Flow in Transformers Improves Systematic Generalization".☆34Jun 11, 2025Updated 9 months ago
- Visual programming language: SKetches of Abstract Syntax Trees. I. C.☆10Jan 14, 2022Updated 4 years ago
- Repository for the code and dataset for the paper: "Have LLMs Advanced enough? Towards Harder Problem Solving Benchmarks For Large Langu…☆39Dec 18, 2023Updated 2 years ago
- A basic implementation of a Kohonen map in JavaScript☆12Dec 9, 2022Updated 3 years ago
- Engineering the state of RNN language models (Mamba, RWKV, etc.)☆32May 25, 2024Updated last year
- S2APLER: S2 Agglomeration of Papers with Low Error Rate (it's for academic paper clustering)☆21Nov 4, 2025Updated 4 months ago
- Official PyTorch implementation of Time-aware Large Kernel (TaLK) Convolutions (ICML 2020)☆29Dec 9, 2020Updated 5 years ago
- Embroid: Unsupervised Prediction Smoothing Can Improve Few-Shot Classification☆11Aug 12, 2023Updated 2 years ago
- A softmax multi-armed bandit algorithm☆12Dec 30, 2018Updated 7 years ago
- Pytorch implementation of Count-ception and custom CNN counting models for Kaggle Sea Lion Count challenge☆10Jun 30, 2017Updated 8 years ago
- The bootstrapping PEG parser☆10Feb 18, 2024Updated 2 years ago
- Rethinking Bottleneck Structure for Efficient Mobile Network Design☆12Jul 22, 2020Updated 5 years ago
- Implementation of generative semantic grammar.☆17Jun 2, 2022Updated 3 years ago