unixpickle / sgdstoreView external linksLinks
Augmented RNN memory via live SGD
☆42Apr 25, 2017Updated 8 years ago
Alternatives and similar repositories for sgdstore
Users that are interested in sgdstore are comparing it to the libraries listed below
Sorting:
- Code for the paper "Stack Attention: Improving the Ability of Transformers to Model Hierarchical Patterns"☆18Mar 15, 2024Updated last year
- Official repository for the paper "Exploring the Promise and Limits of Real-Time Recurrent Learning" (ICLR 2024)☆13Jun 11, 2025Updated 8 months ago
- lanmt ebm☆12Jun 19, 2020Updated 5 years ago
- Combining SOAP and MUON☆19Feb 11, 2025Updated last year
- Fine-Tuning Pre-trained Transformers into Decaying Fast Weights☆19Oct 9, 2022Updated 3 years ago
- Easily serialize dataclasses to and from tensors (PyTorch, NumPy)☆18Apr 10, 2021Updated 4 years ago
- Curse-of-memory phenomenon of RNNs in sequence modelling☆19May 8, 2025Updated 9 months ago
- A fusion of a linear layer and a cross entropy loss, written for pytorch in triton.☆75Aug 2, 2024Updated last year
- Low-variance and unbiased gradient for backpropagation through categorical random variables, with application in variational auto-encoder…☆17Jul 1, 2020Updated 5 years ago
- PyTorch Implementation of NeurIPS 2020 paper "Learning Sparse Prototypes for Text Generation"☆22Jul 8, 2021Updated 4 years ago
- CUDA 12.2 HMM demos☆20Jul 26, 2024Updated last year
- The official repository for our paper "The Neural Data Router: Adaptive Control Flow in Transformers Improves Systematic Generalization".☆34Jun 11, 2025Updated 8 months ago
- Experiments on the impact of depth in transformers and SSMs.☆40Oct 23, 2025Updated 3 months ago
- Official repository of paper "RNNs Are Not Transformers (Yet): The Key Bottleneck on In-context Retrieval"☆27Apr 17, 2024Updated last year
- Official code for "Accelerating Feedforward Computation via Parallel Nonlinear Equation Solving", ICML 2021☆29Sep 25, 2021Updated 4 years ago
- Web app to annotate word onsets and offsets on spectrograms☆28Aug 12, 2022Updated 3 years ago
- Frictionless Machine Learning on Kubernetes☆15Mar 7, 2023Updated 2 years ago
- Official PyTorch implementation of Time-aware Large Kernel (TaLK) Convolutions (ICML 2020)☆29Dec 9, 2020Updated 5 years ago
- Implementation for "Rational Recurrences", Peng et al., EMNLP 2018.☆28Jun 21, 2022Updated 3 years ago
- ☆11Jul 29, 2019Updated 6 years ago
- ☆12Dec 20, 2018Updated 7 years ago
- VideoEval: Comprehensive Benchmark Suite for Low-Cost Evaluation of Video Foundation Model☆14Jul 31, 2025Updated 6 months ago
- The code for the paper "A Bayesian Approach to Online Planning" published in ICML 2024.☆13Jun 17, 2024Updated last year
- Backup your images from pinterest.com☆14Apr 30, 2020Updated 5 years ago
- Identifies music based on microphone input. By request of someone in r/learnprogramming.☆19Dec 8, 2012Updated 13 years ago
- PyTorch Implementation for the paper "Let Me Help You! Neuro-Symbolic Short-Context Action Anticipation" accepted to RA-L'24.☆12Nov 27, 2024Updated last year
- ☆40May 2, 2021Updated 4 years ago
- Video-aided Unsupervised Grammar Induction, NAACL‘21 [best long paper]☆40Oct 27, 2022Updated 3 years ago
- Worked example of the process from Python source to CUDA kernel execution with Numba☆45Sep 11, 2024Updated last year
- Online streaming speaker change detection model in Pytorch☆44Apr 14, 2023Updated 2 years ago
- ☆66Jul 8, 2025Updated 7 months ago
- Advanced Formal Language Theory (263-5352-00L; Frühjahr 2023)☆10Feb 21, 2023Updated 2 years ago
- [ICLR 2026] [NeurIPS 2025] ViPRA: Video Prediction for Robot Actions☆24Jan 27, 2026Updated 2 weeks ago
- A basic implementation of a Kohonen map in JavaScript☆12Dec 9, 2022Updated 3 years ago
- ☆13Feb 8, 2017Updated 9 years ago
- ☆10May 24, 2021Updated 4 years ago
- Online partial evaluator for pure Prolog programs (with built-ins)☆12Jan 12, 2026Updated last month
- Datasets for compositional learning☆11Nov 28, 2018Updated 7 years ago
- Gaussian Splating 2d implemented in triton☆11Mar 19, 2024Updated last year