zawagner22 / transformers_math_experimentsLinks
☆18Updated 9 months ago
Alternatives and similar repositories for transformers_math_experiments
Users that are interested in transformers_math_experiments are comparing it to the libraries listed below
Sorting:
- Analogous Safe-state Exploration (ASE) is an algorithm for provably safe and optimal exploration in MDPs with unknown, stochastic dynamic…☆11Updated 4 years ago
- [AAAI'21] Modeling Deep Learning Based Privacy Attacks on Physical Mail☆13Updated 4 years ago
- ☆11Updated last year
- Codebase for plCoP, a Prolog Technology Reinforcement Learning Prover☆12Updated 4 years ago
- Implementation of the LOSSGRAD optimization algorithm☆15Updated 6 years ago
- RWKV model implementation☆38Updated 2 years ago
- Symbolic Brittleness in Sequence Models: on Systematic Generalization in Symbolic Mathematics (AAAI 2022)☆14Updated 3 years ago
- LeanAgent is a novel lifelong learning framework for formal theorem proving that continuously generalizes to and improves on ever-expandi…☆27Updated last month
- ☆23Updated 7 months ago
- AdaCat☆49Updated 2 years ago
- Universal Python binding for the LMDB 'Lightning' Database☆13Updated 7 years ago
- Residual Quantization Autoencoder, used for interpreting LLMs☆12Updated 6 months ago
- Code for the paper "Stack Attention: Improving the Ability of Transformers to Model Hierarchical Patterns"☆17Updated last year
- An attempt to reimplement the 2013 paper by Wissner-Gross & Freer☆10Updated 6 months ago
- code for paper "Accessing higher dimensions for unsupervised word translation"☆21Updated 2 years ago
- Sparse Circuits on the GPU (ICLR2025)☆12Updated last month
- [Oral; Neurips OPT2024 ] μLO: Compute-Efficient Meta-Generalization of Learned Optimizers☆13Updated 4 months ago
- Code for D. Matthews, S. Kriegman, C. Cappelle and J. Bongard, "Word2vec to behavior: morphology facilitates the grounding of language in…☆15Updated 5 years ago
- Tensorflow 2.x implementation of Gradient Origin Networks☆12Updated 5 years ago
- PyTorch reimplementation of the paper "HyperMixer: An MLP-based Green AI Alternative to Transformers" [arXiv 2022].☆17Updated 3 years ago
- ☆31Updated last year
- Unofficially Implements https://arxiv.org/abs/2112.05682 to get Linear Memory Cost on Attention for PyTorch☆12Updated 3 years ago
- Fine-Tuning Pre-trained Transformers into Decaying Fast Weights☆19Updated 2 years ago
- Investigate the speed of adaptation of structural causal models☆15Updated 4 years ago
- Companion repository to "Prompt Compression and Contrastive Conditioning for Controllability and Toxicity Reduction in Language Models"☆13Updated 2 years ago
- Kalman Optimization for Value Approximation☆11Updated 5 years ago
- ARLC, a probabilistic abductive reasoner for solving Raven's progressive matrices.☆18Updated 2 months ago
- Official repository for the paper "Exploring the Promise and Limits of Real-Time Recurrent Learning" (ICLR 2024)☆11Updated last month
- Official repository for the paper "Neural Differential Equations for Learning to Program Neural Nets Through Continuous Learning Rules" (…☆23Updated last month
- DiCE: The Infinitely Differentiable Monte-Carlo Estimator☆31Updated last year