Priesemann-Group / Infomorphic_NetworksLinks
☆16Updated last year
Alternatives and similar repositories for Infomorphic_Networks
Users that are interested in Infomorphic_Networks are comparing it to the libraries listed below
Sorting:
- PyTorch implementation for "Long Horizon Temperature Scaling", ICML 2023☆20Updated 2 years ago
- Yet another random morning idea to be quickly tried and architecture shared if it works; to allow the transformer to pause for any amount…☆53Updated 2 years ago
- ☆16Updated 9 months ago
- [TMLR 2024] Revisiting Random Weight Perturbation for Efficiently Improving Generalization☆12Updated last year
- Implementation of 2-simplicial attention proposed by Clift et al. (2019) and the recent attempt to make practical in Fast and Simplex, Ro…☆47Updated 3 months ago
- Explorations into the recently proposed Taylor Series Linear Attention☆100Updated last year
- Implementation of GateLoop Transformer in Pytorch and Jax☆91Updated last year
- Mixture-of-Experts Multimodal Variational Autoencoder☆14Updated 5 months ago
- Code for lin-RFM used for sparse recovery tasks☆15Updated 8 months ago
- Implementation of Strassen attention, from Kozachinskiy et al. of National Center of AI in Chile☆41Updated 5 months ago
- Recycling diverse models☆46Updated 2 years ago
- Exploring an idea where one forgets about efficiency and carries out attention across each edge of the nodes (tokens)☆55Updated 8 months ago
- A tool to convert image of sheet music into an .wav audio file☆18Updated 3 months ago
- A State-Space Model with Rational Transfer Function Representation.☆83Updated last year
- Implementation for robust ViT and scaled attention☆21Updated 8 months ago
- some mixture of experts architecture implementations☆23Updated last year
- Explorations into the proposal from the paper "Grokfast, Accelerated Grokking by Amplifying Slow Gradients"☆103Updated 11 months ago
- We study toy models of skill learning.☆31Updated 10 months ago
- ☆34Updated last month
- [ACL2024 Findings]DMoERM: Recipes of Mixture-of-Experts for Effective Reward Modeling☆18Updated last year
- This repository includes code to reproduce the tables in "Loss Landscapes are All You Need: Neural Network Generalization Can Be Explaine…☆40Updated 2 years ago
- Code for experiments on transformers using Markovian data.☆21Updated last year
- Fork of Flame repo for training of some new stuff in development☆19Updated last week
- The official Pytorch implementation of the paper "Fourier Transformer: Fast Long Range Modeling by Removing Sequence Redundancy with FFT …☆39Updated last year
- Explorations into whether a transformer with RL can direct a genetic algorithm to converge faster☆71Updated 6 months ago
- Causal Attention with Lookahead Keys☆26Updated 2 months ago
- [ICLR 2025] Official Code Release for Explaining Modern Gated-Linear RNNs via a Unified Implicit Attention Formulation☆47Updated 9 months ago
- Tiny re-implementation of MDM in style of LLaDA and nano-gpt speedrun☆57Updated 9 months ago
- https://x.com/BlinkDL_AI/status/1884768989743882276☆28Updated 7 months ago
- The AI-PMS Microservice uses AI to predict aircraft system failures before they occur, optimizing maintenance and enhancing safety. This …☆13Updated last year