qu-gg / torch-hypernetwork-tutorialsLinks
Hypernetwork training considerations and implementation types in PyTorch. Includes classification and time-series examples alongside 1D GroupConv Parallelization.
☆20Updated 2 years ago
Alternatives and similar repositories for torch-hypernetwork-tutorials
Users that are interested in torch-hypernetwork-tutorials are comparing it to the libraries listed below
Sorting:
- Reading list for research topics in state-space models☆316Updated 2 months ago
- ☆65Updated 3 years ago
- Library for Jacobian descent with PyTorch. It enables the optimization of neural networks with multiple losses (e.g. multi-task learning)…☆264Updated this week
- PyTorch implementation of Structured State Space for Sequence Modeling (S4), based on Annotated S4.☆83Updated last year
- Parallelizing non-linear sequential models over the sequence length☆53Updated last month
- Pytorch implementation of Simplified Structured State-Spaces for Sequence Modeling (S5)☆77Updated last year
- Official implementation of "Hydra: Bidirectional State Space Models Through Generalized Matrix Mixers"☆151Updated 6 months ago
- Modern Fixed Point Systems using Pytorch☆103Updated last year
- A minimal PyTorch implementation of the VQ-VAE model described in "Neural Discrete Representation Learning".☆75Updated 3 years ago
- Implementations of various linear RNN layers using pytorch and triton☆53Updated 2 years ago
- Package for working with hypernetworks in PyTorch.☆129Updated last year
- [NeurIPS'24 Oral] Official repository for the paper "Scale Equivariant Graph Metanetworks"☆21Updated 8 months ago
- VQ-VAE/GAN implementation in pytorch-lightning☆45Updated 9 months ago
- ☆137Updated last year
- [ICLR'25] Artificial Kuramoto Oscillatory Neurons☆96Updated this week
- Relative representations can be leveraged to enable solving tasks regarding "latent communication": from zero-shot model stitching to lat…☆60Updated 2 years ago
- Implementation of ST-Moe, the latest incarnation of MoE after years of research at Brain, in Pytorch☆353Updated last year
- Trying out the Mamba architecture on small examples (cifar-10, shakespeare char level etc.)☆48Updated last year
- Code Repository for the ICML 2024 paper: "Towards Scalable and Versatile Weight Space Learning".☆20Updated 11 months ago
- Minimal Implementation of a D3PM in pytorch☆245Updated last year
- Unofficial implementation of Linear Recurrent Units, by Deepmind, in Pytorch☆71Updated 3 months ago
- Official PyTorch Implementation of the Longhorn Deep State Space Model☆54Updated 8 months ago
- Official implementation of Transformer Neural Processes☆78Updated 2 years ago
- Modular and intuitive Hypernetworks in Pytorch☆37Updated last year
- Collection of papers on state-space models☆595Updated 3 months ago
- ☆70Updated 6 months ago
- A Triton Kernel for incorporating Bi-Directionality in Mamba2☆74Updated 7 months ago
- ☆298Updated 7 months ago
- Official implementation of Phi-Mamba. A MOHAWK-distilled model (Transformers to SSMs: Distilling Quadratic Knowledge to Subquadratic Mode…☆113Updated 11 months ago
- Discrete Flow Matching implemented in PyTorch☆17Updated 4 months ago