leibovit / Sparse-Linear-Networks
Code to accompany the paper Sparse Linear Networks with a Fixed Butterfly Structure: Theory and Practice
☆10 · Updated 4 years ago
Alternatives and similar repositories for Sparse-Linear-Networks
Users who are interested in Sparse-Linear-Networks are comparing it to the libraries listed below.
- ☆33 · Updated 2 years ago
- ESGD-M is a stochastic non-convex second order optimizer, suitable for training deep learning models, for PyTorch. ☆56 · Updated 3 years ago
- Code for ICLR 2021 Paper, "Anytime Sampling for Autoregressive Models via Ordered Autoencoding" ☆26 · Updated 2 years ago
- Official code repository for Instance Selection for GANs. ☆44 · Updated 4 years ago
- Official code for paper "Non-Adversarial Image Synthesis with Generative Latent Nearest Neighbors" ☆28 · Updated 5 years ago
- AdaCat ☆49 · Updated 3 years ago
- Implementation of LogAvgExp for Pytorch ☆36 · Updated 5 months ago
- ☆15 · Updated 4 years ago
- Official repository for MaGNET, ICLR 2022 ☆24 · Updated 2 years ago
- Implementation of Metaformer, but in an autoregressive manner ☆27 · Updated 3 years ago
- Implementation of Token Shift GPT - An autoregressive model that solely relies on shifting the sequence space for mixing ☆50 · Updated 3 years ago
- Script and models for clustering LAION-400m CLIP embeddings. ☆26 · Updated 3 years ago
- Unified API to facilitate usage of pre-trained "perceptor" models, a la CLIP ☆40 · Updated 2 years ago
- Pedagogical codebase for a simplified score-based generative model design, with training loop ☆40 · Updated 4 years ago
- Implementation of Hourglass Transformer, in Pytorch, from Google and OpenAI ☆94 · Updated 3 years ago
- Unsupervised diverse image generation via GANs: Partition Guided Mixture of Generative Adversarial Networks ☆13 · Updated 3 years ago
- CLOOB training (JAX) and inference (JAX and PyTorch) ☆73 · Updated 3 years ago
- Latent Diffusion Language Models ☆69 · Updated 2 years ago
- A new play-and-plug method of controlling an existing generative model with conditioning attributes and their compositions. ☆73 · Updated 3 years ago
- Repository for the PopulAtion Parameter Averaging (PAPA) paper ☆27 · Updated last year
- ☆19 · Updated 4 years ago
- Re-implementation of 'Grokking: Generalization beyond overfitting on small algorithmic datasets' ☆38 · Updated 3 years ago
- ☆39 · Updated last year
- Differentiable FFT Conv Layer with Dense Color Channels ☆11 · Updated 3 years ago
- Another attempt at a long-context / efficient transformer by me ☆38 · Updated 3 years ago
- ☆64 · Updated 3 years ago
- Official repository for the paper "Approximating Two-Layer Feedforward Networks for Efficient Transformers" ☆38 · Updated 3 months ago
- codebase for the SIMAT dataset and evaluation ☆38 · Updated 3 years ago
- Official code for the paper "Attention as a Hypernetwork" ☆42 · Updated last year
- Inverts CLIP text embeds to image embeds and visualizes with deep-image-prior. ☆35 · Updated 3 years ago