facebookresearch / projUNNLinks

Fast training of unitary deep network layers from low-rank updates

☆28

Alternatives and similar repositories for projUNN

Users that are interested in projUNN are comparing it to the libraries listed below

Sorting:

gregorbachmann / scaling_mlps
☆51Updated last year
vvvm23 / mamba-jax
Unofficial but Efficient Implementation of "Mamba: Linear-Time Sequence Modeling with Selective State Spaces" in JAX
☆84Updated last year
AhmedImtiazPrio / grok-adversarial
Deep Networks Grok All the Time and Here is Why
☆37Updated last year
f-dangel / vivit
[TMLR 2022] Curvature access through the generalized Gauss-Newton's low-rank structure: Eigenvalues, eigenvectors, directional derivative…
☆17Updated last year
shikaiqiu / compute-better-spent
☆53Updated 9 months ago
lucidrains / autoregressive-linear-attention-cuda
CUDA implementation of autoregressive linear attention, with all the latest research findings
☆44Updated 2 years ago
google-deepmind / dks
Multi-framework implementation of Deep Kernel Shaping and Tailored Activation Transformations, which are methods that modify neural netwo…
☆71Updated 2 weeks ago
dylandoblar / noether-networks
Meta-learning inductive biases in the form of useful conserved quantities.
☆37Updated 2 years ago
graphcore-research / out-of-the-box-fp8-training
Demo of the unit_scaling library, showing how a model can be easily adapted to train in FP8.
☆46Updated last year
pytorch / maskedtensor
MaskedTensors for PyTorch
☆38Updated 3 years ago
vvvm23 / mezo-jax
JAX implementation of "Fine-Tuning Language Models with Just Forward Passes"
☆19Updated 2 years ago
sjunhongshen / DASH
☆23Updated 2 years ago
lucidrains / einops-exts
Implementation of some personal helper functions for Einops, my most favorite tensor manipulation library ❤️
☆54Updated 2 years ago
ClashLuke / tpucare
Automatically take good care of your preemptible TPUs
☆36Updated 2 years ago
stanislavfort / dissect-git-re-basin
Replicating and dissecting the git-re-basin project in one-click-replication Colabs
☆36Updated 2 years ago
matthias-wright / jax-fid
FID computation in Jax/Flax.
☆28Updated last year
edwardjhu / TP4
Code accompanying our paper "Feature Learning in Infinite-Width Neural Networks" (https://arxiv.org/abs/2011.14522)
☆62Updated 4 years ago
AndPotap / einsum-search
☆32Updated 9 months ago
teddykoker / learning-to-learn-jax
JAX implementation of Learning to learn by gradient descent by gradient descent
☆27Updated 9 months ago
srush / triton-autodiff
Experiment of using Tangent to autodiff triton
☆79Updated last year
radarFudan / mamba-minimal-jax
☆31Updated 7 months ago
ssnl / poisson_quasimetric_embedding
Open source code for paper "On the Learning and Learnability of Quasimetrics".
☆32Updated 2 years ago
tychovdo / lila
Code for "Invariance Learning in Deep Neural Networks with Differentiable Laplace Approximations"
☆23Updated 2 years ago
google-deepmind / spectral_ssm
☆32Updated last year
4rtemi5 / imax
Image augmentation library for Jax
☆39Updated last year
toshas / torch-householder
Efficient Householder Transformation in PyTorch
☆66Updated 4 years ago
srush / mamba-scans
Blog post
☆17Updated last year
bhoov / energy-transformer-jax
The Energy Transformer block, in JAX
☆57Updated last year
ermongroup / fast_feedforward_computation
Official code for "Accelerating Feedforward Computation via Parallel Nonlinear Equation Solving", ICML 2021
☆27Updated 3 years ago
IDSIA / recurrent-fwp
Official repository for the paper "Going Beyond Linear Transformers with Recurrent Fast Weight Programmers" (NeurIPS 2021)
☆49Updated last month