lucidrains / decomp-opt-pytorch

☆29

Related projects: ⓘ

lucidrains / coordinate-descent-attention
Implementation of an Attention layer where each head can attend to more than just one token, using coordinate descent to pick topk
☆46Updated last year
lucidrains / taylor-series-linear-attention
Explorations into the recently proposed Taylor Series Linear Attention
☆85Updated last month
lucidrains / ReST-EM-pytorch
☆42Updated this week
lucidrains / infini-transformer-pytorch
Implementation of Infini-Transformer in Pytorch
☆100Updated last month
lucidrains / autoregressive-linear-attention-cuda
CUDA implementation of autoregressive linear attention, with all the latest research findings
☆43Updated last year
facebookresearch / lagrangian-ot
Neural Optimal Transport with Lagrangian Costs
☆37Updated 2 months ago
lucidrains / frame-averaging-pytorch
Pytorch implementation of a simple way to enable (Stochastic) Frame Averaging for any network
☆45Updated last month
lucidrains / kalman-filtering-attention
Implementation of the Kalman Filtering Attention proposed in "Kalman Filtering Attention for User Behavior Modeling in CTR Prediction"
☆56Updated 10 months ago
lucidrains / mixture-of-attention
Some personal experiments around routing tokens to different autoregressive attention, akin to mixture-of-experts
☆101Updated last year
lucidrains / self-reasoning-tokens-pytorch
Exploration into the proposed "Self Reasoning Tokens" by Felipe Bonetto
☆53Updated 4 months ago
crowsonkb / LDLM
Latent Diffusion Language Models
☆66Updated last year
lucidrains / multimodal-dit-pytorch
Implementation of a multimodal diffusion transformer in Pytorch
☆92Updated 2 months ago
facebookresearch / adaptive_scheduling
Experimental scripts for researching data adaptive learning rate scheduling.
☆23Updated 11 months ago
lucidrains / holodeck-pytorch
Implementation of a holodeck, written in Pytorch
☆17Updated 10 months ago
lucidrains / quartic-transformer
Exploring an idea where one forgets about efficiency and carries out attention across each edge of the nodes (tokens)
☆40Updated 7 months ago
lucidrains / sinkhorn-router-pytorch
Self contained pytorch implementation of a sinkhorn based router, for mixture of experts or otherwise
☆31Updated 3 weeks ago
lucidrains / firefly-torch
Exploration into the Firefly algorithm in Pytorch
☆31Updated this week
lucidrains / tranception-pytorch
Implementation of Tranception, an attention network, paired with retrieval, that is SOTA for protein fitness prediction
☆31Updated 2 years ago
lazaratan / meta-flow-matching
Meta Flow Matching: Integrating Vector Fields on the Wasserstein Manifold
☆33Updated last week
giannisdaras / ambient-tweedie
[ICML 2024]: Official implementation for the paper: "Consistent Diffusion Meets Tweedie"
☆44Updated 4 months ago
lucidrains / magvit-pytorch
☆24Updated this week
lucidrains / spatial-rmsnorm
☆11Updated this week
lucidrains / genetic-algorithm-pytorch
Toy genetic algorithm in Pytorch
☆28Updated 6 months ago
amirzandieh / HyperAttention
Triton Implementation of HyperAttention Algorithm
☆46Updated 9 months ago
lucidrains / hyena-pytorch
☆23Updated this week
Zyphra / Zamba2
PyTorch implementation of models from the Zamba2 series.
☆63Updated last month
lucidrains / AMIE-pytorch
Implementation of the general framework for AMIE, from the paper "Towards Conversational Diagnostic AI", out of Google Deepmind
☆51Updated this week
cloneofsimo / imagenet.int8
☆31Updated 4 months ago
lucidrains / gamengen-pytorch
Implementation of a framework for Gamengen in Pytorch
☆81Updated this week
lucidrains / token-shift-gpt
Implementation of Token Shift GPT - An autoregressive model that solely relies on shifting the sequence space for mixing
☆47Updated 2 years ago