nikopj / FlashAttention.jl
Julia implementation of the flash-attention operation for neural networks.
☆11 · Updated 2 years ago
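For context, flash attention computes exact softmax attention by streaming over key/value blocks while maintaining a running ("online") softmax, so the full N×N score matrix is never materialized. The sketch below is a minimal plain-Julia illustration of that tiling scheme under those assumptions; it is not the FlashAttention.jl API, and the names (`flash_attention`, `block`) are illustrative only.

```julia
# Minimal sketch of the tiled "online softmax" computation behind flash
# attention, in plain Julia. Illustrative only: not the FlashAttention.jl API.

# Q, K, V are (d, N) matrices: feature dimension d, sequence length N.
function flash_attention(Q::AbstractMatrix, K::AbstractMatrix, V::AbstractMatrix; block::Int=64)
    d, N = size(Q)
    scale = eltype(Q)(1 / sqrt(d))
    O = zeros(eltype(Q), d, N)       # unnormalized output accumulator
    m = fill(typemin(eltype(Q)), N)  # running per-query score maxima
    l = zeros(eltype(Q), N)          # running softmax denominators
    for j in 1:block:N               # stream over key/value blocks
        cols = j:min(j + block - 1, N)
        S = (Q' * K[:, cols]) .* scale           # (N, |cols|) score tile
        m_new = max.(m, vec(maximum(S, dims=2))) # updated maxima
        P = exp.(S .- m_new)                     # tile weights, shifted by new maxima
        c = exp.(m .- m_new)                     # rescales previously accumulated sums
        l = l .* c .+ vec(sum(P, dims=2))
        O = O .* c' .+ V[:, cols] * P'
        m = m_new
    end
    return O ./ l'                   # final softmax normalization
end

# Usage: exact attention output without forming the full N×N score matrix.
Q, K, V = randn(Float32, 64, 256), randn(Float32, 64, 256), randn(Float32, 64, 256)
O = flash_attention(Q, K, V; block=32)   # (64, 256)
```

Real implementations fuse these steps into a single GPU kernel and tile over queries as well; the point here is only the running-maximum and running-denominator bookkeeping that keeps memory use linear in sequence length.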
Alternatives and similar repositories for FlashAttention.jl
Users interested in FlashAttention.jl are comparing it to the libraries listed below.
- Julia implementation of the Flash Attention algorithm ☆19 · Updated 2 years ago
- Simple, blazing-fast transformer components. ☆23 · Updated 2 years ago
- Distributed Data Parallel Training of Deep Neural Networks ☆57 · Updated last year
- Differentiable matrix factorizations using ImplicitDifferentiation.jl. ☆30 · Updated last year
- ☆19 · Updated last year
- Integrating Neural Ordinary Differential Equations, the Method of Lines, and Graph Neural Networks ☆18 · Updated last year
- Implicit Layer Machine Learning via Deep Equilibrium Networks, O(1) backpropagation with accelerated convergence. ☆57 · Updated 3 weeks ago
- "Maybe we have our own magic." ☆47 · Updated 5 years ago
- Differentiate Python calls from Julia ☆55 · Updated 3 years ago
- Data structures for graph neural networks ☆18 · Updated last year
- GPU integrations for Dagger.jl ☆54 · Updated 2 months ago
- Programming GEMM kernels on NVIDIA GPUs with Tensor Cores in Julia ☆42 · Updated 3 weeks ago
- ☆22 · Updated 2 years ago
- Machine learning from scratch in Julia ☆32 · Updated 6 months ago
- A Julia wrapper for the NVIDIA Collective Communications Library. ☆28 · Updated 2 weeks ago
- Code for the paper https://arxiv.org/abs/2306.07961 ☆53 · Updated last year
- Structure Preserving Machine Learning Models in Julia ☆50 · Updated 2 weeks ago
- Accelerate your ML research using pre-built Deep Learning Models with Lux ☆42 · Updated this week
- Optimisers.jl defines many standard optimisers and utilities for learning loops. ☆89 · Updated 5 months ago
- Julia implementation of stochastic optimization algorithms for large-scale optimal transport. ☆18 · Updated 4 years ago
- Physics-Enhanced Regression for Initial Value Problems ☆20 · Updated last year
- ☆28 · Updated 3 years ago
- Curated list of high-quality operators for deep learning in Julia ☆40 · Updated 3 years ago
- Reusable functionality for defining custom attention/transformer layers. ☆53 · Updated 3 weeks ago
- Implementations of Infinitesimal Continuous Normalizing Flows Algorithms in Julia ☆27 · Updated this week
- Data-parallelism on CUDA using Transducers.jl and for loops (FLoops.jl) ☆57 · Updated 2 years ago
- Immutables as mutables, mutables as immutables. ☆22 · Updated 6 months ago
- Cellular automata creation and analysis tools