Sea-Snell / MLLibCpp

A machine learning library capable of training various deep neural networks (RNNs, LSTMs, DBNs, etc.) on a GPU. It makes use of auto-differentiation algorithms. Written in C++ with OpenCL.

☆11

Related projects

Alternatives and complementary repositories for MLLibCpp

lucidrains / quartic-transformer
Exploring an idea where one forgets about efficiency and carries out attention across each edge of the nodes (tokens)
☆43 Updated last month
lucidrains / memory-editable-transformer
My explorations into editing the knowledge and memories of an attention network
☆34 Updated last year
codekansas / rwkv
RWKV model implementation
☆38 Updated last year
google-research / precondition
☆29 Updated 2 weeks ago
lucidrains / token-shift-gpt
Implementation of Token Shift GPT - An autoregressive model that solely relies on shifting the sequence space for mixing
☆47 Updated 2 years ago
lucidrains / rela-transformer
Implementation of a Transformer using ReLA (Rectified Linear Attention) from https://arxiv.org/abs/2104.07012
☆49 Updated 2 years ago
lucidrains / autoregressive-linear-attention-cuda
CUDA implementation of autoregressive linear attention, with all the latest research findings
☆43 Updated last year
srush / triton-autodiff
Experiment in using Tangent to autodiff Triton
☆72 Updated 9 months ago
amirzandieh / HyperAttention
Triton Implementation of HyperAttention Algorithm
☆46 Updated 11 months ago
YeonwooSung / GLOM
PyTorch implementation of GLOM
☆21 Updated 2 years ago
RobertCsordas / moe
Official repository for the paper "Approximating Two-Layer Feedforward Networks for Efficient Transformers"
☆36 Updated 11 months ago
renll / SeqBoat
[NeurIPS 2023] Sparse Modular Activation for Efficient Sequence Modeling
☆35 Updated 11 months ago
HomebrewNLP / Olmax
HomebrewNLP in JAX flavour for maintainable TPU-Training
☆46 Updated 9 months ago
ahennequ / pytorch-custom-mma
☆29 Updated 2 years ago
RobertCsordas / moe_layer
sigma-MoE layer
☆18 Updated 10 months ago
lucidrains / deep-linear-network
A simple implementation of a deep linear Pytorch module
☆18 Updated 4 years ago
lucidrains / all-normalization-transformer
A simple Transformer where the softmax has been replaced with normalization
☆18 Updated 4 years ago
BlinkDL / SmallInitEmb
LayerNorm(SmallInit(Embedding)) in a Transformer to improve convergence
☆45 Updated 2 years ago
proger / nanokitchen
Parallel Associative Scan for Language Models
☆18 Updated 10 months ago
tobiaskatsch / GatedLinearRNN
☆23 Updated 8 months ago
CyndxAI / QKNorm
Code for the paper "Query-Key Normalization for Transformers"
☆34 Updated 3 years ago
xiayuqing0622 / customized-flash-attention
Fast and memory-efficient exact attention
☆26 Updated this week
HomebrewNLP / HomebrewNLP
A case study of efficient training of large language models using commodity hardware.
☆68 Updated 2 years ago
lucidrains / gateloop-transformer
Implementation of GateLoop Transformer in Pytorch and Jax
☆86 Updated 4 months ago
lucidrains / product-key-memory
Standalone Product Key Memory module in Pytorch - for augmenting Transformer models
☆72 Updated 3 months ago
kyegomez / Blockwise-Parallel-Transformer
32 times longer context window than vanilla Transformers and up to 4 times longer than memory efficient Transformers.
☆42 Updated last year