demegire / Parameterization-of-Hypercomplex-Multiplications
A reproduction, by Ege Demir and Mehmet Barutçu, of the paper 'Beyond Fully-Connected Layers with Quaternions: Parameterization of Hypercomplex Multiplications with 1/n Parameters'.
☆12 · Updated 3 years ago
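The core idea the repository reproduces, the parameterized hypercomplex multiplication (PHM) layer, replaces a dense weight matrix with a sum of Kronecker products, which parameterizes the layer with roughly 1/n of the usual weight count. A minimal NumPy sketch under that reading of the paper (function name and shapes are illustrative, not the repo's actual API):

```python
import numpy as np

def phm_linear(x, A, S):
    """PHM layer: y = W @ x, where W = sum_i kron(A[i], S[i]).

    A: (n, n, n) learned "algebra" matrices (generalizing the
       fixed quaternion multiplication rules).
    S: (n, d_out // n, d_in // n) learned weight blocks.
    The full W is (d_out, d_in) but is described by only
    n**3 + d_out * d_in / n parameters instead of d_out * d_in.
    """
    W = sum(np.kron(A[i], S[i]) for i in range(A.shape[0]))
    return W @ x

n, d_in, d_out = 4, 16, 16
rng = np.random.default_rng(0)
A = rng.standard_normal((n, n, n))
S = rng.standard_normal((n, d_out // n, d_in // n))
x = rng.standard_normal(d_in)

y = phm_linear(x, A, S)          # shape (16,)
dense_params = d_out * d_in      # 256 for a plain linear layer
phm_params = A.size + S.size     # 128 here; the saving approaches
                                 # 1/n as d_out * d_in grows past n**3
```

With n = 4 the layer specializes to quaternion-style multiplication; larger layers see savings closer to the paper's 1/n figure, since the n³ term for A becomes negligible.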
Alternatives and similar repositories for Parameterization-of-Hypercomplex-Multiplications:
Users interested in Parameterization-of-Hypercomplex-Multiplications are comparing it to the libraries listed below.
- Calculating expected time for training LLMs ☆38 · Updated last year
- ResiDual: Transformer with Dual Residual Connections (https://arxiv.org/abs/2304.14802) ☆93 · Updated last year
- Implementation of TableFormer, Robust Transformer Modeling for Table-Text Encoding, in PyTorch ☆37 · Updated 3 years ago
- [NeurIPS 2022] Your Transformer May Not Be as Powerful as You Expect (official implementation) ☆34 · Updated last year
- ☆51 · Updated 2 years ago
- Implementation of Token Shift GPT, an autoregressive model that relies solely on shifting the sequence space for mixing ☆48 · Updated 3 years ago
- Implementation of N-Grammer, augmenting Transformers with latent n-grams, in PyTorch ☆73 · Updated 2 years ago
- Explorations into editing the knowledge and memories of an attention network ☆34 · Updated 2 years ago
- A PyTorch implementation of Luna: Linear Unified Nested Attention ☆41 · Updated 3 years ago
- ☆64 · Updated 7 months ago
- Implementation of Gated State Spaces, from the paper "Long Range Language Modeling via Gated State Spaces", in PyTorch ☆99 · Updated 2 years ago
- Implementation of a Light Recurrent Unit in PyTorch ☆47 · Updated 5 months ago
- Relative Positional Encoding for Transformers with Linear Complexity ☆62 · Updated 3 years ago
- Unofficial PyTorch implementation of pNLP-Mixer: an Efficient all-MLP Architecture for Language (https://arxiv.org/abs/2202.04350) ☆63 · Updated 3 years ago
- A simple Torch implementation of high-performance Multi-Query Attention ☆16 · Updated last year
- Implementation of a Transformer using ReLA (Rectified Linear Attention) from https://arxiv.org/abs/2104.07012 ☆49 · Updated 2 years ago
- Implementation of the Kalman Filtering Attention proposed in "Kalman Filtering Attention for User Behavior Modeling in CTR Prediction" ☆57 · Updated last year
- ☆20 · Updated last year
- Randomized Positional Encodings Boost Length Generalization of Transformers ☆80 · Updated last year
- Prompt-based few-shot learning: text generation with prompting ☆13 · Updated last year
- Implementation of Memory-Compressed Attention, from the paper "Generating Wikipedia by Summarizing Long Sequences" ☆70 · Updated last year
- ☆21 · Updated last year
- Toy genetic algorithm in PyTorch ☆34 · Updated last week
- Image-text multimodal model using Polyglot ☆11 · Updated last year
- Implementation of Agent Attention in PyTorch ☆90 · Updated 8 months ago
- An annotated implementation of the Hyena Hierarchy paper ☆32 · Updated last year
- ☆13 · Updated 2 years ago
- [ACL 2023] Gradient Ascent Post-training Enhances Language Model Generalization ☆29 · Updated 6 months ago
- Implementation of 🌻 Mirasol, SOTA multimodal autoregressive model out of Google DeepMind, in PyTorch ☆88 · Updated last year
- 🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX ☆82 · Updated 3 years ago