myscience / retnet-pytorch

Implementation of Retention-Network in PyTorch

☆16

Alternatives and similar repositories for retnet-pytorch

Users who are interested in retnet-pytorch compare it to the libraries listed below.

pittisl / mPnP-LLM
Code for paper "Modality Plug-and-Play: Elastic Modality Adaptation in Multimodal LLMs for Embodied AI"
☆11 Updated last year
hyeon-jo / interchange-transfer-KD
☆9 Updated 2 years ago
GoodbyeKittyy / SIT-INF1003-Mathematics-1
Singapore Institute of Technology - INF1003 Mathematics 1
☆10 Updated last year
CEA-LIST / SCE
Implementation of "Similarity Contrastive Estimation for Self-Supervised Soft Contrastive Learning" WACV 2023.
☆25 Updated last year
4m4n5 / CLIP-Lite
Pytorch Implementation of CLIP-Lite | Accepted at AISTATS 2023
☆13 Updated 2 years ago
kyegomez / MAGVIT2
Open source community's implementation of the model from "LANGUAGE MODEL BEATS DIFFUSION — TOKENIZER IS KEY TO VISUAL GENERATION"
☆15 Updated 8 months ago
jaketae / realformer
PyTorch implementation of RealFormer: Transformer Likes Residual Attention
☆11 Updated 4 years ago
badripatro / mamba360
State Space Models
☆69 Updated last year
gedge-platform / gs-scheduler
scheduler for gedge-platform
☆11 Updated last year
Wang-ML-Lab / TSDA
[ICML 2023] Taxonomy-Structured Domain Adaptation
☆12 Updated last year
Jinec98 / MAE3D
[IEEE TMM] The official implementation of MAE3D
☆11 Updated last year
acceleratedscience / openad-toolkit
Open Accelerated Discovery Toolkit
☆12 Updated last month
sjunhongshen / ORCA
Official implementation of ORCA proposed in the paper "Cross-Modal Fine-Tuning: Align then Refine"
☆71 Updated last year
hangxu0304 / DeepReduce
A Sparse-tensor Communication Framework for Distributed Deep Learning
☆13 Updated 3 years ago
tianyi-lab / R2-T2
[ICML 2025] Code for "R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts"
☆15 Updated 4 months ago
abdelfattah-lab / attamba
☆14 Updated 7 months ago
towardsautonomy / DatasetEquity
☆13 Updated last year
CatworldLee / Gaussian-Mixture-Mask-Attention
☆9 Updated 8 months ago
calgaryml / condensed-sparsity
[ICLR 2024] Dynamic Sparse Training with Structured Sparsity
☆18 Updated last year
HITESHLPATEL / Mamba-Papers
Awesome Mamba Papers: A Curated Collection of Research Papers, Tutorials & Blogs
☆25 Updated last year
kyegomez / MambaFormer
Implementation of MambaFormer in Pytorch ++ Zeta from the paper: "Can Mamba Learn How to Learn? A Comparative Study on In-Context Learnin…
☆21 Updated 2 weeks ago
kyegomez / PaLM2-VAdapter
Implementation of "PaLM2-VAdapter:" from the multi-modal model paper: "PaLM2-VAdapter: Progressively Aligned Language Model Makes a Stron…
☆16 Updated 8 months ago
phython96 / ARGNP
Pytorch Implementation of Automatic Relation-aware Graph Network Proliferation (CVPR 2022 Oral)
☆26 Updated last year
WarlockWendell / AggDet
official implementation of Training-free Boost for Open-Vocabulary Object Detection with Confidence Aggregation
☆12 Updated last year
JoelNiklaus / loss_landscape
Code for visualizing the loss landscape of neural nets
☆10 Updated 4 years ago
paulilioaica / Differential-Transformer
☆17 Updated 9 months ago
leo-yangli / VB-LoRA
This repo contains the source code for VB-LoRA: Extreme Parameter Efficient Fine-Tuning with Vector Banks (NeurIPS 2024).
☆39 Updated 9 months ago
GistNoesis / FusedFourierKAN
C++ and Cuda ops for fused FourierKAN
☆80 Updated last year
AvivNavon / DWSNets
Official implementation for Equivariant Architectures for Learning in Deep Weight Spaces [ICML 2023]
☆89 Updated last year
bwconrad / soft-moe
PyTorch implementation of "From Sparse to Soft Mixtures of Experts"
☆58 Updated last year