AwesomeSeq/Comba-triton

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/AwesomeSeq/Comba-triton)

AwesomeSeq / Comba-triton

☆47

Alternatives and similar repositories for Comba-triton

Users that are interested in Comba-triton are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

OpenSparseLLMs / Linearization
View on GitHub
☆71Jul 8, 2025Updated last year
OpenSparseLLMs / Linear-MoE
View on GitHub
☆139Jun 6, 2025Updated last year
OpenSparseLLMs / MoM
View on GitHub
☆139Feb 4, 2026Updated 5 months ago
fla-org / fla-zoo
View on GitHub
Flash-Linear-Attention models beyond language
☆21Aug 28, 2025Updated 11 months ago
automl / unlocking_state_tracking
View on GitHub
Expanding linear RNN state-transition matrix eigenvalues to include negatives improves state-tracking tasks and language modeling without…
☆22Mar 15, 2025Updated last year
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
AlirezaMorsali / MLP-Attention
View on GitHub
☆17Dec 19, 2024Updated last year
automl / DeltaProduct
View on GitHub
DeltaProduct is a new linear recurrent neural network architecture that uses products of generalized Householder matrices as state-transi…
☆15Oct 13, 2025Updated 9 months ago
syf1844803351 / FICLRec
View on GitHub
☆18Jun 12, 2025Updated last year
glassroom / heinsen_attention
View on GitHub
Reference implementation of "Softmax Attention with Constant Cost per Token" (Heinsen, 2024)
☆25Jun 6, 2024Updated 2 years ago
zhixuan-lin / forgetting-transformer
View on GitHub
[ICLR 2025 & COLM 2025] Official PyTorch implementation of the Forgetting Transformer and Adaptive Computation Pruning
☆150Feb 25, 2026Updated 5 months ago
Jellyfish042 / LongMamba
View on GitHub
Some preliminary explorations of Mamba's context scaling.
☆14Dec 18, 2024Updated last year
zjuml / Affirm
View on GitHub
☆16Jan 16, 2025Updated last year
jungokasai / T2R
View on GitHub
☆14Nov 20, 2022Updated 3 years ago
declare-lab / EFLA
View on GitHub
Official Repo for Error-Free Linear Attention is a Free Lunch: Exact Solution from Continuous-Time Dynamics
☆76Mar 26, 2026Updated 4 months ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
goombalab / raven
View on GitHub
☆78May 29, 2026Updated 2 months ago
bdusell / stack-attention
View on GitHub
Code for the paper "Stack Attention: Improving the Ability of Transformers to Model Hierarchical Patterns"
☆18Mar 15, 2024Updated 2 years ago
sudaada / FEARec
View on GitHub
☆32Apr 17, 2023Updated 3 years ago
00ffcc / chunkRWKV6
View on GitHub
continous batching and parallel acceleration for RWKV6
☆23Jun 28, 2024Updated 2 years ago
HazyResearch / zoology
View on GitHub
Understand and test language model architectures on synthetic tasks.
☆279Mar 22, 2026Updated 4 months ago
HazyResearch / prefix-linear-attention
View on GitHub
☆62Jul 9, 2024Updated 2 years ago
yikangshen / megablocks
View on GitHub
☆20May 30, 2024Updated 2 years ago
fla-org / flash-bidirectional-linear-attention
View on GitHub
Triton implement of bi-directional (non-causal) linear attention
☆79Mar 1, 2026Updated 4 months ago
chuanyang-Zheng / DAPE
View on GitHub
The this is the official implementation of "DAPE: Data-Adaptive Positional Encoding for Length Extrapolation"
☆41Oct 11, 2024Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
Doraemonzzz / hgru-pytorch
View on GitHub
☆29Jul 9, 2024Updated 2 years ago
GindaChen / FlexFlashAttention3
View on GitHub
FlexAttention w/ FlashAttention3 Support
☆27Oct 5, 2024Updated last year
mit-han-lab / flash-moba
View on GitHub
☆251Nov 19, 2025Updated 8 months ago
Doraemonzzz / nanoTransNormer
View on GitHub
☆11Oct 11, 2023Updated 2 years ago
da03 / criticize_text_generation
View on GitHub
A method for evaluating the high-level coherence of machine-generated texts. Identifies high-level coherence issues in transformer-based …
☆12Mar 18, 2023Updated 3 years ago
OpenNLPLab / LASP
View on GitHub
Linear Attention Sequence Parallelism (LASP)
☆87Jun 4, 2024Updated 2 years ago
zzzhr97 / SpecBench
View on GitHub
Code repository for the ICML 2026 paper "Reasoning over Boundaries: Enhancing Specification Alignment via Test-time Deliberation".
☆24Jun 14, 2026Updated last month
jenni-ai / T2FW
View on GitHub
Fine-Tuning Pre-trained Transformers into Decaying Fast Weights
☆20Oct 9, 2022Updated 3 years ago
magic-research / vector_quantization
View on GitHub
[NeurIPS 2024] Image Understanding Makes for A Good Tokenizer for Image Generation
☆21Dec 17, 2024Updated last year
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
OpenNLPLab / HGRN2
View on GitHub
HGRN2: Gated Linear RNNs with State Expansion
☆58Aug 20, 2024Updated last year
mcoavoux / mtg
View on GitHub
Statistical discontinuous constituent parsing
☆11Feb 15, 2018Updated 8 years ago
HanGuo97 / log-linear-attention
View on GitHub
☆284Jun 6, 2025Updated last year
OpenNLPLab / lightning-attention
View on GitHub
Lightning Attention-2: A Free Lunch for Handling Unlimited Sequence Lengths in Large Language Models
☆344Feb 23, 2025Updated last year
robert-lieck / RBN
View on GitHub
Recursive Bayesian Networks
☆11May 11, 2025Updated last year
Noahs-ARK / PaLM
View on GitHub
PyTorch implementation for PaLM: A Hybrid Parser and Language Model.
☆10Jan 7, 2020Updated 6 years ago
Doraemonzzz / Transformer-Evolution-Paper
View on GitHub
记录Transformer升级的论文笔记
☆19Jun 25, 2023Updated 3 years ago