catie-aq/flashT5

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/catie-aq/flashT5)

catie-aq / flashT5

A fast implementation of T5/UL2 in PyTorch using Flash Attention

☆116

Alternatives and similar repositories for flashT5

Users that are interested in flashT5 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Knowledgator / TurboT5
View on GitHub
Truly flash T5 realization!
☆77Jan 26, 2026Updated 5 months ago
EleutherAI / improved-t5
View on GitHub
Experiments for efforts to train a new and improved t5
☆76Apr 15, 2024Updated 2 years ago
Knowledgator / FlashDeBERTa
View on GitHub
Trully flash implementation of DeBERTa disentangled attention mechanism.
☆90Feb 10, 2026Updated 5 months ago
haileyschoelkopf / triton-index
View on GitHub
See https://github.com/cuda-mode/triton-index/ instead!
☆11May 8, 2024Updated 2 years ago
lucidrains / memory-editable-transformer
View on GitHub
My explorations into editing the knowledge and memories of an attention network
☆35Dec 8, 2022Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
davisyoshida / jax-gptq
View on GitHub
JAX implementation of GPTQ quantization algorithm
☆10Jul 19, 2023Updated 2 years ago
lbox-kr / kbl
View on GitHub
Korean Benchmark for Korean Legal Language Understanding
☆19Nov 16, 2024Updated last year
PiotrNawrot / nanoT5
View on GitHub
Fast & Simple repository for pre-training and fine-tuning T5-style models
☆1,022Aug 21, 2024Updated last year
proger / accelerated-scan
View on GitHub
Accelerated First Order Parallel Associative Scan
☆198Jan 7, 2026Updated 6 months ago
zbambergerNLP / principled-pre-training
View on GitHub
A repository to get acquainted with basic training tasks in natural language processing and machine learning
☆11Dec 27, 2023Updated 2 years ago
HazyResearch / lolcats
View on GitHub
Repo for "LoLCATs: On Low-Rank Linearizing of Large Language Models"
☆260Jan 31, 2025Updated last year
theblackcat102 / unify-learning-paradigms
View on GitHub
data collator for UL2 and U-PaLM
☆29Aug 20, 2023Updated 2 years ago
sophiaalthammer / parm
View on GitHub
This repository contains the code for the paper 'PARM: Paragraph Aggregation Retrieval Model for Dense Document-to-Document Retrieval' pu…
☆41Jan 5, 2022Updated 4 years ago
AlexWan0 / infini-gram
View on GitHub
An unofficial implementation of the Infini-gram model proposed by Liu et al. (2024)
☆33Jun 19, 2024Updated 2 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
castorini / hf-spacerini
View on GitHub
Plug-and-play Search Interfaces with Pyserini and Hugging Face
☆31Aug 5, 2023Updated 2 years ago
google-research / t5x_retrieval
View on GitHub
☆102Dec 17, 2022Updated 3 years ago
mikex86 / tritonc
View on GitHub
Standalone commandline CLI tool for compiling Triton kernels
☆20Sep 13, 2024Updated last year
Elbria / xformal-FoST
View on GitHub
Code and data for the NAACL 2021 paper: "XFORMAL: A Benchmark for Multilingual Formality Style Transfer"
☆12Jun 7, 2021Updated 5 years ago
emory-irlab / pyterrier_genrank
View on GitHub
Generative Reranker PyTerrier
☆18Dec 1, 2025Updated 7 months ago
HansiZeng / scaling-retriever
View on GitHub
[SIGIR 2025] The official repo for "Scaling Sparse and Dense Retrieval in Decoder-Only LLMs"
☆22Mar 31, 2025Updated last year
OpenGreekAndLatin / Latin
View on GitHub
XML files for Latin texts that are not found in existing repos
☆10Feb 25, 2025Updated last year
qdrant / quaterion-models
View on GitHub
The collection of bulding blocks building fine-tunable metric learning models
☆35Jul 6, 2026Updated last week
Knowledgator / GLiClass
View on GitHub
Generalist and Lightweight Model for Text Classification
☆231Updated this week
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
huggingface / olm-training
View on GitHub
Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any hugging face text dataset.
☆98Feb 9, 2023Updated 3 years ago
huggingface / leaderboards
View on GitHub
☆23May 26, 2026Updated last month
MinishLab / tokenlearn
View on GitHub
Pre-train Static Word Embeddings
☆107Jun 9, 2026Updated last month
ielab / Starbucks
View on GitHub
Starbucks: Improved Training for 2D Matryoshka Embeddings
☆25Jun 30, 2025Updated last year
HansiZeng / CL-DRD
View on GitHub
[SIGIR 2022] The official repo for the paper "Curriculum Learning for Dense Retrieval Distillation".
☆23Apr 29, 2022Updated 4 years ago
jquesnelle / ctranslate2-rs
View on GitHub
Rust bindings for CTranslate2
☆14Jun 21, 2023Updated 3 years ago
Escavador / vespa-php
View on GitHub
PHP low-level client for Vespa. https://vespa.ai/
☆17Jan 22, 2026Updated 5 months ago
neuralmind-ai / information-extraction-t5
View on GitHub
☆12Apr 29, 2022Updated 4 years ago
owos / flexitokens
View on GitHub
FlexiTokens
☆23Dec 27, 2025Updated 6 months ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
Artur-Galstyan / statedict2pytree
View on GitHub
☆47Feb 26, 2026Updated 4 months ago
mmxgn / miniepy
View on GitHub
Open Information Extraction system - Python wrapper
☆22Oct 7, 2021Updated 4 years ago
alon-albalak / FLAD
View on GitHub
Few-shot Learning with Auxiliary Data
☆31Dec 8, 2023Updated 2 years ago
machelreid / m2d2
View on GitHub
M2D2: A Massively Multi-domain Language Modeling Dataset (EMNLP 2022) by Machel Reid, Victor Zhong, Suchin Gururangan, Luke Zettlemoyer
☆54Nov 21, 2022Updated 3 years ago
sustcsonglin / mamba-triton
View on GitHub
☆52Jan 28, 2024Updated 2 years ago
CHARM-Tx / linear_mem_attention_pytorch
View on GitHub
Unofficially Implements https://arxiv.org/abs/2112.05682 to get Linear Memory Cost on Attention for PyTorch
☆12Jan 16, 2022Updated 4 years ago
proger / hippogriff
View on GitHub
Griffin MQA + Hawk Linear RNN Hybrid
☆89Apr 13, 2026Updated 3 months ago