HuyNguyen-hust/flash-attn-101

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/HuyNguyen-hust/flash-attn-101)

HuyNguyen-hust / flash-attn-101

☆22

Alternatives and similar repositories for flash-attn-101

Users that are interested in flash-attn-101 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

viettmab / SA-DPM
View on GitHub
☆16Jan 28, 2024Updated 2 years ago
tranquyenbk173 / FCRE-via-MMI
View on GitHub
Preserving Generalization of Language Models in Few-shot Continual Relation Extraction (EMNLP2024)
☆17Nov 21, 2024Updated last year
ngocbh / trimkv
View on GitHub
[TrimKV] Cache What Lasts: Token Retention for Memory-Bounded KV Cache in LLMs - [DBTrimKV] Make Each Token Count: Towards Improving Lo…
☆15Updated this week
VietHoang1512 / PGA
View on GitHub
Enhancing Domain Adaptation through Prompt Gradient Alignment (NeurIPS 2024)
☆16Jun 16, 2024Updated 2 years ago
VinAIResearch / RecGPT
View on GitHub
RecGPT: Generative Pre-training for Text-based Recommendation (ACL 2024)
☆42Sep 22, 2024Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
nlp-uoregon / ullme
View on GitHub
☆20Apr 8, 2025Updated last year
baochi0212 / LaVy
View on GitHub
Pioneering in Vietnamese Multimodal Large Language Model
☆53Jan 23, 2025Updated last year
duykhuongnguyen / MAT-Steer
View on GitHub
☆21Aug 19, 2025Updated 11 months ago
tranquyenbk173 / BERT_ITE
View on GitHub
Official implementation of "From Implicit to Explicit Feedback: A deep neural network for modeling sequential behaviours and long-short t…
☆19Oct 16, 2025Updated 9 months ago
VinAIResearch / DiMSUM
View on GitHub
DiMSUM: Diffusion Mamba - A Scalable and Unified Spatial-Frequency Method for Image Generation (NeurIPS 2024)
☆47Feb 18, 2025Updated last year
MangoKiller / SimOAR_OAR
View on GitHub
☆11Nov 8, 2023Updated 2 years ago
VietHoang1512 / KPA
View on GitHub
Matching The Statements: A Simple and Accurate Model for Key Point Analysis (ArgMining | EMNLP 2021)
☆12Feb 11, 2022Updated 4 years ago
thanhlexyz / mfea-ii
View on GitHub
MFEA 2 (or MFEA-II). Multifactorial Evolutionary Optimization with Online Transfer Parameter Estimation in Python
☆41Dec 24, 2019Updated 6 years ago
dungkhmt / openerp-micro-service
View on GitHub
☆10Sep 28, 2025Updated 10 months ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
bangoc / IT4863
View on GitHub
Các tài nguyên chọn lọc cho học phần Tìm kiếm thông tin
☆13Sep 15, 2025Updated 10 months ago
HanGuo97 / hilt
View on GitHub
☆40Dec 14, 2025Updated 7 months ago
VinAIResearch / WaveDiff
View on GitHub
Official Pytorch Implementation of the paper: Wavelet Diffusion Models are fast and scalable Image Generators (CVPR'23)
☆441Jul 23, 2024Updated 2 years ago
trannguyenhan / tiki-data-analysis
View on GitHub
Streaming data of Tiki with Kafka and processing with Spark, visualize with Elasticsearch & Kibana.
☆11Aug 24, 2021Updated 4 years ago
leloykun / modded-nanogpt
View on GitHub
NanoGPT (124M) quality in 2.67B tokens
☆28Sep 17, 2025Updated 10 months ago
imoneoi / bf16_fused_adam
View on GitHub
BFloat16 Fused Adam Operator for PyTorch
☆20Nov 16, 2024Updated last year
trannguyenhan / traveltours
View on GitHub
Watch and book travel tours with Laravel and Vuejs
☆12Jan 18, 2024Updated 2 years ago
pplonski / nlp-apps-mercury
View on GitHub
☆14Feb 22, 2022Updated 4 years ago
smiles724 / MatchExplainer
View on GitHub
☆10Jun 14, 2025Updated last year
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
Confirm-Solutions / dreamy
View on GitHub
Fluent dreaming for language models
☆13Jul 22, 2024Updated 2 years ago
personalizedretrieval / xpert
View on GitHub
Code for XPERT algorithm from Personalized Retrieval over Millions of Items
☆13Sep 14, 2023Updated 2 years ago
SalehMomeni / KLDA
View on GitHub
code and resources for our paper "Achieving Joint Training Accuracy in Continual Learning" in AAAI2025
☆14Feb 25, 2025Updated last year
microsoft / TileFusion
View on GitHub
TileFusion is an experimental C++ macro kernel template library that elevates the abstraction level in CUDA C for tile processing.
☆115Jun 28, 2025Updated last year
yli1 / CLCL
View on GitHub
☆16Mar 14, 2020Updated 6 years ago
VietHoang1512 / khmer-nltk
View on GitHub
Khmer natural language processing toolkit
☆85Mar 17, 2026Updated 4 months ago
yuxwind / CBS
View on GitHub
Official Code of The Combinatorial Brain Surgeon: Pruning Weights That Cancel One Another in Neural Networks[ICML2022]
☆16Sep 20, 2022Updated 3 years ago
tspeterkim / mixed-precision-from-scratch
View on GitHub
Mixed precision training from scratch with Tensors and CUDA
☆30May 14, 2024Updated 2 years ago
BuddiesOfBudgie / docs
View on GitHub
Buddies of Budgie documentation, built with Docusaurus.
☆12Jun 6, 2026Updated last month
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
dame-cell / Triformer
View on GitHub
Transformers components but in Triton
☆34May 9, 2025Updated last year
tone-row / radix-primitives-cheatsheet
View on GitHub
Radix Primitives Cheatsheet
☆11Mar 11, 2022Updated 4 years ago
mybabysexy / ShopeeTotalSpent
View on GitHub
Show total orders and total spent on Shopee and Tiki (VN)
☆22Jan 21, 2020Updated 6 years ago
thelinhbkhn2014 / VnCoreNLP_Wrapper
View on GitHub
☆25Aug 28, 2024Updated last year
imvladikon / jupyter-notebook-viewer
View on GitHub
chrome extension for viewing Jupyter Notebooks in the browser without Jupyter Server
☆30Sep 25, 2025Updated 10 months ago
telexyz / vi
View on GitHub
Xây dựng tập dữ liệu 500GB (20% done) văn bản tiếng Việt để huấn luyện mô hình ngôn ngữ lớn
☆29Apr 7, 2023Updated 3 years ago
Zanette-Labs / speed-rl
View on GitHub
☆18Feb 2, 2026Updated 5 months ago