HuyNguyen-hust / flash-attn-101
☆22Updated 8 months ago
Alternatives and similar repositories for flash-attn-101
Users that are interested in flash-attn-101 are comparing it to the libraries listed below
Sorting:
- Pioneering in Vietnamese Multimodal Large Language Model☆47Updated 3 months ago
- This is the official repository for Vista dataset - A Vietnamese multimodal dataset contains more than 700,000 samples of conversations a…☆25Updated last year
- ☆69Updated last year
- Vistral-V: Visual Instruction Tuning for Vistral - Vietnamese Large Vision-Language Model.☆22Updated 10 months ago
- PhoMT: A High-Quality and Large-Scale Benchmark Dataset for Vietnamese-English Machine Translation (EMNLP 2021)☆43Updated 9 months ago
- ☆13Updated 2 years ago
- [CVPR 2025] h-Edit: Effective and Flexible Diffusion-Based Editing via Doob’s h-Transform☆45Updated last month
- Bud500: A Comprehensive Vietnamese ASR Dataset☆66Updated last year
- Memory-Efficient CUDA kernels for training ConvNets with PyTorch.☆40Updated 2 months ago
- This is an open-source repository for constructing and researching fusion-style deep learning methods combined with pretrained vision mod…☆15Updated 4 months ago
- [ICLR 2025] CAMEx: Curvature-Aware Merging of Experts☆19Updated 2 months ago
- ☆16Updated last year
- Distillation Contrastive Decoding: Improving LLMs Reasoning with Contrastive Decoding and Distillation☆35Updated last year
- Explorations into the recently proposed Taylor Series Linear Attention☆99Updated 8 months ago
- Implementations of attention with the softpick function, naive and FlashAttention-2☆61Updated 2 weeks ago
- Implementation of the proposed MaskBit from Bytedance AI☆76Updated 6 months ago
- Flash-Muon: An Efficient Implementation of Muon Optimizer☆103Updated last week
- VNHSGE: Vietnamese High School Graduation Examination Dataset for Large Language Models☆25Updated last year
- Tiny re-implementation of MDM in style of LLaDA and nano-gpt speedrun☆50Updated 2 months ago
- python scripts for crawling original image from Google Images☆22Updated 3 years ago
- Writing FLUX in Triton☆33Updated 7 months ago
- LibMoE: A LIBRARY FOR COMPREHENSIVE BENCHMARKING MIXTURE OF EXPERTS IN LARGE LANGUAGE MODELS☆37Updated 2 weeks ago
- Solution for Zalo AI Challenge 2022 - E2E Question Answering☆111Updated 2 years ago
- Top 1 Quy Nhon AI Hackathon 2022 Challenge Smart Menu☆30Updated 2 years ago
- Pre-training script for BART in JAX/Flax☆38Updated 2 years ago
- Just some miscellaneous utility functions / decorators / modules related to Pytorch and Accelerate to help speed up implementation of new…☆121Updated 9 months ago
- Load compute kernels from the Hub☆119Updated last week
- Implementation of Infini-Transformer in Pytorch☆110Updated 4 months ago
- 👨🏻💻 Code release for Vietnamese chatbot from scratch [Published in IEEE IMCOM 2022]☆17Updated 9 months ago
- Little article showing how to load pytorch's models with linear memory consumption☆34Updated 2 years ago