HuyNguyen-hust / flash-attn-101
☆21 Updated 9 months ago
Alternatives and similar repositories for flash-attn-101
Users who are interested in flash-attn-101 are comparing it to the libraries listed below.
- Pioneering in Vietnamese Multimodal Large Language Model ☆47 Updated 4 months ago
- ☆70 Updated last year
- [ICLR 2025] CAMEx: Curvature-Aware Merging of Experts ☆20 Updated 3 months ago
- This is the official repository for the Vista dataset, a Vietnamese multimodal dataset containing more than 700,000 samples of conversations a… ☆26 Updated last year
- VNHSGE: Vietnamese High School Graduation Examination Dataset for Large Language Models ☆26 Updated last year
- Vistral-V: Visual Instruction Tuning for Vistral - Vietnamese Large Vision-Language Model. ☆22 Updated 11 months ago
- Bud500: A Comprehensive Vietnamese ASR Dataset ☆66 Updated last year
- PhoMT: A High-Quality and Large-Scale Benchmark Dataset for Vietnamese-English Machine Translation (EMNLP 2021) ☆43 Updated this week
- ☆14 Updated 2 years ago
- This is an open-source repository for constructing and researching fusion-style deep learning methods combined with pretrained vision mod… ☆14 Updated 5 months ago
- Implementation of Infini-Transformer in Pytorch ☆111 Updated 5 months ago
- Baseline achieving 0.8 accuracy on the private test set in the ZaloAI Challenge 2023 Elementary Math Solving ☆24 Updated last year
- Machine Reading Comprehension specifically for the Vietnamese language ☆40 Updated 3 years ago
- Pre-training script for BART in JAX/Flax ☆38 Updated 2 years ago
- ☆16 Updated last year
- Top 1 Quy Nhon AI Hackathon 2022 Challenge Smart Menu ☆30 Updated 2 years ago
- Baseline for ZaloAI Challenge 2023 Elementary Math Solving ☆70 Updated last year
- Distillation Contrastive Decoding: Improving LLMs Reasoning with Contrastive Decoding and Distillation ☆35 Updated last year
- Text Query based Traffic Video Event Retrieval with Global-Local Fusion Embedding ☆12 Updated last year
- Solution for Zalo AI Challenge 2022 - E2E Question Answering ☆111 Updated 2 years ago
- ViDeBERTa: A powerful pre-trained language model for Vietnamese, EACL 2023 ☆56 Updated last year
- Mobile Viewer for W&B, built on top of Flutter. ☆34 Updated last year
- Use the LoRA technique to improve training of Large Language Models ☆12 Updated last year
- Implementations of attention with the softpick function, naive and FlashAttention-2 ☆76 Updated last month
- ☆18 Updated 2 years ago
- ☆46 Updated 2 years ago
- ☆70 Updated 2 years ago
- Source code for Zalo AI 2021 submission ☆140 Updated 3 years ago
- Top 2 Solution for Zalo AI Challenge 2022 - Liveness Detection track ☆43 Updated 2 years ago
- Load compute kernels from the Hub ☆144 Updated this week