HuyNguyen-hust / flash-attn-101
☆22Updated 7 months ago
Alternatives and similar repositories for flash-attn-101:
Users that are interested in flash-attn-101 are comparing it to the libraries listed below
- Pioneering in Vietnamese Multimodal Large Language Model☆46Updated 2 months ago
- ☆66Updated 11 months ago
- This is the official repository for Vista dataset - A Vietnamese multimodal dataset contains more than 700,000 samples of conversations a…☆25Updated 11 months ago
- PhoMT: A High-Quality and Large-Scale Benchmark Dataset for Vietnamese-English Machine Translation (EMNLP 2021)☆42Updated 8 months ago
- Vistral-V: Visual Instruction Tuning for Vistral - Vietnamese Large Vision-Language Model.☆22Updated 9 months ago
- Bud500: A Comprehensive Vietnamese ASR Dataset☆65Updated last year
- [CVPR 2025] h-Edit: Effective and Flexible Diffusion-Based Editing via Doob’s h-Transform☆41Updated 3 weeks ago
- Pre-training script for BART in JAX/Flax☆38Updated 2 years ago
- Distillation Contrastive Decoding: Improving LLMs Reasoning with Contrastive Decoding and Distillation☆35Updated last year
- ☆13Updated 2 years ago
- Machine Reading Comprehension special for the Vietnamese language☆40Updated 3 years ago
- Implementation of paper: ConvNet for the 2020s☆21Updated 2 years ago
- This is an open-source repository for constructing and researching fusion-style deep learning methods combined with pretrained vision mod…☆14Updated 3 months ago
- A block oriented training approach for inference time optimization.☆32Updated 7 months ago
- [ICLR 2025] CAMEx: Curvature-Aware Merging of Experts☆18Updated last month
- Text Query based Traffic Video Event Retrieval with Global-Local Fusion Embedding☆12Updated last year
- VNHSGE: Vietnamese High School Graduation Examination Dataset for Large Language Models☆25Updated last year
- Solution for Zalo AI Challenge 2022 - E2E Question Answering☆111Updated 2 years ago
- General template for most Pytorch projects☆34Updated last week
- Baseline for ZaloAI Challenge 2023 Elementary Math Solving☆69Updated last year
- ☆66Updated 2 years ago
- BARTpho: Pre-trained Sequence-to-Sequence Models for Vietnamese (INTERSPEECH 2022)☆103Updated 8 months ago
- Top 1 Quy Nhon AI Hackathon 2022 Challenge Smart Menu☆30Updated 2 years ago
- ☆46Updated last year
- LibMoE: A LIBRARY FOR COMPREHENSIVE BENCHMARKING MIXTURE OF EXPERTS IN LARGE LANGUAGE MODELS☆37Updated 2 months ago
- [WIP] Better (FP8) attention for Hopper☆27Updated last month
- ☆77Updated 10 months ago
- Synthetic Alphabet Dataset☆18Updated 3 weeks ago
- Implementation of Infini-Transformer in Pytorch☆110Updated 3 months ago
- Use LoRA technique to improve training Large Language Model☆12Updated last year