from MHA, MQA, GQA to MLA by 苏剑林, with code
☆45Feb 19, 2025Updated last year
Alternatives and similar repositories for MLA_tutorial
Users that are interested in MLA_tutorial are comparing it to the libraries listed below
Sorting:
- ☆14Apr 19, 2024Updated last year
- StaRD: Statute Retrieval Dataset based on Real-World Legal Consultation☆20Apr 24, 2025Updated 10 months ago
- Math24o: 高中奥林匹克数学竞赛测评集 High School Olympiad Mathematics Chinese Benchmark☆11Mar 27, 2025Updated 11 months ago
- Implement Flash Attention using Cute.☆102Dec 17, 2024Updated last year
- Official repository for the paper Local Linear Attention: An Optimal Interpolation of Linear and Softmax Attention For Test-Time Regressi…☆23Oct 1, 2025Updated 5 months ago
- The official code for NAACL 2024 paper: $E^5$: Zero-shot Hierarchical Table Analysis using Augmented LLMs via Explain, Extract, Execute, …☆14Jun 23, 2024Updated last year
- ☆30Aug 25, 2023Updated 2 years ago
- Rank-DistiLLM: Closing the Effectiveness Gap Between Cross-Encoders and LLMs for Passage Re-Ranking☆25Apr 4, 2025Updated 11 months ago
- Implement custom operators in PyTorch with cuda/c++☆76Jan 1, 2023Updated 3 years ago
- ☆54Jan 5, 2026Updated 2 months ago
- Evaluate how vLLM and SGLang perform when running a small LLM model on a mid-range NVIDIA GPU☆20Mar 15, 2026Updated last week
- Modeling methods of System Dynamics – Supply Chain Simulation using the Anylogic software☆10Jan 8, 2026Updated 2 months ago
- [ECCV24] The official code repository for paper "Training-Free Model Merging for Multi-target Domain Adaptation".☆17Sep 27, 2024Updated last year
- ASR project with pytorch-lightning☆20Mar 21, 2025Updated last year
- Transfer Learning of Graph Neural Networks with Ego-graph Information Maximization (NeurIPS 21')☆23Dec 9, 2021Updated 4 years ago
- Library for epidemics on hypergraphs☆12May 13, 2024Updated last year
- Cross Domain Structure Preserving Projection for Heterogeneous Domain Adaptation☆16Dec 15, 2022Updated 3 years ago
- Topic taxonomy completion with hierarchical discovery of novel topic clusters☆24Mar 7, 2022Updated 4 years ago
- CenterPoint model trained with MMDetection3d on custom dataset, and deployed with TensorRT☆34Mar 15, 2023Updated 3 years ago
- Cross Visual Prompt Tuning [ICCV 2025]☆13Aug 3, 2025Updated 7 months ago
- Contextual Position Encoding but with some custom CUDA Kernels https://arxiv.org/abs/2405.18719☆22Jun 5, 2024Updated last year
- The official github repo for the open online courses: "Dive into LLMs".☆10Mar 15, 2024Updated 2 years ago
- python igraph tutorial☆11Nov 23, 2023Updated 2 years ago
- ☆97Feb 11, 2026Updated last month
- [EMNLP 2024 Tutorial] Language Agents: Foundations, Prospects, and Risks☆10Nov 27, 2024Updated last year
- A tool to visualize convolutional layer activations on an input image.☆17Oct 23, 2019Updated 6 years ago
- The repo for paper: Exploiting the Index Gradients for Optimization-Based Jailbreaking on Large Language Models.☆14Dec 16, 2024Updated last year
- [CVPR 2026] Layer-wise Scale Alignment for Training-Free Streaming 4D Reconstruction☆54Updated this week
- ☆42Apr 9, 2024Updated last year
- ☆13May 12, 2025Updated 10 months ago
- ☆17Feb 24, 2025Updated last year
- Empower your real estate decisions with our data-driven model, delivering precise rental predictions for landlords and comprehensive insi…☆12Apr 26, 2025Updated 10 months ago
- 百度人体分析Demo:人体关键点、人体属性、手势识别、人像分割、人流量统计、驾驶行为分析(邀测)、人流量统计动态版(邀测)☆15Nov 29, 2018Updated 7 years ago
- a fast and customizable CUDA int4 tensor core gemm☆15Aug 2, 2024Updated last year
- ☆12Sep 23, 2024Updated last year
- The official implement of paper S2-VER: Semi-Supervised Visual Emotion Recognition☆11Apr 28, 2024Updated last year
- ☆33Dec 10, 2025Updated 3 months ago
- **ASCM4ABSA** - Our code and proposed data for NLPCC 2022 paper titled "Aspect-specific Context Modeling for Aspect-based Sentiment Analy…☆12Mar 26, 2023Updated 2 years ago
- Course project for CS230. Implemented using PyTorch.☆16Dec 17, 2018Updated 7 years ago