junfanz1 / MiniGPT-and-DeepSeek-MLA-Multi-Head-Latent-Attention

An efficient and scalable attention module designed to reduce memory usage and improve inference speed in large language models. Implements Multi-Head Latent Attention (MLA) as a drop-in replacement for traditional multi-head attention (MHA).
21 · Jun 25, 2025 · Updated 7 months ago
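
Below is a minimal sketch of the core MLA idea described above: keys and values are compressed into a small shared latent that can be cached during inference, then up-projected before attention. This is an illustrative PyTorch example, not code from the repository; the class name, dimensions, and layer names are assumptions.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class MultiHeadLatentAttention(nn.Module):
    """Illustrative MLA block: low-rank KV compression + standard attention."""

    def __init__(self, d_model=512, n_heads=8, d_latent=128):
        super().__init__()
        self.n_heads = n_heads
        self.d_head = d_model // n_heads
        # Queries are projected per head, as in standard MHA.
        self.w_q = nn.Linear(d_model, d_model, bias=False)
        # Keys/values are first compressed into a small shared latent;
        # during inference this latent is what would be cached instead of full K/V.
        self.w_down_kv = nn.Linear(d_model, d_latent, bias=False)
        self.w_up_k = nn.Linear(d_latent, d_model, bias=False)
        self.w_up_v = nn.Linear(d_latent, d_model, bias=False)
        self.w_out = nn.Linear(d_model, d_model, bias=False)

    def forward(self, x, attn_mask=None):
        b, t, _ = x.shape
        q = self.w_q(x)
        latent_kv = self.w_down_kv(x)   # (b, t, d_latent): the compressed cacheable tensor
        k = self.w_up_k(latent_kv)      # reconstruct keys from the latent
        v = self.w_up_v(latent_kv)      # reconstruct values from the latent

        # Split into heads: (b, n_heads, t, d_head)
        def split(z):
            return z.view(b, t, self.n_heads, self.d_head).transpose(1, 2)

        out = F.scaled_dot_product_attention(split(q), split(k), split(v),
                                             attn_mask=attn_mask)
        out = out.transpose(1, 2).contiguous().view(b, t, -1)
        return self.w_out(out)

# Usage sketch: same call signature as a standard attention block,
# so it can slot into a transformer layer in place of MHA.
x = torch.randn(2, 16, 512)
y = MultiHeadLatentAttention()(x)
print(y.shape)  # torch.Size([2, 16, 512])
```

The memory saving comes from caching the `d_latent`-sized compression rather than full per-head keys and values; the exact projection scheme (including DeepSeek's decoupled RoPE handling) differs in the real implementation.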

Alternatives and similar repositories for MiniGPT-and-DeepSeek-MLA-Multi-Head-Latent-Attention

Users that are interested in MiniGPT-and-DeepSeek-MLA-Multi-Head-Latent-Attention are comparing it to the libraries listed below
