epfml/pam

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/epfml/pam)

epfml / pam

☆16

Alternatives and similar repositories for pam

Users that are interested in pam are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

egochao / transformer_with_einsum
View on GitHub
Transformer from scratch with einsum method
☆11Jul 8, 2021Updated 5 years ago
adobe-research / beacon-aug
View on GitHub
Cross-library augmentation toolbox supporting 300 operators over 8 libraries + AI transforms
☆12Jan 11, 2022Updated 4 years ago
Liuhong99 / implicitbiasmlmcode
View on GitHub
☆13Mar 22, 2023Updated 3 years ago
GATECH-EIC / Linearized-LLM
View on GitHub
[ICML 2024] When Linear Attention Meets Autoregressive Decoding: Towards More Effective and Efficient Linearized Large Language Models
☆35Jun 12, 2024Updated 2 years ago
nimish15shah / DAG_Processor
View on GitHub
A DAG processor and compiler for a tree-based spatial datapath.
☆16Aug 24, 2022Updated 3 years ago
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
Cheliosoops / BitQ
View on GitHub
☆10Apr 24, 2024Updated 2 years ago
jenni-ai / T2FW
View on GitHub
Fine-Tuning Pre-trained Transformers into Decaying Fast Weights
☆20Oct 9, 2022Updated 3 years ago
trustworthy-machine-learning / trustworthy-machine-learning.github.io
View on GitHub
A School for All Seasons on Trustworthy Machine Learning
☆12Jun 30, 2021Updated 5 years ago
epfml / REQ
View on GitHub
☆19Jun 10, 2024Updated 2 years ago
zaydzuhri / flame
View on GitHub
Fork of Flame repo for training of some new stuff in development
☆20Updated this week
OpenGVLab / LLMPrune-BESA
View on GitHub
BESA is a differentiable weight pruning technique for large language models.
☆17Mar 4, 2024Updated 2 years ago
Zcchill / Value-Residual-Learning
View on GitHub
☆15Mar 20, 2025Updated last year
OpenNLPLab / ETSC-Exact-Toeplitz-to-SSM-Conversion
View on GitHub
[EMNLP 2023] Official implementation of the algorithm ETSC: Exact Toeplitz-to-SSM Conversion our EMNLP 2023 paper - Accelerating Toeplitz…
☆14Oct 17, 2023Updated 2 years ago
GATECH-EIC / ShiftAddViT
View on GitHub
[NeurIPS 2023] ShiftAddViT: Mixture of Multiplication Primitives Towards Efficient Vision Transformer
☆30Dec 6, 2023Updated 2 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
jungokasai / T2R
View on GitHub
☆14Nov 20, 2022Updated 3 years ago
MurtyShikhar / Pushdown-Layers
View on GitHub
Code for Pushdown Layers from our EMNLP 2023 paper
☆28Dec 3, 2023Updated 2 years ago
thu-nics / PM-KVQ
View on GitHub
The official code implementation for paper "PM-KVQ: Progressive Mixed-precision KV Cache Quantization for Long-CoT LLMs"
☆29May 24, 2025Updated last year
gmongaras / Cottention_Transformer
View on GitHub
Code for the paper "Cottention: Linear Transformers With Cosine Attention"
☆20Nov 15, 2025Updated 8 months ago
lwy2020 / MicroMix
View on GitHub
MicroMix: Efficient Mixed-Precision Quantization with Microscaling Formats for Large Language Models
☆27Apr 2, 2026Updated 3 months ago
oujieww / ANPD
View on GitHub
☆11Feb 5, 2026Updated 5 months ago
LINs-lab / cluster_tutorial
View on GitHub
☆17Mar 19, 2026Updated 4 months ago
socialfoundations / tttlm
View on GitHub
Test-time-training on nearest neighbors for large language models
☆50Apr 18, 2024Updated 2 years ago
Doraemonzzz / xmixers
View on GitHub
Xmixers: A collection of SOTA efficient token/channel mixers
☆28Sep 4, 2025Updated 10 months ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
annosubmission / GRC-Cache
View on GitHub
☆16Mar 13, 2023Updated 3 years ago
manantomar / video-occupancy-models
View on GitHub
☆13Jul 16, 2024Updated 2 years ago
maximzubkov / fft-scan
View on GitHub
Efficient PScan implementation in PyTorch
☆17Jan 2, 2024Updated 2 years ago
emalach / LinearLM
View on GitHub
Code for the paper: https://arxiv.org/pdf/2309.06979.pdf
☆21Jul 29, 2024Updated last year
BidyutSaha / TinyTNAS
View on GitHub
TinyTNAS is a hardware-aware, multi-objective, time-bound Neural Architecture Search (NAS) tool designed for TinyML time series classific…
☆22Dec 11, 2024Updated last year
Intelligent-Computing-Lab-Panda / TesseraQ
View on GitHub
☆25Oct 31, 2024Updated last year
luciferkonn / DT_Mem
View on GitHub
Thisi is the official code base for paper "Think Before You Act: Decision Transformers with Internal Working Memory"
☆23Jul 12, 2024Updated 2 years ago
kangmintong / R-2-Guard
View on GitHub
[ICLR 2025] Code implementation of R^2-Guard: Robust Reasoning Enabled LLM Guardrail via Knowledge-Enhanced Logical Reasoning
☆23Jul 8, 2024Updated 2 years ago
facebookresearch / loop_nest
View on GitHub
Loop Nest - Linear algebra compiler and code generator.
☆20Oct 22, 2022Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
fla-org / fla-zoo
View on GitHub
Flash-Linear-Attention models beyond language
☆21Aug 28, 2025Updated 10 months ago
deep-spin / adasplash
View on GitHub
AdaSplash: Adaptive Sparse Flash Attention (aka Flash Entmax Attention)
☆46May 20, 2026Updated 2 months ago
alanbacellar / DWN
View on GitHub
Differentiable Weightless Neural Networks
☆44Feb 2, 2026Updated 5 months ago
cunfate / coroutinecc
View on GitHub
基于ucontext实现的C++协程库
☆18Jul 27, 2018Updated 7 years ago
yoniLc / E2E_DC_ECCT
View on GitHub
Learning Linear Block Error Correction Codes
☆20May 8, 2024Updated 2 years ago
Ruichen0424 / AB-BNN
View on GitHub
[CVPR 2024] Official implementation for "A&B BNN: Add&Bit-Operation-Only Hardware-Friendly Binary Neural Network"
☆24Jul 9, 2026Updated last week
dangxingyu / rnn-icrag
View on GitHub
Official repository of paper "RNNs Are Not Transformers (Yet): The Key Bottleneck on In-context Retrieval"
☆27Apr 17, 2024Updated 2 years ago