hpcgroup/loki

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/hpcgroup/loki)

hpcgroup / loki

Algorithms for approximate attention in LLMs

☆22

Alternatives and similar repositories for loki

Users that are interested in loki are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

hpcgroup / damselfly
View on GitHub
Damselfly Network Simulator
☆10Nov 19, 2020Updated 5 years ago
JonasGeiping / dataaugs
View on GitHub
☆18Oct 12, 2022Updated 3 years ago
JonasGeiping / fullbatchtraining
View on GitHub
Training vision models with full-batch gradient descent and regularization
☆40Feb 14, 2023Updated 3 years ago
prajwal1210 / Stance-Detection-in-Web-and-Social-Media
View on GitHub
☆32Oct 18, 2020Updated 5 years ago
mcleish7 / gemstone-scaling-laws
View on GitHub
Gemstones: A Model Suite for Multi-Faceted Scaling Laws (NeurIPS 2025)
☆35Sep 28, 2025Updated 9 months ago
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
aks2203 / easy-to-hard-data
View on GitHub
Pytorch Datasets for Easy-To-Hard
☆30Jan 9, 2025Updated last year
axonn-ai / axonn
View on GitHub
Parallel framework for training and fine-tuning deep neural networks
☆74Apr 28, 2026Updated 2 months ago
morse-benchmark / morse-500
View on GitHub
☆31May 21, 2026Updated last month
montehoover / DynaGuard
View on GitHub
Code for "DynaGuard: A Dynamic Guardrail Model With User-Defined Policies."
☆23Nov 3, 2025Updated 8 months ago
Guinan-Su / auto-merge-llm
View on GitHub
An official repository for GPTailor
☆18Jun 29, 2025Updated last year
hsouri / GDP
View on GitHub
Generating Potent Poisons and Backdoors from Scratch with Guided Diffusion
☆11Apr 1, 2024Updated 2 years ago
facebookresearch / scalable-curvature
View on GitHub
Code for Dayal Kalra's research internship on scalable curvature measures for neural networks.
☆29Feb 3, 2026Updated 5 months ago
parallelcodefoundry / HPC-Coder
View on GitHub
Scripts for fine-tuning an HPC Code LLM
☆17Jul 19, 2024Updated 2 years ago
YuxinWenRick / canary-in-a-coalmine
View on GitHub
☆33Nov 27, 2023Updated 2 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
at-aaims / forge
View on GitHub
☆16Apr 21, 2025Updated last year
bigai-nlco / CREAM
View on GitHub
[NeurIPS 2024] | An Efficient Recipe for Long Context Extension via Middle-Focused Positional Encoding
☆22Oct 10, 2024Updated last year
llnl / CallFlow
View on GitHub
Visualization tool for analyzing call trees and graphs
☆36Mar 15, 2023Updated 3 years ago
formll / resolving-scaling-law-discrepancies
View on GitHub
☆19Nov 4, 2025Updated 8 months ago
nikhilchandak / answer-matching
View on GitHub
Code for 'Answer Matching Outperforms Multiple Choice for Language Model Evaluation' paper
☆18Jul 4, 2025Updated last year
AminJun / ImageNet1KBoundingBoxes
View on GitHub
Pytorch ImageNet1k Loader with Bounding Boxes.
☆13Jan 23, 2022Updated 4 years ago
ahans30 / goldfish-loss
View on GitHub
[NeurIPS 2024] Goldfish Loss: Mitigating Memorization in Generative LLMs
☆98Nov 17, 2024Updated last year
goldblum / FeatureClustering
View on GitHub
☆13Jun 7, 2020Updated 6 years ago
abdelfattah-lab / xKV
View on GitHub
xKV: Cross-Layer SVD for KV-Cache Compression [ICML 2026]
☆53Jul 7, 2026Updated 2 weeks ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
model-similarity / lm-similarity
View on GitHub
☆21Feb 10, 2025Updated last year
goldblum / free-lunch
View on GitHub
Implementation of experiments from The No Free Lunch Theorem, Kolmogorov Complexity, and the Role of Inductive Biases in Machine Learning
☆17May 14, 2023Updated 3 years ago
arpitbansal297 / Certified_Watermarks
View on GitHub
☆16Jul 17, 2022Updated 4 years ago
webis-de / set-encoder
View on GitHub
Set-Encoder: Permutation-Invariant Inter-Passage Attention for Listwise Passage Re-Ranking with Cross-Encoders
☆19May 23, 2025Updated last year
ThisisBillhe / ZipCache
View on GitHub
[NeurIPS 2024] The official implementation of ZipCache: Accurate and Efficient KV Cache Quantization with Salient Token Identification
☆33Mar 30, 2025Updated last year
hamidkazemi22 / CLIPInversion
View on GitHub
What do we learn from inverting CLIP models?
☆58Mar 6, 2024Updated 2 years ago
Elvin-Yiming-Du / Memory-T1
View on GitHub
This respository is used for time reasoning task for mult-session dialogue system.
☆16Feb 7, 2026Updated 5 months ago
NVlabs / RocketKV
View on GitHub
[ICML 2025] RocketKV: Accelerating Long-Context LLM Inference via Two-Stage KV Cache Compression
☆49Aug 7, 2025Updated 11 months ago
yale-nlp / refdpo
View on GitHub
☆16Jul 23, 2024Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
rail-berkeley / tensorrt-openvla
View on GitHub
☆24Apr 30, 2025Updated last year
ml-research / deictic-segment-anything
View on GitHub
Segment Anything with Deictic Prompting
☆27May 13, 2025Updated last year
facebookresearch / rl-injector
View on GitHub
Official release of code for the paper RL is a hammer and LLMs are nails A simple RL approach to stronger prompt injection attacks
☆53May 6, 2026Updated 2 months ago
HendrikMuenster / flexBox
View on GitHub
FlexBox is a fexible MATLAB toolbox for finite dimensional convex variational problems in image processing and beyond.
☆15Apr 3, 2018Updated 8 years ago
HuyNguyen-hust / hopper-gemm-101
View on GitHub
☆13Dec 22, 2024Updated last year
mjalali / renyi-kernel-entropy
View on GitHub
[NeurIPS 2023] Code base for the Renyi Kernel Entropy (RKE) metric for generative models.
☆14Jun 18, 2025Updated last year
seal-rg / streaming
View on GitHub
Code for the paper Multi-Stream LLMs: Unblocking Language Models with Parallel Streams of Thoughts, Inputs and Outputs
☆63Jun 23, 2026Updated 3 weeks ago