☆20Oct 13, 2024Updated last year
Alternatives and similar repositories for EigenAttn
Users that are interested in EigenAttn are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [TMLR] CoDeC: Communication-Efficient Decentralized Continual Learning☆12Apr 17, 2024Updated last year
- ☆34Mar 28, 2025Updated 11 months ago
- [Deep Unlearning-PyTorch] Class Forgetting as in paper "Deep Unlearning: Fast and Efficient Training-free Approach to Controlled Forgetti…☆15Jul 26, 2024Updated last year
- This repository contains bash scripts for launching, orchestrating, managing, and monitoring jobs on Purdue's RCAC clusters.☆22Dec 22, 2025Updated 3 months ago
- Official [AAAI] Code Repository for "Continual Learning with Scaled Gradient Projection".☆16Jun 28, 2023Updated 2 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- [ICML 2024] SPP: Sparsity-Preserved Parameter-Efficient Fine-Tuning for Large Language Models☆21May 28, 2024Updated last year
- Official Implementation of FastKV: Decoupling of Context Reduction and KV Cache Compression for Prefill-Decoding Acceleration☆30Nov 22, 2025Updated 4 months ago
- Source code of paper ''KVSharer: Efficient Inference via Layer-Wise Dissimilar KV Cache Sharing''☆31Oct 24, 2024Updated last year
- [ICLR 2025] Palu: Compressing KV-Cache with Low-Rank Projection☆155Feb 20, 2025Updated last year
- Codebase for Linguistic Collapse: Neural Collapse in (Large) Language Models [NeurIPS 2024] [arXiv:2405.17767]☆18Apr 14, 2025Updated 11 months ago
- From Accuracy to Robustness: A Study of Rule- and Model-based Verifiers in Mathematical Reasoning.☆25Oct 7, 2025Updated 5 months ago
- ☆21Oct 2, 2024Updated last year
- ☆19Feb 2, 2026Updated last month
- Einsum with einops style variable names☆18May 16, 2024Updated last year
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- ☆10Apr 16, 2024Updated last year
- Source code of paper: A Stronger Mixture of Low-Rank Experts for Fine-Tuning Foundation Models. (ICML 2025)☆37Apr 2, 2025Updated 11 months ago
- Preprint: Asymmetry in Low-Rank Adapters of Foundation Models☆38Feb 27, 2024Updated 2 years ago
- Derivative-free nonlinear global optimizer with python interface☆17Nov 11, 2019Updated 6 years ago
- [EMNLP 25] An effective and interpretable weight-editing method for mitigating overly short reasoning in LLMs, and a mechanistic study un…☆17Dec 17, 2025Updated 3 months ago
- ☆14May 4, 2024Updated last year
- [NeurIPS 2025] Think Silently, Think Fast: Dynamic Latent Compression of LLM Reasoning Chains☆84Jul 29, 2025Updated 7 months ago
- ☆15Nov 7, 2024Updated last year
- [ICML 2023] Decentralized SGD and Average-direction SAM are Asymptotically Equivalent☆19Dec 4, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Открытые связанные данные по социальной статистике России (демография, социология, экономика). Тексты и инфографика на их основе — на сай…☆15Dec 1, 2019Updated 6 years ago
- The official implementation of "Well Begun is Half Done: Low-resource Preference Alignment by Weak-to-Strong Decoding"☆22Jun 26, 2025Updated 9 months ago
- ☆18Nov 1, 2023Updated 2 years ago
- ☆11Sep 7, 2024Updated last year
- The repository contains the implementation of the paper "SwinMSP: A Shifted Windows Masked Spectral Pretraining Model for Hyperspectral I…☆12Aug 7, 2024Updated last year
- Code for "RSQ: Learning from Important Tokens Leads to Better Quantized LLMs"☆21Mar 17, 2026Updated last week
- [NeurIPS 2024] VeLoRA : Memory Efficient Training using Rank-1 Sub-Token Projections☆21Oct 15, 2024Updated last year
- The \Latex Template for the Master Degree Graduate Thesis writting in China University of Geosciences.☆10Jun 5, 2015Updated 10 years ago
- [ICLR 2022] Official Code Repository for "TRGP: TRUST REGION GRADIENT PROJECTION FOR CONTINUAL LEARNING"☆22Oct 5, 2022Updated 3 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- [ ICLR 2025 ] Making LLMs More Effective with Hierarchical Mixture of LoRA Experts☆28Oct 9, 2025Updated 5 months ago
- Container startup benchmark tool☆12Apr 10, 2023Updated 2 years ago
- [NeurIPS 2024] PyTorch code for the paper "Making Offline RL Online: Collaborative World Models for Offline Visual Reinforcement Learning…☆24Oct 24, 2025Updated 5 months ago
- Prune transformer layers☆74May 30, 2024Updated last year
- xKV: Cross-Layer SVD for KV-Cache Compression☆45Nov 30, 2025Updated 3 months ago
- TS-LLaVA: Constructing Visual Tokens through Thumbnail-and-Sampling for Training-Free Video Large Language Models☆19Jan 2, 2025Updated last year
- Supporting code for ReCEval paper☆31Sep 14, 2024Updated last year