Implementation of CoLA: Compute-Efficient Pre-Training of LLMs via Low-Rank Activation
☆26Feb 18, 2025Updated last year
Alternatives and similar repositories for CoLA
Users that are interested in CoLA are comparing it to the libraries listed below.
- Official repo for separable operator networks -- extreme-scale operator learning for parametric PDEs.☆39Nov 2, 2024Updated last year
- [NeurIPS 2021] Code for Unsupervised Learning of Compositional Energy Concepts☆60Sep 21, 2022Updated 3 years ago
- ☆25Oct 31, 2024Updated last year
- My tests and experiments with some popular dl frameworks.☆17Sep 11, 2025Updated 8 months ago
- [EMNLP 2024] Quantize LLM to extremely low-bit, and finetune the quantized LLMs☆15Jul 18, 2024Updated last year
- Task Singular Vectors: Reducing Task Interference in Model Merging. Merge models avoiding task interference through separable models.☆55Dec 15, 2025Updated 5 months ago
- ☆19Nov 6, 2023Updated 2 years ago
- ☆10Apr 16, 2024Updated 2 years ago
- Code implementation for paper "On the Efficacy of Small Self-Supervised Contrastive Models without Distillation Signals".☆17Dec 15, 2021Updated 4 years ago
- ☆13Dec 13, 2024Updated last year
- The official source code for [2026 ICLR] "IR-Agent: Expert-Inspired LLM Agents for Structure Elucidation from Infrared Spectra"☆13Feb 25, 2026Updated 3 months ago
- PyTorch implementation of the SIESTA algorithm from our TMLR-2023 paper "SIESTA: Efficient Online Continual Learning with Sleep"☆13Oct 25, 2024Updated last year
- An innovative method expediting LLMs via streamlined semi-autoregressive generation and draft verification.☆28Apr 15, 2025Updated last year
- This is anonymous repository for submitting our work to a conference☆14Dec 17, 2024Updated last year
- ☆37Aug 7, 2025Updated 9 months ago
- [ICML 2024] Sparse Model Inversion: Efficient Inversion of Vision Transformers with Less Hallucination☆14Apr 29, 2025Updated last year
- [NAACL 24 Oral] LoRETTA: Low-Rank Economic Tensor-Train Adaptation for Ultra-Low-Parameter Fine-Tuning of Large Language Models☆39Jan 9, 2025Updated last year
- ☆22Dec 23, 2024Updated last year
- (WWW'25 + Netflix) The first CRS that retrieves collaborative filtering knowledge with two-step context-aware reflection.☆21Sep 10, 2025Updated 8 months ago
- Codebase for ICML submission "DOGE: Domain Reweighting with Generalization Estimation"☆21Feb 29, 2024Updated 2 years ago
- ☆45Oct 15, 2025Updated 7 months ago
- [ICML 2024] SPP: Sparsity-Preserved Parameter-Efficient Fine-Tuning for Large Language Models☆22May 28, 2024Updated 2 years ago
- ☆11Apr 5, 2023Updated 3 years ago
- A Pytorch implementation of Collaborative Metric Learning (CML)☆11Oct 13, 2020Updated 5 years ago
- [NeurIPS 2024] VeLoRA : Memory Efficient Training using Rank-1 Sub-Token Projections☆22Oct 15, 2024Updated last year
- Multi-dimensional analysis of orthogonal safety directions in LLM alignment☆22Mar 20, 2025Updated last year
- ☆12Nov 1, 2024Updated last year
- [CVPR 2025 Highlight] FIMA-Q: Post-Training Quantization for Vision Transformers by Fisher Information Matrix Approximation☆28Jun 16, 2025Updated 11 months ago
- ☆25Nov 10, 2021Updated 4 years ago
- Variance Covariance Regularization☆14Jun 22, 2023Updated 2 years ago
- Source code for PECRS (EACL 2024)☆12Feb 3, 2024Updated 2 years ago
- SMART introduces a novel test-time framework where Small Language Models (SLMs) reason step-by-step, and Large Language Models (LLMs) pro…☆12Jul 9, 2025Updated 10 months ago
- The Stream-51 dataset for streaming classification and novelty detection from videos.☆17Feb 22, 2022Updated 4 years ago
- Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels☆53Mar 31, 2026Updated 2 months ago
- [CVPR 2025] QuartDepth☆18Mar 24, 2025Updated last year
- ☆19Jan 3, 2025Updated last year
- ☆15Apr 6, 2026Updated last month
- ☆28Jun 20, 2025Updated 11 months ago
- A Continual Learning Library in PyTorch and JAX☆13Apr 18, 2023Updated 3 years ago