Implementation of CoLA: Compute-Efficient Pre-Training of LLMs via Low-Rank Activation
☆26Feb 18, 2025Updated last year
Alternatives and similar repositories for CoLA
Users that are interested in CoLA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆20Feb 2, 2026Updated 4 months ago
- Fast and memory-efficient exact attention☆21Updated this week
- Apply CP, Tucker, TT/TR, HT to compress neural networks. Train from scratch.☆17Nov 26, 2020Updated 5 years ago
- ☆25Oct 31, 2024Updated last year
- [EMNLP 2024] Quantize LLM to extremely low-bit, and finetune the quantized LLMs☆15Jul 18, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Task Singular Vectors: Reducing Task Interference in Model Merging. Merge models avoiding task interference through separable models.☆56Dec 15, 2025Updated 6 months ago
- Sancho McCann's PhD Thesis Research Code☆25Oct 12, 2017Updated 8 years ago
- ☆10Apr 16, 2024Updated 2 years ago
- So, I trained a Llama a 130M architecture I coded from ground up to build a small instruct model from scratch. Trained on FineWeb dataset…☆18Mar 26, 2025Updated last year
- Implementation of LaViC (KDD 2025)☆13Jun 1, 2025Updated last year
- ☆13Dec 13, 2024Updated last year
- [EMNLP 25] An effective and interpretable weight-editing method for mitigating overly short reasoning in LLMs, and a mechanistic study un…☆19Dec 17, 2025Updated 6 months ago
- An innovative method expediting LLMs via streamlined semi-autoregressive generation and draft verification.☆28Apr 15, 2025Updated last year
- This is anonymous repository for submitting our work to a conference☆14Dec 17, 2024Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Implementation of NAACL'25 "Empowering Retrieval-based Conversational Recommendation with Contrasting User Preferences"☆14Sep 9, 2025Updated 9 months ago
- ☆22Dec 23, 2024Updated last year
- (WWW'25 + Netflix) The first CRS that retrieves collaborative filtering knowledge with two-step context-aware reflection.☆21Sep 10, 2025Updated 9 months ago
- Codebase for ICML submission "DOGE: Domain Reweighting with Generalization Estimation"☆21Feb 29, 2024Updated 2 years ago
- ☆20Oct 13, 2024Updated last year
- [ICML 2024] SPP: Sparsity-Preserved Parameter-Efficient Fine-Tuning for Large Language Models☆22May 28, 2024Updated 2 years ago
- ☆11Apr 5, 2023Updated 3 years ago
- Codebase for adaptive continual memory☆15Aug 15, 2023Updated 2 years ago
- A Pytorch implementation of Collaborative Metric Learning (CML)☆11Oct 13, 2020Updated 5 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- [NeurIPS 2024] VeLoRA : Memory Efficient Training using Rank-1 Sub-Token Projections☆22Oct 15, 2024Updated last year
- Multi-dimensional analysis of orthogonal safety directions in LLM alignment☆22Jun 12, 2026Updated 2 weeks ago
- ☆12Nov 1, 2024Updated last year
- ☆26Nov 10, 2021Updated 4 years ago
- Variance Covariance Regularization☆14Jun 22, 2023Updated 3 years ago
- [ ICLR 2025 ] Making LLMs More Effective with Hierarchical Mixture of LoRA Experts☆32Oct 9, 2025Updated 8 months ago
- SMART introduces a novel test-time framework where Small Language Models (SLMs) reason step-by-step, and Large Language Models (LLMs) pro…☆12Jul 9, 2025Updated 11 months ago
- Source code for PECRS (EACL 2024)☆12Feb 3, 2024Updated 2 years ago
- [CVPR 2025] QuartDepth☆18Mar 24, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- The official source code for "Vision Language Model is NOT All You Need: Augmentation Strategies for Molecule Language Model".☆14Jul 23, 2024Updated last year
- ☆19Jan 3, 2025Updated last year
- ☆15Apr 6, 2026Updated 2 months ago
- A Continual Learning Library in PyTorch and JAX☆13Apr 18, 2023Updated 3 years ago
- ☆30Jun 20, 2025Updated last year
- The official source code for "Subgraph Federated Learning for Local Generalization (FedLoG)" at ICLR 2025 (Oral).☆17May 6, 2025Updated last year
- (ICML 2023) Feature learning in deep classifiers through Intermediate Neural Collapse: Accompanying code☆16Jul 27, 2023Updated 2 years ago