Implementation of CoLA: Compute-Efficient Pre-Training of LLMs via Low-Rank Activation
☆25Feb 18, 2025Updated last year
Alternatives and similar repositories for CoLA
Users that are interested in CoLA are comparing it to the libraries listed below.
- ☆19Feb 2, 2026Updated last month
- Fast and memory-efficient exact attention☆19Mar 9, 2026Updated 3 weeks ago
- [NeurIPS 2021] Code for Unsupervised Learning of Compositional Energy Concepts☆62Sep 21, 2022Updated 3 years ago
- An implementation of Olshausen and Field (96) in PyTorch☆32Aug 16, 2020Updated 5 years ago
- Apply CP, Tucker, TT/TR, HT to compress neural networks. Train from scratch.☆17Nov 26, 2020Updated 5 years ago
- ☆25Oct 31, 2024Updated last year
- ☆29Feb 28, 2025Updated last year
- [EMNLP 2024] Quantize LLM to extremely low-bit, and finetune the quantized LLMs☆15Jul 18, 2024Updated last year
- Code release of "DeepOHeat: Operator Learning-based Ultra-fast Thermal Simulation in 3D-IC Design", DAC 2023. https://arxiv.org/pdf/2302.…☆32Sep 3, 2024Updated last year
- Task Singular Vectors: Reducing Task Interference in Model Merging. Merge models avoiding task interference through separable models.☆53Dec 15, 2025Updated 3 months ago
- ☆10Apr 16, 2024Updated last year
- So, I trained a 130M Llama architecture that I coded from the ground up to build a small instruct model from scratch. Trained on FineWeb dataset…☆17Mar 26, 2025Updated last year
- Implementation of LaViC (KDD 2025)☆12Jun 1, 2025Updated 9 months ago
- Code implementation for paper "On the Efficacy of Small Self-Supervised Contrastive Models without Distillation Signals".☆17Dec 15, 2021Updated 4 years ago
- Peking University Fall 2024 Compiler Principles course: lab code, notes, and experience☆17Sep 12, 2025Updated 6 months ago
- The official source code for [2026 ICLR] "IR-Agent: Expert-Inspired LLM Agents for Structure Elucidation from Infrared Spectra"☆11Feb 25, 2026Updated last month
- ☆19Dec 23, 2024Updated last year
- [EMNLP 25] An effective and interpretable weight-editing method for mitigating overly short reasoning in LLMs, and a mechanistic study un…☆17Dec 17, 2025Updated 3 months ago
- An innovative method expediting LLMs via streamlined semi-autoregressive generation and draft verification.☆28Apr 15, 2025Updated 11 months ago
- ☆38Aug 7, 2025Updated 7 months ago
- [ICML 2024] Sparse Model Inversion: Efficient Inversion of Vision Transformers with Less Hallucination☆14Apr 29, 2025Updated 11 months ago
- [NAACL 24 Oral] LoRETTA: Low-Rank Economic Tensor-Train Adaptation for Ultra-Low-Parameter Fine-Tuning of Large Language Models☆39Jan 9, 2025Updated last year
- ☆14Jun 18, 2024Updated last year
- Implementation of NAACL'25 "Empowering Retrieval-based Conversational Recommendation with Contrasting User Preferences"☆14Sep 9, 2025Updated 6 months ago
- (WWW'25 + Netflix) The first CRS that retrieves collaborative filtering knowledge with two-step context-aware reflection.☆21Sep 10, 2025Updated 6 months ago
- Codebase for ICML submission "DOGE: Domain Reweighting with Generalization Estimation"☆21Feb 29, 2024Updated 2 years ago
- ☆43Oct 15, 2025Updated 5 months ago
- ☆20Oct 13, 2024Updated last year
- The official source code for "Subgraph Federated Learning for Local Generalization (FedLoG)" at ICLR 2025 (Oral).☆15May 6, 2025Updated 10 months ago
- Codebase for adaptive continual memory☆14Aug 15, 2023Updated 2 years ago
- [NeurIPS 2024] VeLoRA : Memory Efficient Training using Rank-1 Sub-Token Projections☆21Oct 15, 2024Updated last year
- A Pytorch implementation of Collaborative Metric Learning (CML)☆11Oct 13, 2020Updated 5 years ago
- ☆12Nov 1, 2024Updated last year
- Variance Covariance Regularization☆14Jun 22, 2023Updated 2 years ago
- [ICLR 2025] Making LLMs More Effective with Hierarchical Mixture of LoRA Experts☆29Oct 9, 2025Updated 5 months ago
- Domain-specific language designed to streamline the development of high-performance GPU/CPU/Accelerators kernels☆49Feb 28, 2026Updated last month
- Source code for PECRS (EACL 2024)☆12Feb 3, 2024Updated 2 years ago
- [CVPR 2025] QuartDepth☆17Mar 24, 2025Updated last year
- The official source code for "Vision Language Model is NOT All You Need: Augmentation Strategies for Molecule Language Model".☆14Jul 23, 2024Updated last year