Implementation of CoLA: Compute-Efficient Pre-Training of LLMs via Low-Rank Activation
☆26Feb 18, 2025Updated last year
Alternatives and similar repositories for CoLA
Users that are interested in CoLA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official repo for separable operator networks -- extreme-scale operator learning for parametric PDEs.☆39Nov 2, 2024Updated last year
- ☆20Feb 2, 2026Updated 3 months ago
- Official repo for ICCV 2025 paper "Is Less More? Exploring Token Condensation as Training-free Test-time Adaptation"☆18Sep 3, 2025Updated 8 months ago
- The official implementation of TinyTrain [ICML '24]☆27Jul 19, 2024Updated last year
- ☆25Oct 31, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆32Feb 28, 2025Updated last year
- My tests and experiments with some popular dl frameworks.☆17Sep 11, 2025Updated 8 months ago
- [EMNLP 2024] Quantize LLM to extremely low-bit, and finetune the quantized LLMs☆15Jul 18, 2024Updated last year
- Task Singular Vectors: Reducing Task Interference in Model Merging. Merge models avoiding task interference through separable models.☆54Dec 15, 2025Updated 4 months ago
- Sancho McCann's PhD Thesis Research Code☆25Oct 12, 2017Updated 8 years ago
- Implementation of LaViC (KDD 2025)☆12Jun 1, 2025Updated 11 months ago
- ☆13Dec 13, 2024Updated last year
- 北京大学 2024 秋季学期编译原理课程 Lab 代码、笔记、经验☆20Sep 12, 2025Updated 8 months ago
- The official source code for [2026 ICLR] "IR-Agent: Expert-Inspired LLM Agents for Structure Elucidation from Infrared Spectra"☆13Feb 25, 2026Updated 2 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- [EMNLP 25] An effective and interpretable weight-editing method for mitigating overly short reasoning in LLMs, and a mechanistic study un…☆18Dec 17, 2025Updated 4 months ago
- An innovative method expediting LLMs via streamlined semi-autoregressive generation and draft verification.☆28Apr 15, 2025Updated last year
- ☆14May 4, 2024Updated 2 years ago
- [ICML 2024] Sparse Model Inversion: Efficient Inversion of Vision Transformers with Less Hallucination☆14Apr 29, 2025Updated last year
- [NAACL 24 Oral] LoRETTA: Low-Rank Economic Tensor-Train Adaptation for Ultra-Low-Parameter Fine-Tuning of Large Language Models☆39Jan 9, 2025Updated last year
- Implementation of NAACL'25 "Empowering Retrieval-based Conversational Recommendation with Contrasting User Preferences"☆14Sep 9, 2025Updated 8 months ago
- MultiLabel classification of cow diseases by text and symptoms recognition (NER)