Implementation of CoLA: Compute-Efficient Pre-Training of LLMs via Low-Rank Activation
☆25Feb 18, 2025Updated last year
Alternatives and similar repositories for CoLA
Users that are interested in CoLA are comparing it to the libraries listed below
- ☆19Feb 2, 2026Updated last month
- ☆25Oct 31, 2024Updated last year
- The official implementation of TinyTrain [ICML '24]☆24Jul 19, 2024Updated last year
- ☆19Nov 6, 2023Updated 2 years ago
- FireQ: Fast INT4-FP8 Kernel and RoPE-aware Quantization for LLM Inference Acceleration☆20Jun 27, 2025Updated 8 months ago
- [NeurIPS 2021] Code for Unsupervised Learning of Compositional Energy Concepts☆62Sep 21, 2022Updated 3 years ago
- An innovative method expediting LLMs via streamlined semi-autoregressive generation and draft verification.☆26Apr 15, 2025Updated 10 months ago
- [AAAI26] LongLLaDA: Unlocking Long Context Capabilities in Diffusion LLMs☆53Dec 7, 2025Updated 3 months ago
- [CVPR 2020] A generative model with latent factors that are independent and localized.☆12Mar 27, 2025Updated 11 months ago
- This is the code for an agentic RAG method with a dynamic workflow.☆12Jan 22, 2026Updated last month
- MTalk-Bench: Evaluating Speech-to-Speech Models in Multi-Turn Dialogues via Arena-style and Rubrics Protocols☆17Nov 19, 2025Updated 3 months ago
- [CVPR 2025 Highlight] FIMA-Q: Post-Training Quantization for Vision Transformers by Fisher Information Matrix Approximation☆26Jun 16, 2025Updated 8 months ago
- Implementation of LaViC (KDD 2025)☆12Jun 1, 2025Updated 9 months ago
- TOD-Flow: Modeling the Structure of Task-Oriented Dialogues☆13Feb 7, 2024Updated 2 years ago
- ☆10Sep 29, 2024Updated last year
- R1-Searcher++: Incentivizing the Dynamic Knowledge Acquisition of LLMs via Reinforcement Learning☆74May 25, 2025Updated 9 months ago
- BuildStockQuery is a python library for querying datasets generated by ResStock™ and ComStock™.☆14Dec 30, 2025Updated 2 months ago
- A Video Quality Analysis Toolkit☆13May 16, 2025Updated 9 months ago
- Code for our paper: "Building A Coding Assistant via Retrieval-Augmented Language Models"☆10Nov 2, 2024Updated last year
- This repository provides the code for applying Contrastive Learning Penalty Loss (CLPL) and Mixture of Experts (MoE) to the BGE-M3 text e…☆11Dec 27, 2024Updated last year
- ☆11Apr 5, 2023Updated 2 years ago
- The code for "T-GRAG: A Dynamic GraphRAG Framework for Resolving Temporal Conflicts and Redundancy in Knowledge Retrieval"☆21Jul 30, 2025Updated 7 months ago
- ☆11Jun 12, 2024Updated last year
- ParamMute: Suppressing Knowledge-Critical FFNs for Faithful Retrieval-Augmented Generation☆57Feb 2, 2026Updated last month
- Information Extraction related tools and models☆10Mar 16, 2023Updated 2 years ago
- Sound Separation, Omni modal☆28Sep 15, 2025Updated 5 months ago
- [ICML 2025 Spotlight] RAPID: Long-Context Inference with Retrieval-Augmented Speculative Decoding☆19Mar 2, 2025Updated last year
- Implementation of IODINE model☆10Jun 7, 2019Updated 6 years ago
- PyTorch implementation of the SIESTA algorithm from our TMLR-2023 paper "SIESTA: Efficient Online Continual Learning with Sleep"☆13Oct 25, 2024Updated last year
- ☆10Nov 6, 2024Updated last year
- This is an anonymous repository for submitting our work to a conference☆14Dec 17, 2024Updated last year
- Lab code, notes, and tips for the Peking University Fall 2024 Compiler Principles course☆16Sep 12, 2025Updated 5 months ago
- Source code of our TNNLS paper "Boosting Convolutional Neural Networks with Middle Spectrum Grouped Convolution"☆12Apr 14, 2023Updated 2 years ago
- ☆10Feb 22, 2023Updated 3 years ago
- [ICML 2025] Official PyTorch implementation of "NegMerge: Sign-Consensual Weight Merging for Machine Unlearning"☆14Nov 25, 2025Updated 3 months ago
- Public code release for the paper "Reawakening knowledge: Anticipatory recovery from catastrophic interference via structured training"☆11Oct 27, 2025Updated 4 months ago
- ☆17Dec 23, 2025Updated 2 months ago
- The official source code for [2026 ICLR] "IR-Agent: Expert-Inspired LLM Agents for Structure Elucidation from Infrared Spectra"☆11Feb 25, 2026Updated last week
- ☆18Mar 23, 2025Updated 11 months ago