Adamdad / rational_kat_cu
☆37 · Updated last month
Related projects
Alternatives and complementary repositories for rational_kat_cu
- [ICML 2024 Oral] This project is the official implementation of our Accurate LoRA-Finetuning Quantization of LLMs via Information Retenti… ☆58 · Updated 6 months ago
- A repository for DenseSSMs ☆88 · Updated 7 months ago
- [ICLR 2024 Spotlight] This is the official PyTorch implementation of "EfficientDM: Efficient Quantization-Aware Fine-Tuning of Low-Bit Di… ☆50 · Updated 5 months ago
- Curated list of methods that focus on improving the efficiency of diffusion models ☆29 · Updated 4 months ago
- Official implementation of "Hydra: Bidirectional State Space Models Through Generalized Matrix Mixers" ☆102 · Updated 3 months ago
- Code for paper "Unsegment Anything by Simulating Deformation" (CVPR 2024) ☆22 · Updated 5 months ago
- Code for NOLA, an implementation of "NOLA: Compressing LoRA using Linear Combination of Random Basis" ☆48 · Updated 2 months ago
- State Space Models ☆62 · Updated 6 months ago
- [ICLR 2024] Improving Convergence and Generalization Using Parameter Symmetries ☆28 · Updated 5 months ago
- My implementation of the original transformer model (Vaswani et al.). I've additionally included the playground.py file for visualizing o… ☆41 · Updated 11 months ago
- [ICML 2024] When Linear Attention Meets Autoregressive Decoding: Towards More Effective and Efficient Linearized Large Language Models ☆25 · Updated 5 months ago
- [NeurIPS 2024] Learning-to-Cache: Accelerating Diffusion Transformer via Layer Caching ☆71 · Updated 3 months ago
- [ECCV 2024] Official pytorch implementation of "Switch Diffusion Transformer: Synergizing Denoising Tasks with Sparse Mixture-of-Experts" ☆32 · Updated 4 months ago
- Official implementation of "Parameter-Efficient Orthogonal Finetuning via Butterfly Factorization" ☆74 · Updated 7 months ago
- More dimensions = More fun ☆21 · Updated 3 months ago
- [ECCV 2024] Isomorphic Pruning for Vision Models ☆51 · Updated 3 months ago
- A More Fair and Comprehensive Comparison between KAN and MLP ☆148 · Updated 2 months ago
- PyTorch implementation of "From Sparse to Soft Mixtures of Experts" ☆44 · Updated last year
- The codebase of our paper "Improving the Training of Rectified Flows" ☆80 · Updated 3 weeks ago
- GIFT: Generative Interpretable Fine-Tuning ☆18 · Updated last month
- HGRN2: Gated Linear RNNs with State Expansion ☆49 · Updated 2 months ago
- Official code for the paper "Attention as a Hypernetwork" ☆23 · Updated 4 months ago
- A curated list of Model Merging methods. ☆82 · Updated last month
- [ICML'24 Oral] APT: Adaptive Pruning and Tuning Pretrained Language Models for Efficient Training and Inference ☆28 · Updated 5 months ago
- Awesome list of papers that extend Mamba to various applications. ☆127 · Updated last month
- Official code for the paper "Image generation with shortest path diffusion" accepted at ICML 2023. ☆21 · Updated last year
- Towards Meta-Pruning via Optimal Transport, ICLR 2024 (Spotlight) ☆12 · Updated 7 months ago
- A Triton Kernel for incorporating Bi-Directionality in Mamba2 ☆47 · Updated 2 months ago
- ☆22 · Updated last year
- ☆52 · Updated last year