AlgonetLabs / CableLinks
Context-aware Biases for Length Extrapolation
☆22Updated 8 months ago
Alternatives and similar repositories for Cable
Users that are interested in Cable are comparing it to the libraries listed below
Sorting:
- A Comprehensive Survey on Knowledge Distillation☆60Updated last month
- [NeurIPS 2023] Code base for the Renyi Kernel Entropy (RKE) metric for generative models.☆13Updated 7 months ago
- Awesome list of papers that extend Mamba to various applications.☆138Updated 7 months ago
- Notes on the Mamba and the S4 model (Mamba: Linear-Time Sequence Modeling with Selective State Spaces)☆179Updated 2 years ago
- KD-LoRA: A Hybrid Approach to Efficient Fine-Tuning with LoRA and Knowledge Distillation☆21Updated last year
- [ICML 2024] VQDNA: Unleashing the Power of Vector Quantization for Multi-Species Genomic Sequence Modeling☆10Updated last year
- Official PyTorch Implementation of "The Hidden Attention of Mamba Models"☆231Updated 3 months ago
- [NeurIPS 2024] BEACON: Benchmark for Comprehensive RNA Tasks and Language Models☆59Updated last year
- ☆19Updated last month
- Official PyTorch implementation of Which Tokens to Use? Investigating Token Reduction in Vision Transformers presented at ICCV 2023 NIVT …☆35Updated 2 years ago
- Official implementation of 'P$^2$OT: Progressive Partial Optimal Transport for Deep Imbalanced Clustering'. (Accepted by ICLR 2024)☆17Updated 2 years ago
- State Space Models☆72Updated last year
- [ICCV23] Robust Mixture-of-Expert Training for Convolutional Neural Networks by Yihua Zhang, Ruisi Cai, Tianlong Chen, Guanhua Zhang, Hua…☆67Updated 2 years ago
- This is the open-source code for TokenCarve.☆23Updated 2 weeks ago
- Official PyTorch implementation of Agglomerative Token Clustering presented at ECCV 2024☆19Updated last year
- ☆39Updated 9 months ago
- Unlock the potential of latent diffusion models with MNIST! 🚀 Dive into reconstructing and generating digits using cutting-edge techniqu…☆16Updated last year
- Reading list for research topics in state-space models☆344Updated 7 months ago
- The official implementation for MTLoRA: A Low-Rank Adaptation Approach for Efficient Multi-Task Learning (CVPR '24)☆69Updated 7 months ago
- The official implementation of "2024NeurIPS Dynamic Tuning Towards Parameter and Inference Efficiency for ViT Adaptation"☆52Updated last year
- 【NeurIPS 2024】Official implementation of "Visual Fourier Prompt Tuning"☆39Updated last year
- Introduction to Bioinformatics Course Slides and Material, Computer Engineering Department, Sharif University of Technology☆33Updated last month
- ☆79Updated last year
- Graph-Mamba: Towards Long-Range Graph Sequence Modelling with Selective State Spaces☆335Updated 2 years ago
- Mamba in Vision: A Comprehensive Survey of Techniques and Applications☆133Updated last year
- Learning to Estimate Shapley Values with Vision Transformers☆37Updated last year
- [Preprint] Graph State Space Convolution (GSSC)☆14Updated last year
- ☆80Updated 11 months ago
- A token pruning method that accelerates ViTs for various tasks while maintaining high performance.☆28Updated 6 months ago
- Minimal Mamba-2 implementation in PyTorch☆242Updated last year