Code and pretrained models for the paper: "MatMamba: A Matryoshka State Space Model"
☆62Nov 21, 2024Updated last year
Alternatives and similar repositories for MatMamba
Users that are interested in MatMamba are comparing it to the libraries listed below
Sorting:
- Starbucks: Improved Training for 2D Matryoshka Embeddings☆22Jun 30, 2025Updated 8 months ago
- [ECCV'24 Workshops Oral] DALDA: Data Augmentation Leveraging Diffusion Model and LLM with Adaptive Guidance Scaling☆31Feb 6, 2026Updated 3 weeks ago
- Code associated with the EMNLP 2024 Main paper: "Image, tell me your story!" Predicting the original meta-context of visual misinformatio…☆45Dec 6, 2025Updated 2 months ago
- The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.☆13Mar 30, 2024Updated last year
- Public repository for the ECCV 2024 paper "Train Till You Drop: Towards Stable and Robust Source-free Unsupervised 3D Domain Adaptation".☆26Aug 5, 2025Updated 6 months ago
- The official code of "PixelWorld: Towards Perceiving Everything as Pixels" [TMLR25]☆16Sep 12, 2025Updated 5 months ago
- Staged Training for Transformer Language Models☆33Mar 31, 2022Updated 3 years ago
- [ACCV 2024 ] Official code for "DeBiFormer: Vision Transformer with Deformable Agent Bi-level Routing Attention"☆32Jan 8, 2025Updated last year
- The first dense retrieval model that can be prompted like an LM☆90May 8, 2025Updated 9 months ago
- (AAAI'25) Training-and-pormpt Free General Painterly Image Harmonization Using image-wise attention sharing☆60Dec 17, 2024Updated last year
- Code repository for the paper - "Neural Priming for Sample-Efficient Adaptation"☆14Nov 13, 2023Updated 2 years ago
- [AAAI24] Learning Invariant Inter-pixel Correlations for Superpixel Generation☆14Mar 27, 2024Updated last year
- PathPiece tokenizer☆13Nov 10, 2024Updated last year
- XmodelLM☆38Nov 19, 2024Updated last year
- dinov2 features aligned with CLIP☆21Jul 9, 2024Updated last year
- Unit Scaling demo and experimentation code☆16Mar 12, 2024Updated last year
- [CCS 2024] "BadMerging: Backdoor Attacks Against Model Merging": official code implementation.☆35Aug 22, 2024Updated last year
- Official repository for PTAViT3D and PTAViT3DCA models for field boundaries detection using S2 and/or S1 imagery.☆39Sep 24, 2024Updated last year
- Model-Based Image Inpainting☆17Sep 10, 2024Updated last year
- Official repository of the IEEE SLT 2024 paper "Self-Supervised Syllable Discovery Based on Speaker-Disentangled HuBERT"☆45Feb 9, 2026Updated 3 weeks ago
- PyTorch Implementation for the paper "C3VQG: Category Consistent Cyclic Visual Question Generation" (ACM MM Asia'20).☆17Mar 31, 2023Updated 2 years ago
- Noise-robust de-duplication at scale☆19Apr 9, 2023Updated 2 years ago
- Code for "Merging Text Transformers from Different Initializations"☆20Feb 2, 2025Updated last year
- Official PyTorch implementation of "Generalized Consistency Trajectory Models for Image Manipulation"☆44Mar 31, 2024Updated last year
- [NeurIPS 2024] Token Merging for Training-Free Semantic Binding in Text-to-Image Synthesis☆86Feb 3, 2025Updated last year
- Super-fast BART (Bayesian Additive Regression Trees) in Python☆82Feb 20, 2026Updated last week
- Speed up Transformers With Spectrum-Preserving Token Merging☆52Feb 9, 2025Updated last year
- ✂️ Sentence segmentation with wtpsplit's state-of-the-art Segment any Text (SaT) models☆36Oct 1, 2025Updated 5 months ago
- Code for "Are “Hierarchical” Visual Representations Hierarchical?" in NeurIPS Workshop for Symmetry and Geometry in Neural Representation…☆22Nov 8, 2023Updated 2 years ago
- Code repository for the paper "MrT5: Dynamic Token Merging for Efficient Byte-level Language Models."☆54Sep 25, 2025Updated 5 months ago
- [IEEE Trans. AI 2024] Spiking Diffusion Models☆50Apr 13, 2025Updated 10 months ago
- [ICLR 2025] IterComp: Iterative Composition-Aware Feedback Learning from Model Gallery for Text-to-Image Generation☆203Feb 19, 2025Updated last year
- PhysMamba: Efficient Remote Physiological Measurement with SlowFast Temporal Difference Mamba☆58Nov 14, 2024Updated last year
- [ICLR 2025] Official Pytorch Implementation of "Mix-LN: Unleashing the Power of Deeper Layers by Combining Pre-LN and Post-LN" by Pengxia…☆29Jul 24, 2025Updated 7 months ago
- (CVPR 2025) Official implementation to DELT: A Simple Diversity-driven EarlyLate Training for Dataset Distillation which outperforms SOTA…☆26Aug 23, 2025Updated 6 months ago
- [TMLR'24] This repository includes the official implementation our paper "FedConv: Enhancing Convolutional Neural Networks for Handling D…☆25Apr 30, 2024Updated last year
- ☆41May 15, 2025Updated 9 months ago
- [EMNLP'2025] "EasyRec: Simple yet Effective Language Model for Recommendation"☆138Nov 3, 2025Updated 3 months ago
- Open-source Python toolkit focused on deep learning with ordinal methodologies☆68Dec 19, 2025Updated 2 months ago