Code and pretrained models for the paper: "MatMamba: A Matryoshka State Space Model"
☆64Nov 21, 2024Updated last year
Alternatives and similar repositories for MatMamba
Users who are interested in MatMamba are comparing it to the libraries listed below.
- Starbucks: Improved Training for 2D Matryoshka Embeddings☆23Jun 30, 2025Updated 11 months ago
- Code repository for the paper - "Neural Priming for Sample-Efficient Adaptation"☆14Nov 13, 2023Updated 2 years ago
- The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.☆14Mar 30, 2024Updated 2 years ago
- Staged Training for Transformer Language Models☆33Mar 31, 2022Updated 4 years ago
- Code for "Are “Hierarchical” Visual Representations Hierarchical?" in NeurIPS Workshop for Symmetry and Geometry in Neural Representation…☆23Nov 8, 2023Updated 2 years ago
- The official code of "PixelWorld: Towards Perceiving Everything as Pixels" [TMLR25]☆16Sep 12, 2025Updated 8 months ago
- (AAAI'25) Training-and-prompt Free General Painterly Image Harmonization Using image-wise attention sharing☆61Dec 17, 2024Updated last year
- Minimal docker example for development in robotics☆19Sep 11, 2025Updated 9 months ago
- Public repository for the ECCV 2024 paper "Train Till You Drop: Towards Stable and Robust Source-free Unsupervised 3D Domain Adaptation".☆28Aug 5, 2025Updated 10 months ago
- [ACM MM 2025] Mobile U-ViT: Revisiting large kernel and U-shaped ViT for efficient medical image segmentation☆60Oct 29, 2025Updated 7 months ago
- PreciseCam: Precise Camera Control for Text-to-Image Generation☆25May 7, 2025Updated last year
- [ACCV 2024] Official code for "DeBiFormer: Vision Transformer with Deformable Agent Bi-level Routing Attention"☆33Jan 8, 2025Updated last year
- This is an official implementation of our CVPR 2020 paper "Non-Local Neural Networks With Grouped Bilinear Attentional Transforms".☆13Jan 30, 2021Updated 5 years ago
- ☆13Oct 29, 2021Updated 4 years ago
- dinov2 features aligned with CLIP☆22Jul 9, 2024Updated last year
- German Language Understanding Evaluation Benchmark @NAACL24☆23Dec 11, 2025Updated 6 months ago
- PyTorch Implementation for the paper "C3VQG: Category Consistent Cyclic Visual Question Generation" (ACM MM Asia'20).☆16Mar 31, 2023Updated 3 years ago
- BRL Flight Arena Infrastructure 2.0☆18Mar 17, 2023Updated 3 years ago
- Official PyTorch implementation of "Generalized Consistency Trajectory Models for Image Manipulation"☆44Mar 31, 2024Updated 2 years ago
- This is the repository for "SELECT: A Large-Scale Benchmark of Data Curation Strategies for Image Recognition"☆16Oct 8, 2024Updated last year
- ✂️ Sentence segmentation with wtpsplit's state-of-the-art Segment any Text (SaT) models☆39May 2, 2026Updated last month
- STABILIZING GRADIENTS FOR DEEP NEURAL NETWORKS VIA EFFICIENT SVD PARAMETERIZATION☆16Jun 5, 2018Updated 8 years ago
- Laboratory code for the paper "MMDRFuse: Distilled Mini-Model with Dynamic Refresh for Multi-Modality Image Fusion"☆26Sep 3, 2024Updated last year
- ☆14Dec 2, 2018Updated 7 years ago
- The official repository for Toxic Commons and Celadon. Toxicity Classification for public domain data.☆22May 8, 2026Updated last month
- ☆11Jan 16, 2024Updated 2 years ago
- (ACM MM24) This is the official repository of GIST: Improving Parameter Efficient Fine Tuning via Knowledge Interaction.☆11Jan 28, 2024Updated 2 years ago
- Mamba support for transformer lens☆20Sep 17, 2024Updated last year
- Model-Based Image Inpainting☆17Sep 10, 2024Updated last year
- What Can You Learn from Your Muscles? Learning Visual Representation from Human Interactions (https://arxiv.org/pdf/2010.08539.pdf)☆39Mar 30, 2021Updated 5 years ago
- PyTorch Implementation for the paper "Let Me Help You! Neuro-Symbolic Short-Context Action Anticipation" accepted to RA-L'24.☆12Nov 27, 2024Updated last year
- Code and data releases for the paper -- DelTA: An Online Document-Level Translation Agent Based on Multi-Level Memory☆62Feb 10, 2025Updated last year
- Compare openresty vs nginx + PUC_lua☆18Nov 3, 2023Updated 2 years ago
- Code repository for the paper "MrT5: Dynamic Token Merging for Efficient Byte-level Language Models."☆58Sep 25, 2025Updated 8 months ago
- Cross-Self KV Cache Pruning for Efficient Vision-Language Inference☆10Dec 15, 2024Updated last year
- PyTorch implementation of the conditional variational autoencoder (CVAE) from CodeSLAM☆13Jun 20, 2022Updated 3 years ago
- Noise-robust de-duplication at scale☆19Apr 9, 2023Updated 3 years ago
- CVPR 2025 Workshop on CVEU.☆42Jun 12, 2025Updated 11 months ago
- Official implementation of Add-SD: Rational Generation without Manual Reference.☆28Aug 19, 2024Updated last year