☆27May 13, 2025Updated 10 months ago
Alternatives and similar repositories for dymu
Users that are interested in dymu are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆14Apr 25, 2025Updated 11 months ago
- SurgLaVi: Official repository☆29Mar 4, 2026Updated 3 weeks ago
- CoV: Chain-of-View Prompting for Spatial Reasoning☆52Jan 23, 2026Updated 2 months ago
- Official implementation of the paper: "A deeper look at depth pruning of LLMs"☆15Jul 24, 2024Updated last year
- [ECCV 2024] Learning Video Context as Interleaved Multimodal Sequences☆44Mar 11, 2025Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Repo for "Zemi: Learning Zero-Shot Semi-Parametric Language Models from Multiple Tasks" ACL 2023 Findings☆15May 3, 2023Updated 2 years ago
- [ICML 2025] This is the official PyTorch implementation of "OmniBal: Towards Fast Instruction-Tuning for Vision-Language Models via Omniv…☆27Jun 16, 2025Updated 9 months ago
- Solving Token Gradient Conflict in Mixture-of-Experts for Large Vision-Language Model☆13Feb 11, 2025Updated last year
- PathPiece tokenizer☆14Nov 10, 2024Updated last year
- [ICLR 2025] The official pytorch implement of "Dynamic-LLaVA: Efficient Multimodal Large Language Models via Dynamic Vision-language Cont…☆72Sep 18, 2025Updated 6 months ago
- [ICLR 2025] Official Pytorch Implementation of "Mix-LN: Unleashing the Power of Deeper Layers by Combining Pre-LN and Post-LN" by Pengxia…☆29Jul 24, 2025Updated 8 months ago
- Official repository for the paper "Random Shuffle Transformer for Image Restoration".☆17Jan 9, 2024Updated 2 years ago
- An adaptive sampling framework for Reinforce-style LLM post training.☆92Nov 29, 2025Updated 4 months ago
- ☆12Apr 19, 2024Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- VideoNSA: Native Sparse Attention Scales Video Understanding☆81Nov 16, 2025Updated 4 months ago
- [IJCAI 2025] In-Context Meta LoRA Generation☆31Jul 29, 2025Updated 8 months ago
- [ICML 2025] Code for "R2-T2: Re-Routing in Test-Time for Multimodal Mixture-of-Experts"☆19Mar 10, 2025Updated last year
- [CVPR 2026] Accelerating Streaming Video Large Language Models via Hierarchical Token Compression☆51Feb 25, 2026Updated last month
- Official PyTorch implementation of the Winner Award solution of MIPI 2024 Demosaic for HybridEVS Camera Challenge (CVPR Workshop 2024).☆16Jan 28, 2025Updated last year
- [CVPR 2025] PVC: Progressive Visual Token Compression for Unified Image and Video Processing in Large Vision-Language Models☆52Jun 12, 2025Updated 9 months ago
- https://footprints.baulab.info☆18Oct 4, 2024Updated last year
- An implementation of several unsupervised object discovery models (Slot Attention, SLATE, GNM) in PyTorch with pre-trained models.☆15May 26, 2025Updated 10 months ago
- ☆17Feb 4, 2025Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Implementation code for ACL2024:Advancing Parameter Efficiency in Fine-tuning via Representation Editing☆15Apr 20, 2024Updated last year
- [COLING 2025] Official repo of paper: "Not Aligned" is Not "Malicious": Being Careful about Hallucinations of Large Language Models' Jail…☆12Jul 26, 2024Updated last year
- A simple and minimal open source implementation of "Introducing LFM2: The Fastest On-Device Foundation Models on the Market" from Liquid …☆23Mar 22, 2026Updated last week
- ☆49Apr 4, 2025Updated 11 months ago
- DL Backtrace is a new explainablity technique for deep learning models that works for any modality and model type.☆25Updated this week
- This is the repo for "Adaptive Unimodal Regulation for Balanced Multimodal Information Acquisition", CVPR2025.☆20Dec 22, 2025Updated 3 months ago
- Official Code for "Painting with Words: Elevating Detailed Image Captioning with Benchmark and Alignment Learning" (ICLR 2025)☆12Mar 6, 2025Updated last year
- Multi-Modal Tree of thoughts for DALLE-3 like auto self improvement☆17Nov 11, 2024Updated last year
- [IEEE TIP] Offical implementation for the work "BadCM: Invisible Backdoor Attack against Cross-Modal Learning".☆14Aug 30, 2024Updated last year
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- mit6.830 all-pass☆12Mar 25, 2022Updated 4 years ago
- iLLaVA: An Image is Worth Fewer Than 1/3 Input Tokens in Large Multimodal Models (ICLR2026)☆21Mar 10, 2026Updated 2 weeks ago
- Code release for VTW (AAAI 2025 Oral)☆66Nov 4, 2025Updated 4 months ago
- ☆20Jul 18, 2022Updated 3 years ago
- Crossmodal Translation based Meta Weight Adaption for Robust Image-Text Sentiment Analysis☆15May 16, 2024Updated last year
- [ICLR'26] Traceable Evidence Enhanced Visual Grounded Reasoning: Evaluation and Methodology☆78Jan 26, 2026Updated 2 months ago
- ☆19Aug 15, 2018Updated 7 years ago