☆25May 13, 2024Updated last year
Alternatives and similar repositories for VW-LMM
Users that are interested in VW-LMM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official Repository: A Comprehensive Benchmark for Logical Reasoning in MLLMs☆45Jun 17, 2025Updated 10 months ago
- A huge dataset for Document Visual Question Answering☆23Jul 29, 2024Updated last year
- ☆14Sep 6, 2024Updated last year
- The official GitHub page for ''What Makes for Good Visual Instructions? Synthesizing Complex Visual Reasoning Instructions for Visual Ins…☆19Nov 10, 2023Updated 2 years ago
- The official repository of MM-R5☆29Jun 22, 2025Updated 10 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- R1-Vision: Let's first take a look at the image☆48Feb 16, 2025Updated last year
- Open ChatGLM Eyes to See the World☆13Mar 30, 2023Updated 3 years ago
- Improving Language Understanding from Screenshots. Paper: https://arxiv.org/abs/2402.14073☆31Jul 9, 2024Updated last year
- Dataset accompanying the paper "Adaptive Methods for Real-World Domain Generalization"☆16Aug 17, 2023Updated 2 years ago
- [EMNLP 2025] Official codebase for Rearank: Reasoning Re-ranking Agent☆35Aug 20, 2025Updated 8 months ago
- Official code for DAM: Dynamic Adapter Merging for Continual Video QA Learning☆15Apr 25, 2024Updated 2 years ago
- [ICLR 2025] Adaptive prompt tailored pruning of T2I diffusion models.☆15Feb 1, 2025Updated last year
- A GPT-powered AI auto scraper for websites. AI Web Scraping made easy.☆14Jun 26, 2023Updated 2 years ago
- Official PyTorch implementation of "CleanCLIP: Mitigating Data Poisoning Attacks in Multimodal Contrastive Learning" @ ICCV 2023☆40Oct 16, 2025Updated 6 months ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Official Implementation for "SiLVR : A Simple Language-based Video Reasoning Framework"☆19Jan 18, 2026Updated 3 months ago
- Code for "CAFe: Unifying Representation and Generation with Contrastive-Autoregressive Finetuning"☆33Mar 26, 2025Updated last year
- this is based on the paper Chain-of-Retrieval Augmented Generation☆14Mar 29, 2025Updated last year
- ☆20Feb 3, 2025Updated last year
- ☆12Jul 13, 2023Updated 2 years ago
- Thermal Indoor Motion Dataset☆16Apr 27, 2023Updated 3 years ago
- ☆34Jun 27, 2022Updated 3 years ago
- LaVIT: Empower the Large Language Model to Understand and Generate Visual Content☆604Oct 6, 2024Updated last year
- [CVPR 2025] 🔥 Official impl. of "TokenFlow: Unified Image Tokenizer for Multimodal Understanding and Generation".☆459Aug 8, 2025Updated 9 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- UMB: Understanding Model Behavior for Open-World object Detection (NeurIPS 2024)☆11May 26, 2024Updated last year
- [TCSVT 2024] Temporally Consistent Referring Video Object Segmentation with Hybrid Memory☆19Apr 9, 2025Updated last year
- This repository contains the source code for the paper Wakey-Wakey: Animate Text by Mimicking Characters in a GIF☆14Jul 18, 2024Updated last year
- This is the official code for our paper "Simple and Scalable Nearest Neighbor Machine Translation" (ICLR 2023).☆14Nov 22, 2023Updated 2 years ago
- Companion code to https://arxiv.org/abs/2402.15491☆22Sep 18, 2025Updated 7 months ago
- Official implementation of SEED-LLaMA (ICLR 2024).☆641Sep 21, 2024Updated last year
- An Easy-to-use, Scalable and High-performance RLHF Framework designed for Multimodal Models.☆162Apr 6, 2026Updated last month
- DiWA: Diverse Weight Averaging for Out-of-Distribution Generalization☆31Jan 31, 2023Updated 3 years ago
- Codes for ICLR 2025 Paper: Towards Semantic Equivalence of Tokenization in Multimodal LLM☆80Apr 19, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Code repository is for "Federated Composite Optimization", to appear in ICML 2021☆12May 6, 2022Updated 4 years ago
- StreamlitとLangGraphで実装したHuman-in-the-loop広告コピー文生成アプリケーション☆11Feb 15, 2025Updated last year
- ☆14Feb 21, 2024Updated 2 years ago
- Training-free Guidance in Text-to-Video Generation via Multimodal Planning and Structured Noise Initialization☆27Apr 14, 2025Updated last year
- ☆37Apr 9, 2026Updated 3 weeks ago
- This is the implementation of CounterCurate, the data curation pipeline of both physical and semantic counterfactual image-caption pairs.☆19Jun 27, 2024Updated last year
- The repo of the Doc2SoarGraph framework☆10Sep 17, 2024Updated last year