pengts/VW-LMM

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/pengts/VW-LMM)

pengts / VW-LMM

☆25

Alternatives and similar repositories for VW-LMM

Users that are interested in VW-LMM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

albertwy / GPT-4V-Evaluation
View on GitHub
Data for evaluating GPT-4V
☆11Oct 26, 2023Updated 2 years ago
huggingface / docmatix
View on GitHub
A huge dataset for Document Visual Question Answering
☆24Jul 29, 2024Updated 2 years ago
Line-Kite / GraphLayoutLM
View on GitHub
☆14Sep 6, 2024Updated last year
Hxyou / IdealGPT
View on GitHub
Official Code of IdealGPT
☆39Mar 3, 2026Updated 4 months ago
RUCAIBox / ComVint
View on GitHub
The official GitHub page for ''What Makes for Good Visual Instructions? Synthesizing Complex Visual Reasoning Instructions for Visual Ins…
☆19Nov 10, 2023Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
QC-LY / UiG
View on GitHub
Code for "Understanding-in-Generation:Reinforcing Generative Capability of Unified Model via Infusing Understanding into Generation"
☆15Nov 11, 2025Updated 8 months ago
InternScience / MME-Reasoning
View on GitHub
Official Repository: A Comprehensive Benchmark for Logical Reasoning in MLLMs
☆45Jun 17, 2025Updated last year
yuyq96 / R1-Vision
View on GitHub
R1-Vision: Let's first take a look at the image
☆48Feb 16, 2025Updated last year
ATH-MaaS / Wings
View on GitHub
The code repository for "Wings: Learning Multimodal LLMs without Text-only Forgetting" [NeurIPS 2024]
☆27Dec 28, 2024Updated last year
abhimanyudubey / GeoYFCC
View on GitHub
Dataset accompanying the paper "Adaptive Methods for Real-World Domain Generalization"
☆16Aug 17, 2023Updated 2 years ago
hhaAndroid / awesome-python-cn
View on GitHub
Python资源大全中文版，内容包括：Web框架、网络爬虫、网络内容提取、模板引擎、数据库、数据可视化、图片处理、文本处理、自然语言处理、机器学习、日志、代码分析等
☆11May 24, 2016Updated 10 years ago
nishadsinghi / CleanCLIP
View on GitHub
Official PyTorch implementation of "CleanCLIP: Mitigating Data Poisoning Attacks in Multimodal Contrastive Learning" @ ICCV 2023
☆40Oct 16, 2025Updated 9 months ago
xxyzll / UMB
View on GitHub
UMB: Understanding Model Behavior for Open-World object Detection (NeurIPS 2024)
☆12May 26, 2024Updated 2 years ago
NEUIR / RankCoT
View on GitHub
[ACL '25] Source code for our paper ''RankCoT: Refining Knowledge for Retrieval-Augmented Generation through Ranking Chain-of-Thoughts''
☆53Nov 27, 2025Updated 8 months ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
bo-miao / HTR
View on GitHub
[TCSVT 2024] Temporally Consistent Referring Video Object Segmentation with Hybrid Memory
☆19Apr 9, 2025Updated last year
OpenMatch / SANTA
View on GitHub
☆12Jul 13, 2023Updated 3 years ago
jianzhu / dl-rerank
View on GitHub
☆11May 8, 2020Updated 6 years ago
jy0205 / LaVIT
View on GitHub
LaVIT: Empower the Large Language Model to Understand and Generate Visual Content
☆603Oct 6, 2024Updated last year
Yui010206 / MEXA
View on GitHub
[EMNLP 2025 Findings] MEXA: Towards General Multimodal Reasoning with Dynamic Multi-Expert Aggregation
☆15Aug 22, 2025Updated 11 months ago
sanowl / CoRAG
View on GitHub
this is based on the paper Chain-of-Retrieval Augmented Generation
☆15Mar 29, 2025Updated last year
jiayao / mcp-chess
View on GitHub
MCP server for playing chess against AI
☆23May 5, 2025Updated last year
xianzhangzx / FINER-MLLM
View on GitHub
The implementation of FINER-MLLM, which is accepted by MM2024.
☆18Oct 8, 2024Updated last year
ucasyjz / VIP
View on GitHub
[ACCV 2024 Poster] official code for "VIP: Versatile Image Outpainting Empowered by Multimodal Large Language Model"
☆10Sep 28, 2024Updated last year
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
alexrame / diwa
View on GitHub
DiWA: Diverse Weight Averaging for Out-of-Distribution Generalization
☆31Jan 31, 2023Updated 3 years ago
AILab-CVC / SEED
View on GitHub
Official implementation of SEED-LLaMA (ICLR 2024).
☆642Sep 21, 2024Updated last year
ChocoWu / SeTok
View on GitHub
Codes for ICLR 2025 Paper: Towards Semantic Equivalence of Tokenization in Multimodal LLM
☆81Apr 19, 2025Updated last year
xiaomabufei / SKDF
View on GitHub
☆14Feb 21, 2024Updated 2 years ago
SRI-CSL / TrinityMultimodalTrojAI
View on GitHub
☆35Jun 27, 2022Updated 4 years ago
yamato0811 / streamlit-langgraph-HITL-copy-generator
View on GitHub
StreamlitとLangGraphで実装したHuman-in-the-loop広告コピー文生成アプリケーション
☆11Feb 15, 2025Updated last year
OpenRLHF / OpenRLHF-M
View on GitHub
An Easy-to-use, Scalable and High-performance RLHF Framework designed for Multimodal Models.
☆163Apr 6, 2026Updated 3 months ago
HanSolo9682 / CounterCurate
View on GitHub
This is the implementation of CounterCurate, the data curation pipeline of both physical and semantic counterfactual image-caption pairs.
☆19Jun 27, 2024Updated 2 years ago
fengbinzhu / Doc2SoarGraph
View on GitHub
The repo of the Doc2SoarGraph framework
☆10Sep 17, 2024Updated last year
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
miaoyuchun / InfoRM
View on GitHub
The official implementation of InfoRM [NeurIPS 2024].
☆16Oct 25, 2025Updated 9 months ago
AdamRain / YFCC15M_downloader
View on GitHub
A subset of YFCC100M. Tools, checking scripts and links of web drive to download datasets(uncompressed).
☆19Nov 13, 2024Updated last year
edenartlab / flux-trainer
View on GitHub
Eden Flux LoRA trainer and full-finetuning
☆23Mar 21, 2025Updated last year
lemurproject / ClueWeb22
View on GitHub
☆17Dec 11, 2024Updated last year
orrzohar / LOVM
View on GitHub
[NeurIPS 2023] Official Pytorch code for LOVM: Language-Only Vision Model Selection
☆21Feb 3, 2024Updated 2 years ago
levymsn / ChatIR
View on GitHub
Official repository of "Chatting Makes Perfect: Chat-based Image Retrieval"
☆33Feb 5, 2025Updated last year
daspartho / DiffEdit
View on GitHub
my attempt at implementing the DiffEdit paper (WIP)
☆16Oct 30, 2022Updated 3 years ago