The code repository for "Wings: Learning Multimodal LLMs without Text-only Forgetting" [NeurIPS 2024]
☆27Dec 28, 2024Updated last year
Alternatives and similar repositories for Wings
Users that are interested in Wings are comparing it to the libraries listed below.
- ☆13May 9, 2023Updated 2 years ago
- Official eval code for ROVER: Benchmarking Reciprocal Cross-Modal Reasoning for Omnimodal Generation☆27Dec 12, 2025Updated 4 months ago
- This is the official code repository for the paper: Towards General Continuous Memory for Vision-Language Models.☆25Jul 3, 2025Updated 9 months ago
- Math-VR Benchmark & CodePlot-CoT: Mathematical Visual Reasoning by Thinking with Code-Driven Images☆56Nov 4, 2025Updated 5 months ago
- Official Repo for the paper: VCR: Visual Caption Restoration. Check arxiv.org/pdf/2406.06462 for details.☆32Feb 26, 2025Updated last year
- ☆10Mar 4, 2024Updated 2 years ago
- The benchmark and datasets of the ICML 2024 paper "VisionGraph: Leveraging Large Multimodal Models for Graph Theory Problems in Visual C…☆17May 27, 2024Updated last year
- Official code for the ICLR 2025 paper, "Ada-K Routing: Boosting the Efficiency of MoE-based LLMs"☆12Mar 1, 2025Updated last year
- An Easy-to-use Hallucination Detection Framework for LLMs.☆63Apr 21, 2024Updated last year
- A unified model that seamlessly integrates multimodal understanding, text-to-image generation, and image editing within a single powerfu…☆452Dec 2, 2025Updated 4 months ago
- [ICML'25] Official code of paper "Fast Large Language Model Collaborative Decoding via Speculation"☆30Jun 23, 2025Updated 9 months ago
- Supporting code for ReCEval paper☆32Sep 14, 2024Updated last year
- AdaICL: Which Examples to Annotate for In-Context Learning? Towards Effective and Efficient Selection☆19Oct 30, 2023Updated 2 years ago
- Code for paper: Unraveling the Shift of Visual Information Flow in MLLMs: From Phased Interaction to Efficient Inference☆13Jun 7, 2025Updated 10 months ago
- [CVPR 2026] Thinking with Programming Vision: Towards a Unified View for Thinking with Images☆68Jan 23, 2026Updated 2 months ago
- ACL 2024 (SRW), Official Codebase of our Paper: "MoExtend: Tuning New Experts for Modality and Task Extension"☆15Dec 3, 2024Updated last year
- ☆14Jul 15, 2025Updated 9 months ago
- [ACL'24 Findings] Official code for "TLCR: Token-Level Continuous Reward for Fine-grained Reinforcement Learning from Human Feedback"☆12Dec 6, 2024Updated last year
- UMB: Understanding Model Behavior for Open-World Object Detection (NeurIPS 2024)☆11May 26, 2024Updated last year
- Mini library for collecting images from Google Street View, designed mainly for building ML datasets☆11Nov 15, 2021Updated 4 years ago
- ☆15Jul 22, 2024Updated last year
- Code for Research Project TLDR☆25Jul 28, 2025Updated 8 months ago
- CCF 2021 BDCI Qianyan question-matching robustness evaluation: ranked 29th on leaderboard A, 15th on leaderboard B☆14Jan 5, 2022Updated 4 years ago
- The source code of Mem-Gallery: Benchmarking Multimodal Long-Term Conversational Memory for MLLM Agents.☆47Jan 31, 2026Updated 2 months ago
- Awesome Unified Multimodal Models☆1,211Mar 24, 2026Updated 3 weeks ago
- A unified neural-symbolic framework for solving plane and solid geometric problems via Parse2Reason & Official repository for the CVPR 20…☆40Apr 8, 2026Updated last week
- ☆13Feb 21, 2024Updated 2 years ago
- ☆13Dec 2, 2019Updated 6 years ago
- Code for ACL 2023 paper "Rethinking Multimodal Entity and Relation Extraction from a Translation Point of View"☆25Jan 18, 2026Updated 3 months ago
- [NeurIPS 2024] Official Code for the Paper "Multimodal Task Vectors Enable Many-Shot Multimodal In-Context Learning"☆28Apr 8, 2025Updated last year
- Description for MV-MATH☆15Jul 20, 2025Updated 8 months ago
- Discrete Diffusion VLA: Bringing Discrete Diffusion to Action Decoding in Vision-Language-Action Policies☆61Dec 3, 2025Updated 4 months ago
- The official repo of continuous speculative decoding☆33Mar 28, 2025Updated last year
- ☆12Oct 10, 2024Updated last year
- Technical Foundations of Multimodal Deep Learning☆20Nov 15, 2024Updated last year
- A huge dataset for Document Visual Question Answering☆22Jul 29, 2024Updated last year
- Accelerating the development of large multimodal models (LMMs) with lmms-eval☆14Oct 14, 2024Updated last year
- Code of ACM MM 2023 Paper: A Symbolic Characters Aware Model for Solving Geometry Problems☆16Dec 27, 2023Updated 2 years ago
- (ACL 2025 Main) Safe: Enhancing Mathematical Reasoning in Large Language Models via Retrospective Step-aware Formal Verification - Offici…☆20Dec 26, 2025Updated 3 months ago