Official Repo for the paper: VCR: Visual Caption Restoration. Check arxiv.org/pdf/2406.06462 for details.
☆32 · Feb 26, 2025 · Updated last year
Alternatives and similar repositories for VCR
Users that are interested in VCR are comparing it to the libraries listed below.
- MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation ☆18 · Sep 2, 2024 · Updated last year
- ☆17 · Feb 22, 2024 · Updated 2 years ago
- Enable Comprehensive LLM Evaluation on Graph Reasoning ☆79 · Jun 12, 2025 · Updated last year
- The code repository for "Wings: Learning Multimodal LLMs without Text-only Forgetting" [NeurIPS 2024] ☆27 · Dec 28, 2024 · Updated last year
- ☆13 · May 9, 2023 · Updated 3 years ago
- Easy no-frills Jax implementations of common abstractions for simple diffusion models. ☆11 · Feb 23, 2026 · Updated 3 months ago
- [SCIS 2024] The official implementation of the paper "MMInstruct: A High-Quality Multi-Modal Instruction Tuning Dataset with Extensive Di… ☆63 · Nov 7, 2024 · Updated last year
- [ICLR 2025 Spotlight] Code release for "Sharpness-Aware Minimization Efficiently Selects Flatter Minima Late In Training" ☆19 · Feb 20, 2025 · Updated last year
- SPICE: A Synergistic, Precise, Iterative, and Customizable Image Editing Workflow ☆25 · Feb 6, 2026 · Updated 4 months ago
- ☆10 · Nov 12, 2024 · Updated last year
- Official repo for From Intention to Execution: Probing the Generalization Boundaries of Vision-Language-Action Models ☆34 · Nov 2, 2025 · Updated 7 months ago
- ☆17 · Oct 22, 2024 · Updated last year
- Linked Data to Natural Language ☆11 · Jan 6, 2024 · Updated 2 years ago
- [ACL 2025] Uncovering the Impact of Chain-of-Thought Reasoning for Direct Preference Optimization: Lessons from Text-to-SQL ☆16 · Oct 9, 2025 · Updated 8 months ago
- QALD-9-Plus Dataset for Knowledge Graph Question Answering ☆29 · Jun 5, 2024 · Updated 2 years ago
- Code related to "Beyond spectral gap: The role of the topology in decentralized learning". ☆14 · Jun 7, 2022 · Updated 4 years ago
- ☆11 · Feb 14, 2022 · Updated 4 years ago
- Code for the paper "Spectrum Guided Topology Augmentation for Graph Contrastive Learning" ☆11 · Jul 18, 2023 · Updated 2 years ago
- Official repository of MMDU dataset ☆106 · Sep 29, 2024 · Updated last year
- Zhejiang University Beamer template ☆16 · May 19, 2022 · Updated 4 years ago
- [NeurIPS 2024] CharXiv: Charting Gaps in Realistic Chart Understanding in Multimodal LLMs ☆153 · Apr 22, 2025 · Updated last year
- ☆11 · Jan 3, 2024 · Updated 2 years ago
- [ICML 2024] MMT-Bench: A Comprehensive Multimodal Benchmark for Evaluating Large Vision-Language Models Towards Multitask AGI ☆117 · Apr 6, 2026 · Updated 2 months ago
- ☆20 · Jan 10, 2025 · Updated last year
- Code for Paper: Harnessing Webpage UIs for Text-Rich Visual Understanding ☆54 · Dec 12, 2024 · Updated last year
- ☆77 · Mar 23, 2026 · Updated 2 months ago
- Official Repository of MMLONGBENCH-DOC: Benchmarking Long-context Document Understanding with Visualizations ☆147 · Sep 28, 2025 · Updated 8 months ago
- ☆47 · Nov 8, 2024 · Updated last year
- PaCE: Parsimonious Concept Engineering for Large Language Models (NeurIPS 2024) ☆43 · Jan 18, 2026 · Updated 5 months ago
- [NeurIPS 2024] Efficiency for Free: Ideal Data Are Transportable Representations ☆19 · Jan 19, 2025 · Updated last year
- ☆68 · Feb 1, 2025 · Updated last year
- PDA: Privacy-preserving Distributed Algorithms ☆15 · Feb 5, 2026 · Updated 4 months ago
- DELLA-Merging: Reducing Interference in Model Merging through Magnitude-Based Sampling ☆37 · Jul 12, 2024 · Updated last year
- [ICLR 2025] Mathematical Visual Instruction Tuning for Multi-modal Large Language Models ☆155 · Dec 5, 2024 · Updated last year
- The MEDS Decentralized Extensible Validation (MEDS-DEV) Benchmark: Establishing Reproducibility and Comparability in ML for Health ☆39 · May 14, 2026 · Updated last month
- Dataset Resplitting for Generalization in KGQA. See also https://github.com/semantic-systems/KGQA-datasets ☆17 · Jun 29, 2022 · Updated 3 years ago
- ☆14 · Oct 28, 2022 · Updated 3 years ago
- ☆48 · Sep 5, 2024 · Updated last year
- Unofficial Paddle implementation of "Swin Transformer V2: Scaling Up Capacity and Resolution" ☆33 · Nov 28, 2021 · Updated 4 years ago