tianyu-z/VCR

tianyu-z / VCR

Official Repo for the paper: VCR: Visual Caption Restoration. Check arxiv.org/pdf/2406.06462 for details.

☆32

Alternatives and similar repositories for VCR

Users who are interested in VCR compare it to the repositories listed below.

luli-git / MAP
MAP: Low-compute Model Merging with Amortized Pareto Fronts via Quadratic Approximation
☆18 • Sep 2, 2024 • Updated last year
vision-x-nyu / test-set-training
☆15 • Nov 25, 2025 • Updated 8 months ago
core-mm / core-mm
☆17 • Feb 22, 2024 • Updated 2 years ago
dali-does / clevr-math
☆13 • May 9, 2023 • Updated 3 years ago
DruvPai / DiffusionLab
Easy no-frills Jax implementations of common abstractions for simple diffusion models.
☆11 • Feb 23, 2026 • Updated 5 months ago
yuecao0119 / MMInstruct
[SCIS 2024] The official implementation of the paper "MMInstruct: A High-Quality Multi-Modal Instruction Tuning Dataset with Extensive Di…
☆64 • Nov 7, 2024 • Updated last year
princeton-nlp / CharXiv
[NeurIPS 2024] CharXiv: Charting Gaps in Realistic Chart Understanding in Multimodal LLMs
☆160 • Apr 22, 2025 • Updated last year
LinfengYuan1997 / LoSh
[CVPR 2024] LoSh: Long-Short Text Joint Prediction Network for Referring Video Object Segmentation
☆13 • Jun 17, 2024 • Updated 2 years ago
bytedance / MTVQA
MTVQA: Benchmarking Multilingual Text-Centric Visual Question Answering. A comprehensive evaluation of multimodal large model multilingua…
☆64 • May 15, 2025 • Updated last year
jiachenzhu / VCR
Variance Covariance Regularization
☆14 • Jun 22, 2023 • Updated 3 years ago
alex-damian / EOS
☆15 • Sep 29, 2022 • Updated 3 years ago
ATH-MaaS / Wings
The code repository for "Wings: Learning Multimodal LLMs without Text-only Forgetting" [NeurIPS 2024]
☆27 • Dec 28, 2024 • Updated last year
Healthcare-Data-Mining-Laboratory / EHR-KnowGen
☆11 • Jun 28, 2023 • Updated 3 years ago
OpenGVLab / DiffAgent
[CVPR 2024] DiffAgent: Fast and Accurate Text-to-Image API Selection with Large Language Model
☆19 • Apr 16, 2024 • Updated 2 years ago
ZhangQueque / quewaner.Crawler
Web crawler examples
☆14 • Apr 29, 2021 • Updated 5 years ago
ServiceNow / GroundCUA
GroundCUA
☆132 • Mar 24, 2026 • Updated 4 months ago
dice-group / LD2NL
Linked Data to Natural Language
☆11 • Jan 6, 2024 • Updated 2 years ago
KGQA / QALD_9_plus
QALD-9-Plus Dataset for Knowledge Graph Question Answering
☆29 • Jun 5, 2024 • Updated 2 years ago
kq-chen / VLMEvalKit
Open-source evaluation toolkit of large vision-language models (LVLMs), support ~100 VLMs, 30+ benchmarks
☆15 • Feb 17, 2025 • Updated last year
Liuziyu77 / MMDU
Official repository of MMDU dataset
☆108 • Sep 29, 2024 • Updated last year
Louise-LuLin / GCL-SPAN
Code for the paper "Spectrum Guided Topology Augmentation for Graph Contrastive Learning"
☆11 • Jul 18, 2023 • Updated 3 years ago
epfml / topology-in-decentralized-learning
Code related to "Beyond spectral gap: The role of the topology in decentralized learning".
☆14 • Jun 7, 2022 • Updated 4 years ago
shevekk / QueryGraph
☆11 • Feb 14, 2022 • Updated 4 years ago
OpenGVLab / MMT-Bench
[ICML 2024] MMT-Bench: A Comprehensive Multimodal Benchmark for Evaluating Large Vision-Language Models Towards Multitask AGI
☆119 • Apr 6, 2026 • Updated 3 months ago
neulab / MultiUI
Code for Paper: Harnessing Webpage UIs for Text-Rich Visual Understanding
☆54 • Dec 12, 2024 • Updated last year
HelmholtzAI-FZJ / flex_gen
☆20 • Jan 10, 2025 • Updated last year
SciMT / SciMT-benchmark
☆11 • Jan 3, 2024 • Updated 2 years ago
minhoooo1 / CatMAE
CatMAE
☆15 • Dec 13, 2023 • Updated 2 years ago
peterljq / Parsimonious-Concept-Engineering
PaCE: Parsimonious Concept Engineering for Large Language Models (NeurIPS 2024)
☆43 • Jan 18, 2026 • Updated 6 months ago
locuslab / llava-token-compression
☆47 • Nov 8, 2024 • Updated last year
RUCKBReasoning / DPO_Text2SQL
[ACL 2025] Uncovering the Impact of Chain-of-Thought Reasoning for Direct Preference Optimization: Lessons from Text-to-SQL
☆16 • Oct 9, 2025 • Updated 9 months ago
declare-lab / della
DELLA-Merging: Reducing Interference in Model Merging through Magnitude-Based Sampling
☆37 • Jul 12, 2024 • Updated 2 years ago
uuujf / SGDNoise
[ICML 2019] The Anisotropic Noise in Stochastic Gradient Descent: Its Behavior of Escaping from Sharp Minima and Regularization Effects
☆15 • Apr 12, 2020 • Updated 6 years ago
ZrrSkywalker / MAVIS
[ICLR 2025] Mathematical Visual Instruction Tuning for Multi-modal Large Language Models
☆156 • Dec 5, 2024 • Updated last year
Olivia-fsm / DoGE
Codebase for ICML submission "DOGE: Domain Reweighting with Generalization Estimation"
☆21 • Feb 29, 2024 • Updated 2 years ago
MathurUtkarsh / Video-Captioning-Using-LSTM-and-Keras
Generating Video Caption Using LSTM
☆12 • May 29, 2023 • Updated 3 years ago
debayan / sigir2022-sparqlbaselines
☆14 • Oct 28, 2022 • Updated 3 years ago
adlnlp / pdfvqa
☆18 • Jun 12, 2024 • Updated 2 years ago
KGQA / KGQA-datasets-generalization
Dataset Resplitting for Generalization in KGQA. See also https://github.com/semantic-systems/KGQA-datasets
☆17 • Jun 29, 2022 • Updated 4 years ago