A programmatic instruction template generator intended to improve understanding of the critical role that instruction templates play in the evaluation and training of large Multimodal Language Models (MLMs).
☆15 · Dec 22, 2024 · Updated last year
Alternatives and similar repositories for TemplateMatters
Users who are interested in TemplateMatters are comparing it to the libraries listed below.
- ☆35 · Feb 15, 2026 · Updated last month
- Code and Data for Paper: SELMA: Learning and Merging Skill-Specific Text-to-Image Experts with Auto-Generated Data ☆35 · Mar 12, 2024 · Updated 2 years ago
- NLPBench: Evaluating NLP-Related Problem-solving Ability in Large Language Models ☆10 · Oct 27, 2023 · Updated 2 years ago
- https://arxiv.org/abs/2404.10917 ☆14 · Mar 18, 2025 · Updated last year
- Unofficial Pytorch implementation of the paper 'Categorical Reparameterization with Gumbel-Softmax' and 'The Concrete Distribution: A Con… ☆11 · Apr 27, 2021 · Updated 4 years ago
- ☆42 · Feb 12, 2026 · Updated last month
- Training Autoregressive Image Generation models via Reinforcement Learning ☆51 · Nov 26, 2025 · Updated 3 months ago
- [ICCV 2023] Subclass-balancing contrastive learning for long-tailed recognition ☆18 · Oct 30, 2023 · Updated 2 years ago
- The official implementation of "XSkill: Continual Learning from Experience and Skills in Multimodal Agents" ☆77 · Updated this week
- Agent Skill Induction: "Inducing Programmatic Skills for Agentic Tasks" ☆39 · Apr 24, 2025 · Updated 10 months ago
- [ICCV 2023] MADAug: When to Learn What: Model-Adaptive Data Augmentation Curriculum ☆19 · Nov 9, 2023 · Updated 2 years ago
- Training and evaluating self-supervised deep neural networks ☆27 · Sep 10, 2017 · Updated 8 years ago
- AbstainQA, ACL 2024 ☆29 · Feb 4, 2026 · Updated last month
- Official code for "Iterative Regularized Policy Optimization with Imperfect Demonstrations" (ICML2024). ☆29 · Nov 13, 2025 · Updated 4 months ago
- [ICLR 2023] MultiViz: Towards Visualizing and Understanding Multimodal Models ☆99 · Aug 22, 2024 · Updated last year
- ☆213 · Dec 19, 2025 · Updated 3 months ago
- NaturalCodeBench (Findings of ACL 2024) ☆68 · Oct 14, 2024 · Updated last year
- A version of verl to support diverse tool use ☆911 · Mar 2, 2026 · Updated 2 weeks ago
- ☆135 · Jun 25, 2024 · Updated last year
- Code and trained models for our paper: K. Triaridis, V. Mezaris, "Exploring Multi-Modal Fusion for Image Manipulation Detection and Local… ☆103 · Apr 1, 2024 · Updated last year
- [NeurIPS 2023] A faithful benchmark for vision-language compositionality ☆89 · Feb 13, 2024 · Updated 2 years ago
- TIFA: Accurate and Interpretable Text-to-Image Faithfulness Evaluation with Question Answering ☆182 · Apr 29, 2024 · Updated last year
- Is synthetic data from generative models ready for image recognition? ☆186 · Feb 16, 2023 · Updated 3 years ago
- [ICLR'26, NAACL'25 Demo] Toolkit & Benchmark for evaluating the trustworthiness of generative foundation models. ☆128 · Aug 22, 2025 · Updated 7 months ago
- Official code for Paper "Mantis: Multi-Image Instruction Tuning" [TMLR 2024 Best Paper] ☆239 · Jan 3, 2026 · Updated 2 months ago
- [AAAI-2026] Code for "UI-R1: Enhancing Efficient Action Prediction of GUI Agents by Reinforcement Learning" ☆149 · Nov 24, 2025 · Updated 3 months ago
- Self-training with Weak Supervision (NAACL 2021) ☆163 · Jul 24, 2023 · Updated 2 years ago
- Fully Open Framework for Democratized Multimodal Training ☆770 · Dec 27, 2025 · Updated 2 months ago
- TruFor ☆237 · May 29, 2025 · Updated 9 months ago
- ☆212 · Dec 23, 2025 · Updated 2 months ago
- A curated list of programmatic weak supervision papers and resources ☆191 · Mar 1, 2023 · Updated 3 years ago
- [NeurIPS 2021] WRENCH: Weak supeRvision bENCHmark ☆227 · Feb 13, 2024 · Updated 2 years ago
- A minimal codebase for finetuning large multimodal models, supporting llava-1.5/1.6, llava-interleave, llava-next-video, llava-onevision,… ☆368 · Feb 28, 2026 · Updated 3 weeks ago
- Visualizing the attention of vision-language models ☆285 · Feb 28, 2025 · Updated last year
- More interactive weak supervision with FlyingSquid ☆316 · Sep 1, 2020 · Updated 5 years ago
- A RocksDB plugin for key-value separation, inspired by WiscKey. ☆514 · Aug 6, 2025 · Updated 7 months ago
- Awesome Unified Multimodal Models ☆1,152 · Feb 6, 2026 · Updated last month
- GLM-4.6V/4.5V/4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning ☆2,227 · Updated this week
- Four landmark detection algorithms, implemented in PyTorch. ☆923 · Jan 26, 2021 · Updated 5 years ago