☆12Jun 12, 2024Updated last year
Alternatives and similar repositories for VIM_TOOL
Users that are interested in VIM_TOOL are comparing it to the libraries listed below
Sorting:
- [ Arxiv 2023 ] This repository contains the code for "MUPPET: Multi-Modal Few-Shot Temporal Action Detection"☆15Aug 30, 2023Updated 2 years ago
- TC-bot using Attention-based Recurrent Neural Network (NLU) and SC-LSTM (NLG)☆14Jan 17, 2018Updated 8 years ago
- Prompt Free, Soul Driven AI Assistant☆28Feb 19, 2026Updated last month
- Improving Language Understanding from Screenshots. Paper: https://arxiv.org/abs/2402.14073☆31Jul 9, 2024Updated last year
- Can VLMs understand students' hand-drawn math work?☆17Jan 20, 2026Updated 2 months ago
- ☆14Jan 6, 2025Updated last year
- [EMNLP 2025] The official implementation of "Zero-shot Multimodal Document Retrieval via Cross-Modal Question Generation"☆15Aug 26, 2025Updated 6 months ago
- code for EMNLP 2024 paper: Interpreting Arithmetic Mechanism in Large Language Models through Comparative Neuron Analysis☆12Nov 17, 2024Updated last year
- Official implement of ACL'25 Findings paper "MMUnlearner: Reformulating Multimodal Machine Unlearning in the Era of Multimodal Large Lang…☆21Jun 17, 2025Updated 9 months ago
- Official Implementation for the paper "VisCodex: Unified Multimodal Code Generation via Merging Vision and Coding Models"☆22Aug 14, 2025Updated 7 months ago
- 2D Vector-Quantized Auto-Encoder for compression of Whole-Slide Images in Histopathology☆16Jul 18, 2024Updated last year
- The training codes of Jasper-Token-Compression-600M☆19Nov 19, 2025Updated 4 months ago
- Reasoning or Memorization? Unreliable Results of Reinforcement Learning Due to Data Contamination.☆22Jul 18, 2025Updated 8 months ago
- ☆14Jun 10, 2025Updated 9 months ago
- Official implementation of the ICCV 2021 paper "Joint Inductive and Transductive Learning for Video Object Segmentation"☆32Aug 27, 2021Updated 4 years ago
- ✱ Understanding the underlying learning dynamics of simple tasks in Transformer networks☆18Aug 16, 2024Updated last year
- Code release for "Category-Specific Prompts for Animal Action Recognition with Pretrained Vision-Language Models"☆14Feb 21, 2024Updated 2 years ago
- A collection of survey papers and resources related to Large Language Models (LLMs).☆40Feb 6, 2024Updated 2 years ago
- [ICCV 2021] Multimodal Knowledge Expansion☆10Aug 28, 2021Updated 4 years ago
- An experiment to see if chatgpt can improve the output of the stanford alpaca dataset☆12Mar 29, 2023Updated 2 years ago
- Repository of PIXAR, a Pixel-based Auto-Regressive Language Model☆18Sep 15, 2025Updated 6 months ago
- The Source Code for OmniVideoBench @ICLR 2026☆64Feb 12, 2026Updated last month
- [WWW24-UrbanCLIP] A comprehensive toolkit designed to facilitate the collection, processing, and integration of satellite imagery and ass…☆17Oct 6, 2024Updated last year
- The code used to train and run inference with MMDocIR☆32May 29, 2025Updated 9 months ago
- Code for the CVPR 2020 oral paper: Weakly Supervised Visual Semantic Parsing☆33Dec 8, 2022Updated 3 years ago
- Code for the paper "A Mechanistic Interpretation of Arithmetic Reasoning in Language Models using Causal Mediation Analysis"☆19Jun 12, 2025Updated 9 months ago
- ☆16Aug 1, 2024Updated last year
- We introduce Chart2Code, the first user-driven, hierarchical benchmark that systematically evaluates Large Multimodal Models on chart-to-…☆24Jan 27, 2026Updated last month
- Video-Text Representation Learning via Differentiable Weak Temporal Alignment (PyTorch implementation for the CVPR 2022 paper)☆11Oct 12, 2022Updated 3 years ago
- ☆22Mar 19, 2024Updated 2 years ago
- Project page for the ICDAR 2023 Paper "Inv3D: a high-resolution 3D invoice dataset for template-guided single-image document unwarping".☆13Dec 21, 2023Updated 2 years ago
- [ACM MM 2022] MM_Pyramid: Multimodal Pyramid Attentional Network for Audio-Visual Event Localization and Video Parsing☆16Aug 26, 2022Updated 3 years ago
- Jina VDR is a multilingual, multi-domain benchmark for visual document retrieval☆38Aug 4, 2025Updated 7 months ago
- how to build up Knowledge graph☆12Nov 16, 2021Updated 4 years ago
- Calibrating LLM Confidence by Probing Perturbed Representation Stability☆17Jul 5, 2025Updated 8 months ago
- ☆14Apr 21, 2023Updated 2 years ago
- ☆16Oct 21, 2024Updated last year
- ☆14Nov 13, 2023Updated 2 years ago
- ☆17Oct 22, 2024Updated last year