[NAACL 2025] Guiding Large Language Models in Code Execution with Fine-grained Multimodal Chain-of-Thought Reasoning
☆12Feb 9, 2025Updated last year
Alternatives and similar repositories for VisualCoder
Users that are interested in VisualCoder are comparing it to the libraries listed below
Sorting:
- ☆11Sep 4, 2024Updated last year
- Generate the markdown version of your Vocabulary Builder in Kindle, and put it in your Obsidian Vault.☆20Aug 16, 2025Updated 6 months ago
- An Efficient & Standardized Benchmark Suite for Backdoor Attacks in Federated Learning☆48Nov 27, 2025Updated 3 months ago
- Python Control Flow Graph Generator☆19Feb 28, 2022Updated 4 years ago
- [FORGE 2025] Predicting Program Behavior with Dynamic Dependencies Learning☆28Aug 15, 2024Updated last year
- IBA: Towards Irreversible Backdoor Attacks in Federated Learning (Poster at NeurIPS 2023)☆40Sep 10, 2025Updated 5 months ago
- ☆11Aug 20, 2025Updated 6 months ago
- Automatically generates captions for an image using Image processing and NLP. Model was trained on Flickr30K dataset.☆11Jun 11, 2020Updated 5 years ago
- Official implementation of CMMCoT: Enhancing Complex Multi-Image Comprehension via Multi-Modal Chain-of-Thought and Memory Augmentation☆12Dec 5, 2025Updated 2 months ago
- Instance-Level Salient Object Detection, Computer Vision and Image Understanding (CVIU), 2021.☆12Apr 23, 2021Updated 4 years ago
- This repository is an official implementation of the paper A Simple Baseline for Open-World Tracking via Self-training.☆10Jan 26, 2024Updated 2 years ago
- ☆12Dec 20, 2024Updated last year
- ☆10Apr 10, 2019Updated 6 years ago
- ☆12Feb 27, 2025Updated last year
- Tis is code for Few-Shot Joint Multimodal Entity-Relation Extraction via Knowledge-Enhanced Cross-modal Prompt Model (ACM MM 2024))☆12Aug 27, 2024Updated last year
- Code for "Revisiting Batch Norm Initialization".☆12Jul 14, 2022Updated 3 years ago
- A collection of research papers related to Natural Language Reasoning☆11May 27, 2022Updated 3 years ago
- Tool to parse wiki tables from the HTML dump of Wikipedia☆11Jun 12, 2022Updated 3 years ago
- [AAAI 2024] MESED: A Multi-modal Entity Set Expansion Dataset with Fine-grained Semantic Classes and Hard Negative Entities☆16Apr 26, 2024Updated last year
- [NeurIPS 2024] A Large-Scale Human-Centric Benchmark for Referring Expression Comprehension in the LMM Era☆11Aug 6, 2024Updated last year
- The source of MNER-MI.☆19Dec 17, 2024Updated last year
- [EMNLP 2024] SURf: Teaching Large Vision-Language Models to Selectively Utilize Retrieved Information☆12Oct 11, 2024Updated last year
- Fast integration of backdoor attacks in federated learning with updated attacks and defenses.☆58Jan 19, 2026Updated last month
- The official implement of "Grounded Chain-of-Thought for Multimodal Large Language Models"☆21Jul 21, 2025Updated 7 months ago
- [EMNLP 2024] IFCap: Image-like Retrieval and Frequency-based Entity Filtering for Zero-shot Captioning☆15May 13, 2025Updated 9 months ago
- The official codes for Fast Monte Carlo Rendering via Multi-Resolution Sampling☆16Dec 2, 2021Updated 4 years ago
- Real Time Object Detection By Using YOLO to online shopping☆11Mar 17, 2019Updated 6 years ago
- This project is an attempt at performing color quantization using K-Means clustering. We also add our own touch by trying a different ini…☆15Jul 31, 2020Updated 5 years ago
- Python implementation of Medoidshift and Quickshift algorithms☆15Feb 5, 2015Updated 11 years ago
- An implementation of DSOD in Pytonch☆15Jul 13, 2018Updated 7 years ago
- A python tool that generate latex(e.g. Table, matrix) code.☆10Jun 22, 2022Updated 3 years ago
- [ACL 2024] Novel reranking method to select the best solutions for code generation☆16Jun 9, 2024Updated last year
- [EMNLP2022] Transformer-based Entity Typing in Knowledge Graphs☆16Nov 26, 2024Updated last year
- convert table or spreadsheet data into an image☆17Mar 30, 2023Updated 2 years ago
- Laplacian-Pyramid-Reconstruction-and-Refinement-for-Semantic-Segmentation in Pytorch☆12Nov 3, 2018Updated 7 years ago
- Support finetuning GLM4v with zero2☆16Jun 29, 2024Updated last year
- [ACL2024] Progressively Modality Freezing for Multi-Modal Entity Alignment☆19Apr 10, 2025Updated 10 months ago
- ☆17May 15, 2023Updated 2 years ago
- Implementation of the Grassberger-Procaccia algorithm to estimate the Correlation Dimension of a set of points☆17Jan 18, 2022Updated 4 years ago