wwzhuang01 / Math-PUMAView external linksLinks
[AAAI 2025]Math-PUMA: Progressive Upward Multimodal Alignment to Enhance Mathematical Reasoning
☆42Apr 14, 2025Updated 10 months ago
Alternatives and similar repositories for Math-PUMA
Users that are interested in Math-PUMA are comparing it to the libraries listed below
Sorting:
- [ACL 2024] Multi-modal preference alignment remedies regression of visual instruction tuning on language model☆47Nov 10, 2024Updated last year
- ☆14Dec 18, 2024Updated last year
- The implement of geometric solver PGPSNet☆30Jan 30, 2025Updated last year
- A pipeline for the automatic construction of geometry problems along with step-by-step solutions.☆16Aug 27, 2025Updated 5 months ago
- This repo is the official implementation of "Euclid’s Gift: Enhancing Spatial Perception and Reasoning in Vision‑Language Models via Geom…☆26Nov 7, 2025Updated 3 months ago
- ☆128Sep 20, 2025Updated 4 months ago
- [ACL 2023] PuMer: Pruning and Merging Tokens for Efficient Vision Language Models☆36Oct 3, 2024Updated last year
- [ICLR'25] Geometric Problem Solving Through Unified Formalized Vision-Language Pre-training☆47Jan 25, 2025Updated last year
- Official code of *Virgo: A Preliminary Exploration on Reproducing o1-like MLLM*☆20May 27, 2025Updated 8 months ago
- This is the Repository for Geometry Problem Solving Method Evaluation☆26Oct 8, 2024Updated last year
- ☆23Aug 17, 2024Updated last year
- Weakly opinionated library for implementing ML models. Less boilerplate, More rigor☆21Jul 1, 2022Updated 3 years ago
- The first end-to-end deep learning model for explicit plane geometry diagram parsing.☆57Dec 18, 2024Updated last year
- Formal representation and solving for Euclidean plane geometry problems.☆32Dec 19, 2025Updated last month
- This repository contains the code and data for the paper "VisOnlyQA: Large Vision Language Models Still Struggle with Visual Perception o…☆28Jul 9, 2025Updated 7 months ago
- [CVPR 2025] PyTorch implementation of paper "FLAME: Frozen Large Language Models Enable Data-Efficient Language-Image Pre-training"☆32Jul 8, 2025Updated 7 months ago
- The SVO-Probes Dataset for Verb Understanding☆31Jan 28, 2022Updated 4 years ago
- Paper collections of multi-modal LLM for Math/STEM/Code.☆136Nov 17, 2025Updated 2 months ago
- ACL'24 (Oral) Tuning Large Multimodal Models for Videos using Reinforcement Learning from AI Feedback☆76Sep 12, 2024Updated last year
- 表格线检测☆27Sep 3, 2019Updated 6 years ago
- [CVPR'24] RLHF-V: Towards Trustworthy MLLMs via Behavior Alignment from Fine-grained Correctional Human Feedback☆306Sep 11, 2024Updated last year
- A lightweight driving simulator, written in Julia.☆19Sep 25, 2024Updated last year
- WikiTableSet: A largest publicly available image-based table recognition dataset in three languages built from Wikipedia☆32Jun 12, 2025Updated 8 months ago
- Advanced Embodied Intelligence Brain Model☆33Nov 5, 2025Updated 3 months ago
- ☆47May 25, 2025Updated 8 months ago
- [ICLR '25] Official Pytorch implementation of "Interpreting and Editing Vision-Language Representations to Mitigate Hallucinations"☆95Nov 30, 2025Updated 2 months ago
- ☆111Jan 8, 2025Updated last year
- [ACM'MM 2024 Oral] Official code for "OneChart: Purify the Chart Structural Extraction via One Auxiliary Token"☆259Apr 14, 2025Updated 10 months ago
- Dataset introduced in PlotQA: Reasoning over Scientific Plots☆82Jun 20, 2023Updated 2 years ago
- Automatically generates captions for an image using Image processing and NLP. Model was trained on Flickr30K dataset.☆11Jun 11, 2020Updated 5 years ago
- This repository collects awesome representative papers and resources for "From Pre-training to Post-training: A Survey on Time Series Fou…☆30Feb 1, 2026Updated last week
- (NeurIPS 2025) LaRes: Evolutionary Reinforcement Learning with LLM-based Adaptive Reward Search☆20Feb 3, 2026Updated last week
- Website nhận diện và trích xuất thông tin từ Chứng Minh Nhân Dân☆11Oct 6, 2022Updated 3 years ago
- [ACM MM 2023] The released code of paper "Deconfounded Visual Question Generation with Causal Inference"☆11Sep 3, 2024Updated last year
- ICDAR 2024 Table OCR Model☆39Feb 4, 2026Updated last week
- MM-EUREKA: Exploring the Frontiers of Multimodal Reasoning with Rule-based Reinforcement Learning☆768Sep 7, 2025Updated 5 months ago
- ☆12Feb 27, 2025Updated 11 months ago
- Code release for "Category-Specific Prompts for Animal Action Recognition with Pretrained Vision-Language Models"☆14Feb 21, 2024Updated last year
- [NAACL 2025] Guiding Large Language Models in Code Execution with Fine-grained Multimodal Chain-of-Thought Reasoning☆12Feb 9, 2025Updated last year