pengshuai-rin / MultiMath
MultiMath: Bridging Visual and Mathematical Reasoning for Large Language Models
☆10Updated 2 weeks ago
Related projects: ⓘ
- Code for Math-LLaVA: Bootstrapping Mathematical Reasoning for Multimodal Large Language Models☆52Updated 2 months ago
- An Easy-to-use Hallucination Detection Framework for LLMs.☆48Updated 5 months ago
- ☆13Updated 10 months ago
- ☆110Updated last month
- A Neural-Symbolic Self-Training Framework☆95Updated last month
- An benchmark for evaluating the capabilities of large vision-language models (LVLMs)☆32Updated 10 months ago
- MATH-Vision dataset and code to measure Multimodal Mathematical Reasoning capabilities.☆53Updated 3 weeks ago
- ☆71Updated 8 months ago
- ☆70Updated 6 months ago
- Neeko: Leveraging Dynamic LoRA for Efficient Multi-Character Role-Playing Agent☆92Updated 2 months ago
- [ACL 2024] The official codebase for the paper "Self-Distillation Bridges Distribution Gap in Language Model Fine-tuning".☆81Updated this week
- A Synthetic, Scalable and Systematic Evaluation Suite for Large Language Models☆31Updated 3 months ago
- ☆40Updated 5 months ago
- ☆46Updated 2 weeks ago
- ☆31Updated 3 months ago
- A Survey on Benchmarks of Multimodal Large Language Models☆30Updated last month
- An LLM-free Multi-dimensional Benchmark for Multi-modal Hallucination Evaluation☆85Updated 8 months ago
- ☆11Updated 2 months ago
- This repo contains evaluation code for the paper "MileBench: Benchmarking MLLMs in Long Context"☆21Updated 2 months ago
- [ACL'2024 Findings] GAOKAO-MM: A Chinese Human-Level Benchmark for Multimodal Models Evaluation☆32Updated 6 months ago
- Towards Large Multimodal Models as Visual Foundation Agents☆87Updated 3 weeks ago
- Official repo for "AlignGPT: Multi-modal Large Language Models with Adaptive Alignment Capability"☆29Updated 2 months ago
- UniGen: A Unified Framework for Dataset Generation via Large Language Model☆21Updated 2 weeks ago
- Data for evaluating GPT-4V☆11Updated 10 months ago
- Codebase for ACL 2023 paper "Mixture-of-Domain-Adapters: Decoupling and Injecting Domain Knowledge to Pre-trained Language Models' Memori…☆44Updated 11 months ago
- Official repository for paper "Weak-to-Strong Extrapolation Expedites Alignment"☆62Updated 3 months ago
- This is the repo for our paper "Mr-Ben: A Comprehensive Meta-Reasoning Benchmark for Large Language Models"☆38Updated 2 months ago
- ChartMimic: Evaluating LMM’s Cross-Modal Reasoning Capability via Chart-to-Code Generation☆80Updated 2 months ago
- [ICML'2024] Can AI Assistants Know What They Don't Know?☆62Updated 7 months ago
- ☆27Updated last month