Latest Advances on Reasoning of Multimodal Large Language Models (Multimodal R1 \ Visual R1) 🍓
☆36Apr 3, 2025Updated 11 months ago
Alternatives and similar repositories for Awesome-MLLM-Reasoning
Users who are interested in Awesome-MLLM-Reasoning are comparing it to the repositories listed below
- Reference homework from previous years for the Computer Science and Technology program at Beijing Jiaotong University; copying is strictly prohibited.☆13Jul 2, 2022Updated 3 years ago
- R1-Vision: Let's first take a look at the image☆48Feb 16, 2025Updated last year
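- Assembly language code covering the lab exercises for the Microcomputer Principles and Interface Technology course. We hope this project helps anyone who is studying or interested in assembly language and microcomputer principles and interface technology.☆13Nov 1, 2019Updated 6 years ago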
- ☆22Nov 19, 2024Updated last year
- [ICASSP 2022] Improving End-to-End Contextual Speech Recognition with Fine-Grained Contextual Knowledge Selection☆25May 18, 2023Updated 2 years ago
- ☆49Aug 14, 2025Updated 6 months ago
- An LDM on the MNIST handwritten-digit dataset that uses a variational autoencoder as its encoder and decoder☆24May 23, 2024Updated last year
- A benchmark for the task of translation suggestion☆60Jun 23, 2022Updated 3 years ago
- [EMNLP 2025] Distill Visual Chart Reasoning Ability from LLMs to MLLMs☆59Aug 25, 2025Updated 6 months ago
- TransformerLight: A Novel Sequence Modeling Based Traffic Signaling Mechanism via Gated Transformer (29th ACM SIGKDD)☆31Aug 28, 2023Updated 2 years ago
- MMR1: Enhancing Multimodal Reasoning with Variance-Aware Sampling and Open Resources☆216Sep 26, 2025Updated 5 months ago
- MM-Eureka V0, also called R1-Multimodal-Journey; the latest version is in MM-Eureka☆324Jun 21, 2025Updated 8 months ago
- [ICASSP 2020] CIF: Continuous Integrate-and-Fire for End-to-End Speech Recognition (A PyTorch implementation of Continuous Integrate-and-…☆80Jan 9, 2025Updated last year
- Wind Turbine Blade Image Dataset☆13May 23, 2019Updated 6 years ago
- ☆37Jun 28, 2021Updated 4 years ago
- [Preprint] GMem: A Modular Approach for Ultra-Efficient Generative Models☆43Mar 11, 2025Updated 11 months ago
- Latest Advances on System-2 Reasoning☆1,329Jun 8, 2025Updated 8 months ago
- Extend OpenRLHF to support LMM RL training for reproduction of DeepSeek-R1 on multimodal tasks.☆841May 14, 2025Updated 9 months ago
- This repository contains the code and data to replicate the experimental results in the manuscript Heterogeneous Graph Tree Networks.☆11Nov 23, 2022Updated 3 years ago
- Combines InstantID🔥 and FouriScale to generate high-resolution images!☆11Apr 3, 2024Updated last year
- Ship remote sensing dataset☆12Jun 28, 2022Updated 3 years ago
- ☆10Mar 8, 2024Updated last year
- the datasets of our paper☆11Feb 26, 2024Updated 2 years ago
- Contains implementation of the DoubIL and ResiduIL algorithms from the ICML '22 paper Causal Imitation Learning under Temporally Correlat…☆11Dec 9, 2022Updated 3 years ago
- Official Implementation of MDK12-Bench: A Multi-Discipline Benchmark for Evaluating Reasoning in Multimodal Large Language Models☆12Nov 1, 2025Updated 4 months ago
- AdaptiveStep: Automatically Dividing Reasoning Step through Model Confidence☆10Mar 2, 2025Updated last year
- Awesome Entity Alignment is a collection of EA techniques, including papers, codes, and datasets.☆10Oct 27, 2022Updated 3 years ago
- Gesture Recognition Based on ALTERA DE2-115 FPGA☆10Mar 18, 2014Updated 11 years ago
- ☆10Oct 20, 2022Updated 3 years ago
- Pre-trained Wav2vec2.0 for Mandarin☆43Oct 30, 2022Updated 3 years ago
- ☆12Nov 19, 2024Updated last year
- Implementation of CoBERT: Self-Supervised Speech Representation Learning Through Code Representation Learning☆48Nov 8, 2023Updated 2 years ago
- Hohai University daily health check-in☆12Dec 4, 2021Updated 4 years ago
- [NeurIPS 2024] Efficiency for Free: Ideal Data Are Transportable Representations☆19Jan 19, 2025Updated last year
- R functions and datasets related to the mapping of text to the United Nations 17 Sustainable Development Goals (SDGs).☆12May 12, 2022Updated 3 years ago
- Chinese WWM masking and n-gram masking based on jieba☆11Jul 25, 2019Updated 6 years ago
- ☆13Sep 25, 2024Updated last year
- Cell2location paper - Comprehensive mapping of tissue cell architecture via integrated single cell and spatial transcriptomics☆15Nov 26, 2022Updated 3 years ago
- Urban Generative Intelligence (UGI): A Foundational Platform for Embodied Agent and Future City☆12Dec 17, 2023Updated 2 years ago