Lauorie / DFTLinks
Reproduced the DFT method without using Verl. https://arxiv.org/abs/2508.05629
☆16Updated last week
Alternatives and similar repositories for DFT
Users that are interested in DFT are comparing it to the libraries listed below
Sorting:
- ☆29Updated last year
- Exploring Efficient Fine-Grained Perception of Multimodal Large Language Models☆63Updated 11 months ago
- ☆74Updated last year
- [ICML2025] The official implementation of "C-3PO: Compact Plug-and-Play Proxy Optimization to Achieve Human-like Retrieval-Augmented Gene…☆40Updated 5 months ago
- A Framework for Decoupling and Assessing the Capabilities of VLMs☆43Updated last year
- ☆79Updated last year
- ☆147Updated last week
- The code and data of We-Math, accepted by ACL 2025 main conference.☆133Updated last month
- 最简易的R1结果在小模型上的复现,阐述类O1与DeepSeek R1最重要的本质。Think is all your need。利用实验佐证,对于强推理能力,think思考过程性内容是AGI/ASI的核心。☆44Updated 8 months ago
- Vision Search Assistant: Empower Vision-Language Models as Multimodal Search Engines☆126Updated 11 months ago
- The official repository of the dots.vlm1 instruct models proposed by rednote-hilab.☆260Updated 3 weeks ago
- A Simple MLLM Surpassed QwenVL-Max with OpenSource Data Only in 14B LLM.☆38Updated last year
- Copy the MLP of llama3 8 times as 8 experts , created a router with random initialization,add load balancing loss to construct an 8x8b Mo…☆27Updated last year
- Our 2nd-gen LMM☆34Updated last year
- ☆49Updated last year
- ☆296Updated 4 months ago
- [ICML 2025] |TokenSwift: Lossless Acceleration of Ultra Long Sequence Generation☆114Updated 5 months ago
- Web2Code: A Large-scale Webpage-to-Code Dataset and Evaluation Framework for Multimodal LLMs☆91Updated 11 months ago
- ☆175Updated 8 months ago
- The code and data of We-Math 2.0.☆156Updated last month
- The official repo of One RL to See Them All: Visual Triple Unified Reinforcement Learning☆318Updated 4 months ago
- ☆90Updated last year
- Unleashing the Power of Cognitive Dynamics on Large Language Models☆63Updated last year
- (ICLR'25) A Comprehensive Framework for Developing and Evaluating Multimodal Role-Playing Agents☆86Updated 8 months ago
- Official code of *Virgo: A Preliminary Exploration on Reproducing o1-like MLLM*☆109Updated 4 months ago
- [EMNLP'25] Code for paper "MT-R1-Zero: Advancing LLM-based Machine Translation via R1-Zero-like Reinforcement Learning"☆60Updated 6 months ago
- ☆50Updated 3 months ago
- The huggingface implementation of Fine-grained Late-interaction Multi-modal Retriever.☆98Updated 4 months ago
- Offical Repository of "AtomThink: Multimodal Slow Thinking with Atomic Step Reasoning"☆56Updated 2 months ago
- Baichuan-Omni: Towards Capable Open-source Omni-modal LLM 🌊☆269Updated 8 months ago