JacksonCakes / vision-r1
☆12 Updated 7 months ago
Alternatives and similar repositories for vision-r1
Users who are interested in vision-r1 are comparing it to the libraries listed below.
- CycleQD is a framework for parameter space model merging. ☆44 Updated 9 months ago
- Train, tune, and infer Bamba model ☆135 Updated 4 months ago
- KV Cache Steering for Inducing Reasoning in Small Language Models ☆41 Updated 3 months ago
- A repository for research on medium sized language models. ☆78 Updated last year
- ☆78 Updated 2 months ago
- Verifiers for LLM Reinforcement Learning ☆77 Updated 6 months ago
- Official repository for "BLEUBERI: BLEU is a surprisingly effective reward for instruction following" ☆28 Updated 4 months ago
- ☆57 Updated last month
- A testbed for agents and environments that can automatically improve models through data generation. ☆27 Updated 7 months ago
- Lottery Ticket Adaptation ☆40 Updated 11 months ago
- [ACL 2025] Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems ☆108 Updated 4 months ago
- Matrix (Multi-Agent daTa geneRation Infra and eXperimentation framework) is a versatile engine for multi-agent conversational data genera… ☆99 Updated last week
- [ACL 2024] Do Large Language Models Latently Perform Multi-Hop Reasoning? ☆79 Updated 7 months ago
- Learning to Retrieve by Trying - Source code for Grounding by Trying: LLMs with Reinforcement Learning-Enhanced Retrieval ☆51 Updated last year
- ☆55 Updated 11 months ago
- SLED: Self Logits Evolution Decoding for Improving Factuality in Large Language Model https://arxiv.org/pdf/2411.02433 ☆108 Updated 10 months ago
- OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding. ☆172 Updated 9 months ago
- PyTorch implementation of models from the Zamba2 series. ☆185 Updated 9 months ago
- ☆201 Updated 10 months ago
- Source code for the collaborative reasoner research project at Meta FAIR. ☆103 Updated 6 months ago
- ☆53 Updated 8 months ago
- ☆24 Updated 7 months ago
- ☆85 Updated 4 months ago
- accompanying material for sleep-time compute paper ☆117 Updated 6 months ago
- ☆14 Updated last year
- ☆80 Updated 2 weeks ago
- ☆85 Updated last week
- ☆93 Updated 4 months ago
- Official repo of paper LM2 ☆47 Updated 8 months ago
- Code and data for the paper "Why think step by step? Reasoning emerges from the locality of experience" ☆62 Updated 6 months ago