JacksonCakes / vision-r1
☆12 Updated 6 months ago
Alternatives and similar repositories for vision-r1
Users that are interested in vision-r1 are comparing it to the libraries listed below
- CycleQD is a framework for parameter space model merging. ☆44 Updated 8 months ago
- ☆77 Updated last month
- A repository for research on medium sized language models. ☆78 Updated last year
- KV Cache Steering for Inducing Reasoning in Small Language Models ☆40 Updated 2 months ago
- Train, tune, and infer Bamba model ☆133 Updated 4 months ago
- ☆55 Updated 11 months ago
- SLED: Self Logits Evolution Decoding for Improving Factuality in Large Language Model https://arxiv.org/pdf/2411.02433 ☆104 Updated 10 months ago
- ☆78 Updated 3 weeks ago
- Verifiers for LLM Reinforcement Learning ☆74 Updated 5 months ago
- Learning to Retrieve by Trying - Source code for Grounding by Trying: LLMs with Reinforcement Learning-Enhanced Retrieval ☆51 Updated 11 months ago
- Lottery Ticket Adaptation ☆40 Updated 10 months ago
- Official repository for "BLEUBERI: BLEU is a surprisingly effective reward for instruction following" ☆27 Updated 4 months ago
- A testbed for agents and environments that can automatically improve models through data generation. ☆27 Updated 7 months ago
- An AI benchmark for creative, human-like problem solving using Sudoku variants ☆102 Updated 2 months ago
- OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding. ☆172 Updated 8 months ago
- ☆85 Updated last year
- Code and pretrained models for the paper: "MatMamba: A Matryoshka State Space Model" ☆61 Updated 10 months ago
- Plug in & Play Pytorch Implementation of the paper: "Evolutionary Optimization of Model Merging Recipes" by Sakana AI ☆29 Updated 11 months ago
- ☆22 Updated 2 months ago
- ☆57 Updated last week
- ☆35 Updated 4 months ago
- ☆14 Updated last year
- ☆67 Updated 6 months ago
- This is the official repository for Inheritune. ☆115 Updated 8 months ago
- A simplified implementation for experimenting with RLVR on GSM8K, This repository provides a starting point for exploring reasoning. ☆129 Updated 8 months ago
- Synthetic data generation and benchmark implementation for "Episodic Memories Generation and Evaluation Benchmark for Large Language Mode… ☆55 Updated last week
- Simple repository for training small reasoning models ☆40 Updated 8 months ago
- ☆199 Updated 10 months ago
- PyTorch implementation of models from the Zamba2 series. ☆185 Updated 8 months ago
- Official repo of paper LM2 ☆45 Updated 7 months ago