JacksonCakes / vision-r1
☆12 Updated 9 months ago
Alternatives and similar repositories for vision-r1
Users who are interested in vision-r1 are comparing it to the repositories listed below.
- CycleQD is a framework for parameter space model merging. ☆46 Updated 11 months ago
- SLED: Self Logits Evolution Decoding for Improving Factuality in Large Language Model https://arxiv.org/pdf/2411.02433 ☆115 Updated last year
- Official repository for "BLEUBERI: BLEU is a surprisingly effective reward for instruction following" ☆31 Updated 7 months ago
- Train, tune, and infer Bamba model ☆137 Updated 7 months ago
- GoldFinch and other hybrid transformer components ☆45 Updated last year
- ☆21 Updated 5 months ago
- Simple repository for training small reasoning models ☆47 Updated 11 months ago
- Code and pretrained models for the paper: "MatMamba: A Matryoshka State Space Model" ☆62 Updated last year
- OpenCoconut implements a latent reasoning paradigm where we generate thoughts before decoding. ☆174 Updated 11 months ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment ☆62 Updated last year
- ☆24 Updated 9 months ago
- ☆82 Updated last month
- Lottery Ticket Adaptation ☆40 Updated last year
- ☆91 Updated last year
- ☆55 Updated last year
- THOUGHTSCULPT, a general reasoning and search method for complex tasks ☆13 Updated last year
- A repository for research on medium sized language models. ☆77 Updated last year
- This is the repo for the paper "PANGEA: A FULLY OPEN MULTILINGUAL MULTIMODAL LLM FOR 39 LANGUAGES" ☆117 Updated 6 months ago
- A testbed for agents and environments that can automatically improve models through data generation. ☆27 Updated 10 months ago
- Learning to Retrieve by Trying - Source code for Grounding by Trying: LLMs with Reinforcement Learning-Enhanced Retrieval ☆51 Updated last year
- Repository for the Q-Filters method (https://arxiv.org/pdf/2503.02812) ☆35 Updated 10 months ago
- Verifiers for LLM Reinforcement Learning ☆80 Updated 8 months ago
- Model Merging with Functional Dual Anchors ☆44 Updated last month
- ☆48 Updated last year
- Official Project Page for "Web World Models" (https://arxiv.org/abs/2512.23676) ☆47 Updated last week
- PyTorch implementation of models from the Zamba2 series. ☆186 Updated 11 months ago
- ☆58 Updated 10 months ago
- [ACL 2025] Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems ☆119 Updated 6 months ago
- ☆35 Updated 7 months ago
- Official PyTorch Implementation for Vision-Language Models Create Cross-Modal Task Representations, ICML 2025 ☆31 Updated 8 months ago