tinnerhrhe / ROVER
An official implementation of "Random Policy Valuation is Enough for LLM Reasoning with Verifiable Rewards"
☆36 · Oct 3, 2025 · Updated 4 months ago
Alternatives and similar repositories for ROVER
Users that are interested in ROVER are comparing it to the libraries listed below
- ROSA+: RWKV's ROSA implementation with fallback statistical predictor ☆32 · Oct 13, 2025 · Updated 4 months ago
- [ICLR2026] The first W4A4KV4 quantized + 50% sparse LLMs! ☆22 · Jan 26, 2026 · Updated 3 weeks ago
- minimal Energy-based transformer ☆43 · Dec 11, 2025 · Updated 2 months ago
- The official implementation of the paper "Pluggable Pruning with Contiguous Layer Distillation for Diffusion Transformers" ☆61 · Nov 21, 2025 · Updated 2 months ago
- Code for "Variational Reasoning for Language Models" ☆56 · Sep 29, 2025 · Updated 4 months ago
- ☆10 · Mar 1, 2024 · Updated last year
- [ICLR 2026] RPG: KL-Regularized Policy Gradient (https://arxiv.org/abs/2505.17508) ☆65 · Jan 27, 2026 · Updated 2 weeks ago
- VideoNSA: Native Sparse Attention Scales Video Understanding ☆78 · Nov 16, 2025 · Updated 3 months ago
- MiniMax-Provider-Verifier offers a rigorous, vendor-agnostic way to verify whether third-party deployments of the Minimax M2 model are co… ☆23 · Jan 15, 2026 · Updated last month
- P1: Mastering Physics Olympiads with Reinforcement Learning ☆73 · Dec 29, 2025 · Updated last month
- Repository for Sparse Finetuning of LLMs via modified version of the MosaicML llmfoundry ☆42 · Jan 15, 2024 · Updated 2 years ago
- A ratatui based vertical and horizontal slider. ☆35 · Jan 7, 2026 · Updated last month
- ☆29 · Jan 15, 2026 · Updated last month
- Language modeling with linear-cost context ☆116 · Sep 25, 2025 · Updated 4 months ago
- Code for "RADSeg: Unleashing Parameter and Compute Efficient Zero-Shot Open-Vocabulary Segmentation Using Agglomerative Models" ☆28 · Jan 27, 2026 · Updated 2 weeks ago
- ☆10 · Sep 4, 2025 · Updated 5 months ago
- Spectral Sphere Optimizer ☆96 · Jan 14, 2026 · Updated last month
- ☆14 · Mar 5, 2024 · Updated last year
- Conditional DDPM for characterizing radio sources from dirty images. (autumn 2023) ☆11 · Nov 30, 2023 · Updated 2 years ago
- The official code for "Advancing Multimodal Large Language Models with Quantization-Aware Scale Learning for Efficient Adaptation" | [MM2… ☆14 · Dec 7, 2024 · Updated last year
- Official code for "Algorithmic Capabilities of Random Transformers" (NeurIPS 2024) ☆16 · Sep 28, 2024 · Updated last year
- ☆39 · Jan 16, 2026 · Updated last month
- ☆13 · Jun 22, 2025 · Updated 7 months ago
- ☆10 · Feb 12, 2024 · Updated 2 years ago
- Enemies for your LLM ☆35 · Jan 20, 2026 · Updated 3 weeks ago
- Internal utility libraries for Pkl ☆15 · Feb 4, 2026 · Updated last week
- Tiny evaluation of leading LLMs on competitive programming problems ☆14 · Nov 28, 2024 · Updated last year
- Official code repository for the paper titled "Efficient Molecular Conformer Generation with SO(3) Averaged Flow-Matching and Reflow" (IC… ☆13 · Jan 8, 2026 · Updated last month
- Scripts for training Qwen 2.5 VL with ms-swift and GRPO ☆12 · Feb 27, 2025 · Updated 11 months ago
- ☆12 · Aug 21, 2024 · Updated last year
- ☆13 · Jun 25, 2025 · Updated 7 months ago
- A simple, generic, and flexible keyframe animation library for Rust. ☆30 · Dec 30, 2025 · Updated last month
- decontamination ☆24 · Dec 3, 2025 · Updated 2 months ago
- Official implementation of ICML'24 paper "LQER: Low-Rank Quantization Error Reconstruction for LLMs" ☆19 · Jul 11, 2024 · Updated last year
- ☆11 · Nov 30, 2023 · Updated 2 years ago
- ☆10 · Jun 14, 2024 · Updated last year
- ☆14 · May 21, 2024 · Updated last year
- Fuzzing solmate with medusa ☆10 · Aug 14, 2023 · Updated 2 years ago
- Quora Paraphrasing Dataset Bahasa Indonesia Version ☆11 · Apr 18, 2021 · Updated 4 years ago