An official implementation of Random Policy Valuation is Enough for LLM Reasoning with Verifiable Rewards
☆37Oct 3, 2025Updated 5 months ago
Alternatives and similar repositories for ROVER
Users that are interested in ROVER are comparing it to the libraries listed below
Sorting:
- ROSA+: RWKV's ROSA implementation with fallback statistical predictor☆34Oct 13, 2025Updated 4 months ago
- [ICLR2026] The first W4A4KV4 quantized + 50% sparse LLMs!☆24Jan 26, 2026Updated last month
- minimal Energy-based transformer☆43Dec 11, 2025Updated 2 months ago
- The official implement of paper: Pluggable Pruning with Contiguous Layer Distillation for Diffusion Transformers☆63Nov 21, 2025Updated 3 months ago
- Code for "Variational Reasoning for Language Models"☆56Sep 29, 2025Updated 5 months ago
- ☆10Mar 1, 2024Updated 2 years ago
- [ICLR 2026] RPG: KL-Regularized Policy Gradient (https://arxiv.org/abs/2505.17508)☆65Feb 19, 2026Updated 2 weeks ago
- MiniMax-Provider-Verifier offers a rigorous, vendor-agnostic way to verify whether third-party deployments of the Minimax M2 model are co…☆29Feb 18, 2026Updated 2 weeks ago
- 开放信号聚合ensemble框架。☆29Feb 11, 2026Updated 3 weeks ago
- Repository for Sparse Finetuning of LLMs via modified version of the MosaicML llmfoundry☆42Jan 15, 2024Updated 2 years ago
- P1: Mastering Physics Olympiads with Reinforcement Learning☆76Dec 29, 2025Updated 2 months ago
- A framework aiming to bridge fast robot prototyping, predefined motion primitives, heterogeneous teleoperation, data collection, and flex…☆22Updated this week
- Enemies for your LLM☆35Jan 20, 2026Updated last month
- ☆10Sep 4, 2025Updated 6 months ago
- VideoNSA: Native Sparse Attention Scales Video Understanding☆81Nov 16, 2025Updated 3 months ago
- ☆29Jan 15, 2026Updated last month
- Language modeling with linear-cost context☆115Sep 25, 2025Updated 5 months ago
- ☆10Jun 14, 2024Updated last year
- 为 RWKV 设计的「Deep Think」实现。☆25Dec 7, 2025Updated 3 months ago
- ☆13Jun 25, 2025Updated 8 months ago
- Conditional DDPM for characterizing radio sources from dirty images. (autumn 2023)☆11Nov 30, 2023Updated 2 years ago
- ☆16Jan 29, 2026Updated last month
- Quora Paraphrasing Dataset Bahasa Indonesia Version☆11Apr 18, 2021Updated 4 years ago
- [CVPR'26 Findings] Source code for "RADSeg Unleashing Parameter and Compute Efficient Zero-Shot Open-Vocabulary Segmentation Using Agglom…☆33Feb 24, 2026Updated last week
- Tiny evaluation of leading LLMs on competitive programming problems☆14Nov 28, 2024Updated last year
- Code of the paper "Synthesizing Aspect-Driven Recommendation Explanations from Reviews", IJCAI'20☆10Apr 5, 2024Updated last year
- Official implementation of ICML'24 paper "LQER: Low-Rank Quantization Error Reconstruction for LLMs"☆19Jul 11, 2024Updated last year
- A simple, generic, and flexible keyframe animation library for Rust.☆30Dec 30, 2025Updated 2 months ago
- My collection of dotfiles☆15Feb 16, 2026Updated 2 weeks ago
- Pytorch implementation of WGAN with gradient penalty (WGAN-GP),☆12Feb 7, 2022Updated 4 years ago
- The official code for "Advancing Multimodal Large Language Models with Quantization-Aware Scale Learning for Efficient Adaptation" | [MM2…☆14Dec 7, 2024Updated last year
- Scripts for training Qwen 2.5 VL with ms-swift and GRPO☆12Feb 27, 2025Updated last year
- ☆10Feb 12, 2024Updated 2 years ago
- A ratatui based vertical and horizontal slider.☆37Feb 26, 2026Updated last week
- ☆40Jan 16, 2026Updated last month
- ☆14Mar 5, 2024Updated 2 years ago
- Official code for "Algorithmic Capabilities of Random Transformers" (NeurIPS 2024)☆16Sep 28, 2024Updated last year
- ☆14May 21, 2024Updated last year
- Internal utility libraries for Pkl☆15Updated this week