☆165May 28, 2025Updated 11 months ago
Alternatives and similar repositories for awesome-reward-models
Users that are interested in awesome-reward-models are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆34Oct 4, 2025Updated 7 months ago
- The official implement of paper 《DaMo: Data Mixing Optimizer in Fine-tuning Multimodal LLMs for Mobile Phone Agents》☆30Oct 23, 2025Updated 6 months ago
- TARS: MinMax Token-Adaptive Preference Strategy for Hallucination Reduction in MLLMs☆24Sep 21, 2025Updated 7 months ago
- 😎 Awesome papers on token redundancy reduction☆11Mar 12, 2025Updated last year
- [ACL 2026 Main] Training, inference, and testing of the SAC speech codec model.☆101Nov 1, 2025Updated 6 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆68Feb 4, 2026Updated 3 months ago
- Symmetrical Visual Contrastive Optimization: Aligning Vision-Language Models with Minimal Contrastive Images☆19Jun 4, 2025Updated 11 months ago
- ☆62Jun 17, 2024Updated last year
- ☆18Mar 2, 2026Updated 2 months ago
- ☆37Jun 18, 2025Updated 10 months ago
- Collection of papers for scalable automated alignment.☆93Oct 22, 2024Updated last year
- Official Repo of SimTeG☆43Mar 29, 2024Updated 2 years ago
- A Survey of Reinforcement Learning for Large Reasoning Models☆2,449Nov 9, 2025Updated 5 months ago
- The official repository of our paper "Reinforcing Video Reasoning with Focused Thinking"☆35Jun 12, 2025Updated 10 months ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- TopViewRS: Vision-Language Models as Top-View Spatial Reasoners (EMNLP 2024 Oral)☆15Jun 14, 2025Updated 10 months ago
- ☆13Apr 13, 2026Updated 3 weeks ago
- [ICLR 2026] Learning to Parallel: Accelerating Diffusion Large Language Models via Learnable Parallel Decoding☆32Jan 27, 2026Updated 3 months ago
- PiFlow: Principle-aware Scientific Discovery with Multi-Agent Collaboration☆43Jan 7, 2026Updated 3 months ago
- [ICLR 25 Oral] RM-Bench: Benchmarking Reward Models of Language Models with Subtlety and Style☆82Jul 18, 2025Updated 9 months ago
- About Code Release for "On the Embedding Collapse When Scaling Up Recommendation Models" (ICML 2024)☆29Aug 4, 2024Updated last year
- Explore how to get a VQ-VAE models efficiently!☆70Jul 24, 2025Updated 9 months ago
- ☆36May 24, 2024Updated last year
- ☆346May 24, 2025Updated 11 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆27Mar 4, 2025Updated last year
- FS-DFM: Fast and Accurate Long Text Generation with Few-Step Diffusion Language Models. FS-DFM accepted for ICLR 2026☆41Jan 6, 2026Updated 3 months ago
- Source code for NeurIPS 2022 paper "Uncovering the Structural Fairness in Graph Contrastive Learning"☆29Oct 25, 2022Updated 3 years ago
- ☆28Jul 11, 2024Updated last year
- Prompting Large Language Models with Audio for General-Purpose Speech Summarization☆20May 14, 2025Updated 11 months ago
- ☆26Sep 10, 2025Updated 7 months ago
- [TMLR] Process Reward Models That Think☆87Nov 29, 2025Updated 5 months ago
- [ICLR'26] RM-R1: Unleashing the Reasoning Potential of Reward Models☆163Jun 26, 2025Updated 10 months ago
- ☆28May 13, 2025Updated 11 months ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- [ICLR 2025] SuperCorrect: Advancing Small LLM Reasoning with Thought Template Distillation and Self-Correction☆89Mar 23, 2025Updated last year
- ☆24May 23, 2025Updated 11 months ago
- [ACL 2025 Findings] Text2World: Benchmarking Large Language Models for Symbolic World Model Generation☆29Feb 25, 2025Updated last year
- Official Implementation of Flash-Searcher: Fast and Effective Web Agents via DAG-Based Parallel Execution☆79Dec 8, 2025Updated 4 months ago
- The official implementation of InfoRM [NeurIPS 2024].☆15Oct 25, 2025Updated 6 months ago
- Source code of our EMNLP 2024 paper "FactAlign: Long-form Factuality Alignment of Large Language Models"☆19Oct 3, 2024Updated last year
- official implementation of MGA-CLAP (ACM MM 2024)☆31Oct 25, 2024Updated last year