☆34Apr 1, 2025Updated last year
Alternatives and similar repositories for R-PRM
Users that are interested in R-PRM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆46May 27, 2025Updated 11 months ago
- ☆47Jun 24, 2025Updated 10 months ago
- Agent-RRM: Exploring Reasoning Reward Model for Agents☆64Mar 17, 2026Updated 2 months ago
- ☆16Jul 23, 2024Updated last year
- Provides a minimal implementation to extract FLAN datasets for further processing☆11Feb 1, 2023Updated 3 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- A first bare bones paralleled implementation of Go Explore as described by the Uber Engineering blog post☆46Jan 25, 2019Updated 7 years ago
- official implementation of paper "Process Reward Model with Q-value Rankings"☆68Feb 5, 2025Updated last year
- ☆40Jan 23, 2024Updated 2 years ago
- A research project exploring fine-tuning BERT-style models for text generation☆40Nov 30, 2025Updated 5 months ago
- Code release for "CURIE: Evaluating LLMs On Multitask Scientific Long Context Understanding and Reasoning", ICLR 2025☆34Apr 21, 2025Updated last year
- ☆21Oct 31, 2024Updated last year
- ☆29Mar 13, 2026Updated 2 months ago
- ☆15Sep 22, 2023Updated 2 years ago
- The CompCert formally-verified C compiler☆11May 14, 2026Updated last week
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆19Jan 2, 2025Updated last year
- Code for "Demonstration-free Autonomous Reinforcement Learning via Implicit and Bidirectional Curriculum" (ICML 2023)☆10Jul 6, 2023Updated 2 years ago
- The implementation for ICLR2023 paper: "BEEF: Bi-Compatible Class-Incremental Learning via Energy-Based Expansion and Fusion" in PyTorch.☆18May 25, 2023Updated 2 years ago
- Official Implementation of "Learning to Refuse: Towards Mitigating Privacy Risks in LLMs"☆10Dec 13, 2024Updated last year
- ☆16Jul 17, 2025Updated 10 months ago
- Code tasks for NJU TSA (Time Series Analysis) course☆13Jan 24, 2024Updated 2 years ago
- ☆13Jan 5, 2025Updated last year
- A Controllable Model of Grounded Response Generation (AAAI 21)☆13Oct 25, 2022Updated 3 years ago
- ☆30Dec 19, 2025Updated 5 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Code for the paper "A Mechanistic Interpretation of Arithmetic Reasoning in Language Models using Causal Mediation Analysis"☆20Jun 12, 2025Updated 11 months ago
- Crossmodal Translation based Meta Weight Adaption for Robust Image-Text Sentiment Analysis☆15May 16, 2024Updated 2 years ago
- [ACL 2023] Contextual Distortion Reveals Constituency: Mask Language Models are Implicit Parsers.☆14Jun 3, 2023Updated 2 years ago
- Official codebase for NeurIPS 2022 paper End-to-end Learning to Index and Search in Large Output Spaces☆12Apr 19, 2023Updated 3 years ago
- Code for Beyond Generic: Enhancing Image Captioning with Real-World Knowledge using Vision-Language Pre-Training Model☆13Feb 15, 2024Updated 2 years ago
- An Empirical Study of Memorization in NLP (ACL 2022)☆13Jun 22, 2022Updated 3 years ago
- Code for "On Long-Tailed Phenomena in NMT".☆10Jan 10, 2021Updated 5 years ago
- Official Implementation for "Editing Massive Concepts in Text-to-Image Diffusion Models"☆19Mar 21, 2024Updated 2 years ago
- ☆55Jan 30, 2026Updated 3 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Noise of Web (NoW) is a challenging noisy correspondence learning (NCL) benchmark containing 100K image-text pairs for robust image-text …☆16Nov 20, 2025Updated 6 months ago
- Code for the "Long Context Needs Some R&R" paper.☆12Mar 11, 2024Updated 2 years ago
- Model-Free-Episodic-Control implementation.☆17Jun 3, 2019Updated 6 years ago
- Official Repo for "Why Settle for One? Text-to-ImageSet Generation and Evaluation"☆21Oct 1, 2025Updated 7 months ago
- A Benchmark for Multi-Stage Legal Case Documents Generation☆18Feb 24, 2025Updated last year
- [IJCAI 2021] Solving Continuous Control with Episodic Memory☆15Apr 10, 2022Updated 4 years ago
- This repository contains the replication of the iGSM dataset generation process from the Physics of LLM paper by Zeyuan Zhu.☆17Sep 13, 2024Updated last year