☆34Apr 1, 2025Updated last year
Alternatives and similar repositories for R-PRM
Users that are interested in R-PRM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A controlled benchmark on evaluating and studying the dynamics of Long Context Language Models☆26Oct 17, 2025Updated 7 months ago
- ☆47Jun 24, 2025Updated 11 months ago
- Agent-RRM: Exploring Reasoning Reward Model for Agents☆69Mar 17, 2026Updated 2 months ago
- ☆45Jan 30, 2026Updated 4 months ago
- ☆16Jul 23, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆179Jun 1, 2026Updated last week
- This Machine Learning project deals with Coupon Recommendations based on Revenue Uplift☆11May 4, 2021Updated 5 years ago
- Provides a minimal implementation to extract FLAN datasets for further processing☆11Feb 1, 2023Updated 3 years ago
- Udemy☆12Sep 5, 2018Updated 7 years ago
- official implementation of paper "Process Reward Model with Q-value Rankings"☆69Feb 5, 2025Updated last year
- ☆40Jan 23, 2024Updated 2 years ago
- A research project exploring fine-tuning BERT-style models for text generation☆41Nov 30, 2025Updated 6 months ago
- Code release for "CURIE: Evaluating LLMs On Multitask Scientific Long Context Understanding and Reasoning", ICLR 2025☆34Apr 21, 2025Updated last year
- ☆21Oct 31, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- AlphaGo Zero Reinforcement Learning Sokoban Solver☆11Jun 20, 2018Updated 7 years ago
- ☆29Mar 13, 2026Updated 2 months ago
- ☆15Sep 22, 2023Updated 2 years ago
- ☆13Sep 26, 2024Updated last year
- Code for "Demonstration-free Autonomous Reinforcement Learning via Implicit and Bidirectional Curriculum" (ICML 2023)☆10Jul 6, 2023Updated 2 years ago
- The implementation for ICLR2023 paper: "BEEF: Bi-Compatible Class-Incremental Learning via Energy-Based Expansion and Fusion" in PyTorch.☆18May 25, 2023Updated 3 years ago
- Official Implementation of "Learning to Refuse: Towards Mitigating Privacy Risks in LLMs"☆10Dec 13, 2024Updated last year
- ☆16Jul 17, 2025Updated 10 months ago
- ☆13Jan 5, 2025Updated last year
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- A Controllable Model of Grounded Response Generation (AAAI 21)☆13Oct 25, 2022Updated 3 years ago
- Code for the paper "A Mechanistic Interpretation of Arithmetic Reasoning in Language Models using Causal Mediation Analysis"☆20Jun 12, 2025Updated 11 months ago
- Crossmodal Translation based Meta Weight Adaption for Robust Image-Text Sentiment Analysis☆15May 16, 2024Updated 2 years ago
- Implementation of the Playground environment from the paper Language as a Cognitive Tool to Imagine Goals inCuriosity-Driven Exploration.☆11Mar 5, 2021Updated 5 years ago
- Code for Beyond Generic: Enhancing Image Captioning with Real-World Knowledge using Vision-Language Pre-Training Model☆13Feb 15, 2024Updated 2 years ago
- Repo for webbook and materials of the course "Understanding LLMs" @ Uni Tübingen☆33May 5, 2026Updated last month
- An Empirical Study of Memorization in NLP (ACL 2022)☆13Jun 22, 2022Updated 3 years ago
- Code for "On Long-Tailed Phenomena in NMT".☆10Jan 10, 2021Updated 5 years ago
- ☆70Feb 4, 2026Updated 4 months ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- ☆11Nov 28, 2022Updated 3 years ago
- Code for BYOP [CVPR 2023]☆11Sep 25, 2023Updated 2 years ago
- Noise of Web (NoW) is a challenging noisy correspondence learning (NCL) benchmark containing 100K image-text pairs for robust image-text …☆16Nov 20, 2025Updated 6 months ago
- Code for the "Long Context Needs Some R&R" paper.☆12Mar 11, 2024Updated 2 years ago
- ☆13Jan 14, 2020Updated 6 years ago
- [IJCAI 2021] Solving Continuous Control with Episodic Memory☆15Apr 10, 2022Updated 4 years ago
- This repository contains the replication of the iGSM dataset generation process from the Physics of LLM paper by Zeyuan Zhu.☆17Sep 13, 2024Updated last year