☆24Feb 18, 2026Updated last week
Alternatives and similar repositories for rl-rewardhacking
Users that are interested in rl-rewardhacking are comparing it to the libraries listed below
Sorting:
- ☆13Aug 9, 2023Updated 2 years ago
- Analyze AI agent trajectories: extract actions, summarize, embed, and visualize.☆93Feb 20, 2026Updated last week
- This was designed for interp researchers who want to do research on or with interp agents to give quality of life improvements and fix …☆127Feb 8, 2026Updated 3 weeks ago
- Plan✕ is a platform for creating and publishing digital planning services☆17Updated this week
- A toolbox with the goal of speeding up research on bargaining in MARL (cooperation problems in MARL).☆32Sep 29, 2022Updated 3 years ago
- This repository contains the Parasol processor, which enables next-generation privacy preserving applications. Users can run arbitrary co…☆11Updated this week
- Direct transcription of an optimal control problem and resolution☆12Updated this week
- Machine Learning for Mathematical Formalization☆11Jul 20, 2024Updated last year
- [ICLR 2024] This is the official implementation for the paper: "Beyond imitation: Leveraging fine-grained quality signals for alignment"☆10May 5, 2024Updated last year
- Python platform for parallel Surrogate-Based Optimization☆12Nov 27, 2024Updated last year
- EmotionCircuits-LLM: A complete, reproducible framework for discovering and controlling emotion circuits in large language models.☆25Oct 20, 2025Updated 4 months ago
- Code for experiments on self-prediction as a way to measure introspection in LLMs☆16Dec 10, 2024Updated last year
- [KDD'23] This is the code repo for our KDD'23 paper "DyGen: Learning from Noisy Labels via Dynamics-Enhanced Generative Modeling".☆11Jun 14, 2023Updated 2 years ago
- ☆15Sep 7, 2025Updated 5 months ago
- LLM plugin to generate plugins for LLM☆13Dec 30, 2024Updated last year
- A model-based API Fuzzer for SMT Solvers.☆15Oct 14, 2025Updated 4 months ago
- Experiments with reasoning models, training techniques, papers☆24Feb 24, 2026Updated last week
- Manual Baseline Models☆10Nov 7, 2024Updated last year
- Find bottlenecks in your test suites☆17Feb 23, 2026Updated last week
- Website for the Research Data Management Librarian Academy☆18Feb 3, 2026Updated last month
- Use GPT-3 to generate competitive programming ideas.☆11Feb 29, 2024Updated 2 years ago
- ACE (Adaptive Code Evolution) is an AI-powered system for code analysis and optimization.☆12Nov 4, 2025Updated 3 months ago
- Analyzes whole genome sequencing data for gene-editing verification☆10Feb 6, 2026Updated 3 weeks ago
- ☆18May 3, 2025Updated 10 months ago
- A package to set up, run, and store simulation campaigns in PhysiCell.☆13Feb 20, 2026Updated last week
- ☆16Jan 29, 2026Updated last month
- Driver for coupled AMR-Wind/Nalu-Wind simulations☆13Nov 10, 2025Updated 3 months ago
- [ICML2025] Official codebase for "TeLoGraF: Temporal Logic Planning via Graph-encoded Flow Matching"☆19Jul 14, 2025Updated 7 months ago
- Relational Features for Planning☆14Feb 18, 2026Updated 2 weeks ago
- ☆40Updated this week
- [ICLR 2025] FLAT: LLM Unlearning via Loss Adjustment with Only Forget Data☆14Feb 26, 2025Updated last year
- Public code release for the paper "Reawakening knowledge: Anticipatory recovery from catastrophic interference via structured training"☆11Oct 27, 2025Updated 4 months ago
- Given a Substack newsletter, save the contents into an sqlite db and format it as an epub☆13Jan 11, 2024Updated 2 years ago
- Transform messy HTML from Google Docs into well-structured HTML!☆14Jul 10, 2025Updated 7 months ago
- ☆15Jun 30, 2025Updated 8 months ago
- ☆14Dec 12, 2023Updated 2 years ago
- ☆13Apr 7, 2024Updated last year
- ☆10Nov 6, 2024Updated last year
- ehrQL: the electronic health record query language for OpenSAFELY☆10Updated this week