Codebase for Inference-Time Policy Adapters
☆25Nov 3, 2023Updated 2 years ago
Alternatives and similar repositories for IPA
Users that are interested in IPA are comparing it to the libraries listed below
Sorting:
- ☆12Jul 25, 2023Updated 2 years ago
- A framework to train language models to learn invariant representations.☆14Jan 24, 2022Updated 4 years ago
- Code for Paper (Policy Optimization in RLHF: The Impact of Out-of-preference Data)☆28Dec 19, 2023Updated 2 years ago
- Experiment for Understanding the Effects of Dataset Characteristics on Offline Reinforcement Learning☆26Jan 16, 2023Updated 3 years ago
- Reference implementation for Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model☆45Oct 1, 2025Updated 5 months ago
- [ACL 25] SafeChain: Safety of Language Models with Long Chain-of-Thought Reasoning Capabilities☆28Apr 2, 2025Updated 11 months ago
- Self-Supervised Alignment with Mutual Information☆20May 24, 2024Updated last year
- [ICML2023] Revisiting Data-Free Knowledge Distillation with Poisoned Teachers☆23Jul 7, 2024Updated last year
- Lightweight Adapting for Black-Box Large Language Models☆25Feb 15, 2024Updated 2 years ago
- Source code for the TMLR paper "Black-Box Prompt Learning for Pre-trained Language Models"☆57Sep 7, 2023Updated 2 years ago
- Code for ACL 2023 paper "BOLT: Fast Energy-based Controlled Text Generation with Tunable Biases".☆22Sep 7, 2023Updated 2 years ago
- ☆24Aug 18, 2023Updated 2 years ago
- ☆30May 22, 2024Updated last year
- Source code of "Reasons to Reject? Aligning Language Models with Judgments"☆58Feb 29, 2024Updated 2 years ago
- ☆24Apr 29, 2022Updated 3 years ago
- The official implementation of Self-Exploring Language Models (SELM)☆63Jun 4, 2024Updated last year
- Restore safety in fine-tuned language models through task arithmetic☆32Mar 28, 2024Updated last year
- An implementation of the Residual Flow algorithm for out-of-distribution detection.☆31Apr 29, 2022Updated 3 years ago
- Intrinsic Motivation from Artificial Intelligence Feedback☆134Nov 7, 2023Updated 2 years ago
- EOSIO-Taurus - The Most Powerful Infrastructure for Decentralized Applications☆13Mar 29, 2024Updated last year
- ☆14Updated this week
- ☆75Nov 3, 2023Updated 2 years ago
- Official implementation of ICLR'24 paper, "Curiosity-driven Red Teaming for Large Language Models" (https://openreview.net/pdf?id=4KqkizX…☆88Mar 15, 2024Updated last year
- ☆83Mar 24, 2023Updated 2 years ago
- ☆52Oct 23, 2023Updated 2 years ago
- Code for the paper "Semi-Conditional Normalizing Flows for Semi-Supervised Learning"☆11Mar 30, 2020Updated 5 years ago
- ☆10Oct 11, 2022Updated 3 years ago
- NeurIPS 2023 paper: De novo Drug Design using Reinforcement Learning with Multiple GPT Agents☆38Mar 27, 2024Updated last year
- Introduction to Machine Learning using scikit-learn and PyTorch☆10Sep 26, 2019Updated 6 years ago
- Identification of the Adversary from a Single Adversarial Example (ICML 2023)☆10Jul 15, 2024Updated last year
- Firefox and Chrome compatible extension that acts as annotation tool for websites (Named Entity Recognition)☆10Feb 17, 2019Updated 7 years ago
- PyTorch Implementation for the paper "Let Me Help You! Neuro-Symbolic Short-Context Action Anticipation" accepted to RA-L'24.☆12Nov 27, 2024Updated last year
- https://icml.cc/virtual/2023/poster/24354☆10Aug 15, 2023Updated 2 years ago
- CoCoFL: Communication- and Computation-Aware Federated Learning via Partial NN Freezing and Quantization☆13Aug 3, 2024Updated last year
- ☆13Oct 11, 2024Updated last year
- ☆11Jun 18, 2023Updated 2 years ago
- ☆38Apr 17, 2024Updated last year
- ☆10May 28, 2024Updated last year
- Source code for paper: Knowledge Inheritance for Pre-trained Language Models☆38Apr 24, 2022Updated 3 years ago