XanderJC/attention-based-credit

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/XanderJC/attention-based-credit)

XanderJC / attention-based-credit

Code for the paper: Dense Reward for Free in Reinforcement Learning from Human Feedback (ICML 2024) by Alex J. Chan, Hao Sun, Samuel Holt, and Mihaela van der Schaar

☆38

Alternatives and similar repositories for attention-based-credit

Users that are interested in attention-based-credit are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

EsYoon7 / RLHF-TLCR
View on GitHub
[ACL'24 Findings] Official code for "TLCR: Token-Level Continuous Reward for Fine-grained Reinforcement Learning from Human Feedback"
☆12Dec 6, 2024Updated last year
morning9393 / ETPO
View on GitHub
☆14Mar 5, 2024Updated 2 years ago
tengxiao1 / SimPER
View on GitHub
SimPER: A Minimalist Approach to Preference Alignment without Hyperparameters (ICLR 2025)
☆17Aug 22, 2025Updated 11 months ago
cordercorder / knn-models
View on GitHub
A retrieval augmented sequence modeling toolkit implemented based on Fairseq
☆29Mar 3, 2023Updated 3 years ago
ShivankUdayawal / Regression-on-Car-Insurance-Dataset
View on GitHub
Predicting for Customers, whether they will buy car insurance or not.
☆11Jan 29, 2021Updated 5 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
U-AIM-SW-STARLab / StarLab-Dialogue-System
View on GitHub
비디오 기반 인공지능 대화시스템
☆14Dec 23, 2023Updated 2 years ago
hee-suk-yoon / C-TPT
View on GitHub
[ICLR'24] Official code for "C-TPT: Calibrated Test-Time Prompt Tuning for Vision-Language Models via Text Feature Dispersion"
☆23Jun 9, 2024Updated 2 years ago
ashshaksharifdeen / O-TPT
View on GitHub
CVPR'25 official code for O-TPT: Orthogonality Constraints for Calibrating Test-time Prompt Tuning in Vision-Language Models
☆16Sep 19, 2025Updated 10 months ago
Ryan-Rhys / Mrk_335
View on GitHub
Modelling the Multiwavelength Variability of Mrk-335 using Gaussian processes
☆12May 30, 2022Updated 4 years ago
counterfactual-ml / kdd2022-tutorial
View on GitHub
Counterfactual Evaluation and Learning for Interactive Systems: Foundations, Implementations, and Recent Advances
☆12Aug 14, 2022Updated 3 years ago
Vance0124 / Token-level-Direct-Preference-Optimization
View on GitHub
Reference implementation for Token-level Direct Preference Optimization(TDPO)
☆156Feb 14, 2025Updated last year
liziniu / cold_start_rl
View on GitHub
Code for Blog Post: Can Better Cold-Start Strategies Improve RL Training for LLMs?
☆20Mar 9, 2025Updated last year
RUCAIBox / FIGA
View on GitHub
[ICLR 2024] This is the official implementation for the paper: "Beyond imitation: Leveraging fine-grained quality signals for alignment"
☆10May 5, 2024Updated 2 years ago
clinicalml / gumbel-max-scm
View on GitHub
Code for "Counterfactual Off-Policy Evaluation with Gumbel-Max Structural Causal Models" (ICML 2019)
☆48Sep 28, 2020Updated 5 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
NOHYC / autonomous_driving_car_project
View on GitHub
☆12Dec 5, 2021Updated 4 years ago
thunlp / Model_Emotion
View on GitHub
Neuron Activation
☆28Nov 21, 2024Updated last year
baoy-nlp / Latent-GLAT
View on GitHub
Implementation of latent-GLAT (ACL-2022)
☆34Apr 30, 2022Updated 4 years ago
launchnlp / LitCab
View on GitHub
☆25Jun 10, 2025Updated last year
FYQ0919 / PTSA-MCTS
View on GitHub
A PyTorch implementation of PTSA-MCTS from [Accelerating Monte Carlo Tree Search with Probability Tree State Abstraction].
☆16Oct 21, 2023Updated 2 years ago
thanhkaist / CCFDM1
View on GitHub
CCFDM reinforcement learning
☆40Dec 28, 2021Updated 4 years ago
jczhang02 / MUSIC_dataset_script
View on GitHub
This repo contains script to download MUSIC dataset from youtube
☆12Jan 19, 2024Updated 2 years ago
nick-jhlee / fair-manifold-pca
View on GitHub
Fast and Efficient MMD-based Fair PCA via Optimization over Stiefel Manifold (AAAI 2022)
☆11Sep 27, 2022Updated 3 years ago
ruqizhang / banditnet-pytorch
View on GitHub
A Pytorch implementation of "Deep Learning with Logged Bandit Feedback"
☆10Aug 22, 2018Updated 7 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
Lemon-cmd / diffusion-jax
View on GitHub
Diffusion Probabilistic Model in Jax
☆13Apr 20, 2024Updated 2 years ago
thanhkaist / DimCL
View on GitHub
DimCL: Dimensional Contrastive Learning
☆30Dec 9, 2025Updated 7 months ago
neubig / util-scripts
View on GitHub
Various utility scripts useful for natural language processing, machine translation, etc.
☆51Mar 9, 2026Updated 4 months ago
liziniu / ReMax
View on GitHub
Code for Paper (ReMax: A Simple, Efficient and Effective Reinforcement Learning Method for Aligning Large Language Models)
☆202Dec 16, 2023Updated 2 years ago
pmichel31415 / translate
View on GitHub
Translate - a PyTorch Language Library
☆10Mar 14, 2019Updated 7 years ago
EsYoon7 / UVQA
View on GitHub
[ICLR'25] Official code for "Can Video LLMs Refuse to Answer? Alignment for Answerability in Video Large Language Models"
☆35Dec 26, 2025Updated 6 months ago
thanhkaist / mimo_Q_network
View on GitHub
Implementation of Uncertainty-Aware Rank-One MIMO Q Network Framework for Accelerated Offline Reinforcement Learning
☆32Apr 7, 2026Updated 3 months ago
FanmingL / SmartLogger
View on GitHub
☆12May 14, 2024Updated 2 years ago
MadryLab / rla
View on GitHub
Residue Level Alignment
☆22Nov 21, 2024Updated last year
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
crowdAI / crowdai-criteo-ad-placement-challenge-starter-kit
View on GitHub
Starter kit for getting started in the NIPS 2017 Criteo Ad Placement Challenge
☆18Nov 10, 2017Updated 8 years ago
ReedZyd / GenerativeReturnDecomposition
View on GitHub
Source code for Interpretable Reward Redistribution in Reinforcement Learning: A Causal Approach (NeurIPS 2023)
☆10Dec 12, 2023Updated 2 years ago
HarrieO / 2022-SIGIR-plackett-luce
View on GitHub
☆12Jul 4, 2022Updated 4 years ago
thangvubk / SphereRPN
View on GitHub
☆39Dec 14, 2021Updated 4 years ago
cmu-mind / RISE
View on GitHub
☆34Oct 31, 2024Updated last year
ihaeyong / PFNR
View on GitHub
Winning SubNetwork (WSN), Fourier Subneural Operator (FSO), Video-Incremental Learning (VIL), Sequential Neural Implicit Representation (…
☆49Nov 19, 2024Updated last year
joechen24 / 16824-PyTorch-Weakly-Supervised-Detection
View on GitHub
16824 homework: weakly supervised object detection with PyTorch
☆13Sep 5, 2018Updated 7 years ago