Code for the paper: Dense Reward for Free in Reinforcement Learning from Human Feedback (ICML 2024) by Alex J. Chan, Hao Sun, Samuel Holt, and Mihaela van der Schaar
☆38Aug 11, 2024Updated last year
Alternatives and similar repositories for attention-based-credit
Users that are interested in attention-based-credit are comparing it to the libraries listed below
Sorting:
- SimPER: A Minimalist Approach to Preference Alignment without Hyperparameters (ICLR 2025)☆17Aug 22, 2025Updated 6 months ago
- ☆14Mar 5, 2024Updated 2 years ago
- Code release for paper "Autonomous Improvement of Instruction Following Skills via Foundation Models" | CoRL 2024☆76Oct 10, 2025Updated 4 months ago
- Counterfactual Evaluation and Learning for Interactive Systems: Foundations, Implementations, and Recent Advances☆12Aug 14, 2022Updated 3 years ago
- (WSDM2022 Best Paper Award Runner-Up) "Doubly Robust Off-Policy Evaluation for Ranking Policies under the Cascade Behavior Model"☆13Jul 16, 2023Updated 2 years ago
- A retrieval augmented sequence modeling toolkit implemented based on Fairseq☆29Mar 3, 2023Updated 3 years ago
- This is the pytorch implementation of the UAI2023 paper "A Trajectory is Worth Three Sentences: Multimodal Transformer for Offline Reinf…☆11Oct 9, 2023Updated 2 years ago
- This is the official code implementation of Bongard-OpenWorld (ICLR 2024).☆14Jan 6, 2025Updated last year
- A Pytorch implementation of "Deep Learning with Logged Bandit Feedback"☆10Aug 22, 2018Updated 7 years ago
- [ICML'25] Official code for "ConfPO: Exploiting Policy Model Confidence for Critical Token Selection in Preference Optimization"☆18Dec 22, 2025Updated 2 months ago
- 비디오 기반 인공지능 대화시스템☆14Dec 23, 2023Updated 2 years ago
- ☆12Jul 4, 2022Updated 3 years ago
- ☆16Nov 7, 2020Updated 5 years ago
- ☆18Apr 25, 2023Updated 2 years ago
- [ICLR 2025 Oral] Knowledge Entropy Decay during Language Model Pretraining Hinders New Knowledge Acquisition☆19Nov 25, 2024Updated last year
- Residue Level Alignment☆22Nov 21, 2024Updated last year
- Source codes for "Preference-grounded Token-level Guidance for Language Model Fine-tuning" (NeurIPS 2023).☆17Jan 8, 2025Updated last year
- [ICLR'24] Official code for "C-TPT: Calibrated Test-Time Prompt Tuning for Vision-Language Models via Text Feature Dispersion"☆22Jun 9, 2024Updated last year
- (ICTIR2020) "Unbiased Pairwise Learning from Biased Implicit Feedback"☆19Nov 21, 2022Updated 3 years ago
- Starter kit for getting started in the NIPS 2017 Criteo Ad Placement Challenge☆18Nov 10, 2017Updated 8 years ago
- DimCL: Dimensional Contrastive Learning☆30Dec 9, 2025Updated 2 months ago
- ☆25Dec 12, 2025Updated 2 months ago
- Codebase for Paper Reusing Embeddings: Reproducible Reward Model Research in Large Language Model Alignment without GPUs☆22Apr 24, 2025Updated 10 months ago
- [ICLR'25] Official code for "Can Video LLMs Refuse to Answer? Alignment for Answerability in Video Large Language Models"☆34Dec 26, 2025Updated 2 months ago
- Code for "Counterfactual Off-Policy Evaluation with Gumbel-Max Structural Causal Models" (ICML 2019)☆46Sep 28, 2020Updated 5 years ago
- Code for the experiments of Matrix Factorization Bandit☆24Feb 4, 2019Updated 7 years ago
- CFR implementation of a poker bot.☆12Feb 17, 2023Updated 3 years ago
- Policy Learning from Large Vision-Language Model Feedback Without Reward Modeling (IROS 2025)☆36Dec 26, 2025Updated 2 months ago
- ☆25Jun 10, 2025Updated 8 months ago
- ☆39Dec 21, 2024Updated last year
- ☆21Dec 22, 2020Updated 5 years ago
- Reference implementation for Token-level Direct Preference Optimization(TDPO)☆152Feb 14, 2025Updated last year
- Deep Reinforcement Learning by using Truly Proximal Policy Optimization in Tensorflow 2 and Pytorch☆22Nov 9, 2025Updated 3 months ago
- Neuron Activation☆26Nov 21, 2024Updated last year
- Implementation of variational autoencoders for collaborative filtering in PyTorch☆25May 13, 2019Updated 6 years ago
- Implementation for the research paper "Enhancing LLM Reasoning via Critique Models with Test-Time and Training-Time Supervision".☆55Nov 29, 2024Updated last year
- (SIGIR2020) “Asymmetric Tri-training for Debiasing Missing-Not-At-Random Explicit Feedback’’☆21Nov 21, 2022Updated 3 years ago
- (RecSys2020) "Doubly Robust Estimator for Ranking Metrics with Post-Click Conversions"☆24Mar 25, 2023Updated 2 years ago
- Pytorch implementation of HyperLLaVA: Dynamic Visual and Language Expert Tuning for Multimodal Large Language Models☆28Mar 22, 2024Updated last year