minaek/reward_design_with_llms

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/minaek/reward_design_with_llms)

minaek / reward_design_with_llms

☆222

Alternatives and similar repositories for reward_design_with_llms

Users that are interested in reward_design_with_llms are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ademiadeniji / lamp
View on GitHub
☆47Jan 29, 2024Updated 2 years ago
PSI-Intention2022 / PSI-Competition
View on GitHub
Contains scripts for the PSI competition.
☆11Dec 11, 2023Updated 2 years ago
wuxiyang1996 / Heterogeneous_Highway_Env
View on GitHub
Heterogeneous Multi-agent Version of Highway-env
☆18Jun 28, 2023Updated 3 years ago
hengyuan-hu / instruct-rl
View on GitHub
☆16Feb 23, 2024Updated 2 years ago
Cranial-XIX / llm-pddl
View on GitHub
☆456Sep 27, 2023Updated 2 years ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
Tiiiger / templm
View on GitHub
Code release for "TempLM: Distilling Language Models into Template-Based Generators"
☆14Jul 21, 2022Updated 3 years ago
CU-DitecT / TRC21-PINN-CFM
View on GitHub
☆13Jul 23, 2023Updated 2 years ago
AGI-Labs / manipulate-by-seeing
View on GitHub
☆10Jun 5, 2024Updated 2 years ago
zhaoyizhou1123 / mbrcsl
View on GitHub
☆11Nov 18, 2023Updated 2 years ago
xlang-ai / text2reward
View on GitHub
[ICLR 2024 Spotlight] Text2Reward: Reward Shaping with Language Models for Reinforcement Learning
☆209Dec 17, 2024Updated last year
facebookresearch / nocturne
View on GitHub
A data-driven, fast driving simulator for multi-agent coordination under partial observability.
☆300Jun 18, 2024Updated 2 years ago
ben-eysenbach / info_geometry
View on GitHub
Code to accompany the paper "The Information Geometry of Unsupervised Reinforcement Learning"
☆20Oct 6, 2021Updated 4 years ago
metadriverse / TS2C
View on GitHub
[ICLR 2023] The official code for paper "Guarded Policy Optimization with Imperfect Online Demonstrations"
☆14Apr 30, 2023Updated 3 years ago
kyegomez / LOGICGUIDE
View on GitHub
Plug in and Play implementation of "Certified Reasoning with Language Models" that elevates model reasoning by 40%
☆16Jun 20, 2023Updated 3 years ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
csmile-1006 / ARP
View on GitHub
Guide Your Agent with Adaptive Multimodal Rewards (NeurIPS 2023 Accepted)
☆33Sep 25, 2023Updated 2 years ago
ShengranHu / Thought-Cloning
View on GitHub
[NeurIPS '23 Spotlight] Thought Cloning: Learning to Think while Acting by Imitating Human Thinking
☆268Jun 28, 2024Updated 2 years ago
Exploration-Lab / ScriptWorld
View on GitHub
☆20Jun 21, 2025Updated last year
siddharthverma314 / clcp-neurips-2020
View on GitHub
Code for Continual Learning of Control Primitives
☆18Nov 11, 2020Updated 5 years ago
AlignmentResearch / vlmrm
View on GitHub
☆72Jun 25, 2024Updated 2 years ago
google-deepmind / language_to_reward_2023
View on GitHub
☆161Aug 19, 2024Updated last year
penn-pal-lab / interactive_reward_functions
View on GitHub
Code release for "Training Robots to Evaluate Robots" (CoRL'22, Best Paper Award)
☆17Feb 15, 2023Updated 3 years ago
mxu34 / prompt-dt
View on GitHub
Official code repository for Prompt-DT.
☆123Aug 3, 2022Updated 3 years ago
HumanCompatibleAI / human_ai_robustness
View on GitHub
☆22Jul 15, 2020Updated 5 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
tianjunz / HIR
View on GitHub
☆157Mar 18, 2023Updated 3 years ago
HazyResearch / TART
View on GitHub
TART: A plug-and-play Transformer module for task-agnostic reasoning
☆202Jun 22, 2023Updated 3 years ago
anuragajay / hip
View on GitHub
Codebase for HiP
☆90Dec 15, 2023Updated 2 years ago
YushuoLi / Gato-A-Generalist-Agent
View on GitHub
Minimal code for A Generalist Agent
☆44Nov 4, 2022Updated 3 years ago
UKPLab / on-emergence
View on GitHub
Codes and files for the paper Are Emergent Abilities in Large Language Models just In-Context Learning
☆33Jan 9, 2025Updated last year
TonghanWang / EITI-EDTI
View on GitHub
Codes accompanying the paper "Influence-Based Multi-Agent Exploration" (ICLR 2020 spotlight)
☆34Mar 16, 2020Updated 6 years ago
mees / hulc2
View on GitHub
[ICRA2023] Grounding Language with Visual Affordances over Unstructured Data
☆48Oct 29, 2023Updated 2 years ago
cohere-ai / human-feedback-paper
View on GitHub
Code and data from the paper 'Human Feedback is not Gold Standard'
☆20May 5, 2026Updated 2 months ago
XanderJC / scalable-birl
View on GitHub
Scalable Bayesian Inverse Reinforcement Learning (ICLR 2021) by Alex J. Chan and Mihaela van der Schaar.
☆47Mar 12, 2021Updated 5 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
anthonysimeonov / rpdiff
View on GitHub
☆62Jan 15, 2024Updated 2 years ago
FranxYao / GPT-Bargaining
View on GitHub
Code for Arxiv 2023: Improving Language Model Negociation with Self-Play and In-Context Learning from AI Feedback
☆207May 24, 2023Updated 3 years ago
facebookresearch / multi_view_active_learning
View on GitHub
Code for paper Rethinking the Data Annotation Process for Multi-view 3D Pose Estimation with Active Learning and Self-Training
☆22Apr 18, 2023Updated 3 years ago
likenneth / q_probe
View on GitHub
Q-Probe: A Lightweight Approach to Reward Maximization for Language Models
☆40Jun 10, 2024Updated 2 years ago
allenai / contrastive-explanations
View on GitHub
Explaining neural decisions contrastively to alternative decisions.
☆24Mar 18, 2021Updated 5 years ago
xingyaoww / LeTI
View on GitHub
Official repo for NAACL 2024 Findings paper "LeTI: Learning to Generate from Textual Interactions."
☆66Jun 29, 2023Updated 3 years ago
daochenzha / autosmote
View on GitHub
[CIKM 2022] Towards Automated Over-Sampling for Imbalanced Classification
☆10Mar 20, 2023Updated 3 years ago