β14Oct 11, 2022Updated 3 years ago
Alternatives and similar repositories for Open-Ended-Reinforcement-Learning-with-Neural-Reward-Functions
Users that are interested in Open-Ended-Reinforcement-Learning-with-Neural-Reward-Functions are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- π The classic snake game implemented in my own wayβ11Feb 9, 2025Updated last year
- β12May 16, 2024Updated last year
- π A toy object-oriented programming language written by rustβ17Apr 10, 2024Updated last year
- Repository to reproduce the results of the paper "Holomorphic Equilibrium Propagation Computes Exact Gradients Through Finite Size Oscillβ¦β11Oct 20, 2024Updated last year
- EARL: Environment for Autonomous Reinforcement Learningβ37Nov 24, 2022Updated 3 years ago
- End-to-end encrypted email - Proton Mail β’ AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- [ICLR 2025] Highly Efficient Self-Adaptive Reward Shaping for Reinforcement Learning (SASR)β10Aug 26, 2025Updated 7 months ago
- LINQPad like XHTML dumping method.β18Sep 2, 2013Updated 12 years ago
- Controllability-Aware Unsupervised Skill Discovery (ICML 2023)β28Jun 3, 2023Updated 2 years ago
- Tools to Support OpenAtlas developmentβ13Jul 9, 2019Updated 6 years ago
- β10Jun 5, 2024Updated last year
- Scalable and Stable Parallelization of Nonlinear RNNSβ29Mar 6, 2026Updated 3 weeks ago
- Performant and flexible rate limiting algorithm for postgresβ14Sep 2, 2017Updated 8 years ago
- Data and Code for StructuredRegex.β14Nov 16, 2023Updated 2 years ago
- Prototype web interface that enables remote teleoperation of the Stretch RE1 mobile manipulator from Hello Robot Inc.β12Dec 14, 2023Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI β’ AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Listwise Reward Estimation for Offline Preference-based Reinforcement Learning (ICML 2024)β17Jun 18, 2024Updated last year
- A framework to train agents in a Unity environment using a genetic algorithmβ35Jan 29, 2020Updated 6 years ago
- Continual learning of task-specific approximations of the parameter posterior distribution via a shared hypernetwork.β16Nov 1, 2024Updated last year
- A simple hypernetwork implementation in jax using haiku.β24Aug 20, 2022Updated 3 years ago
- C++ Library for Interfacing with Libfranka and Frankapyβ64Feb 10, 2025Updated last year
- Implementation of the "Online learning of long-range dependencies" paper, NeurIPS 2023β21Nov 4, 2024Updated last year
- Tool for finding minimal pairs given a corpus of wordsβ13Oct 13, 2016Updated 9 years ago
- Code and Experiments for L4DC 2021 Paper: "Learning Visually Guided Latent Actions for Assistive Teleoperation"β14May 4, 2021Updated 4 years ago
- Code for paper: Reward Uncertainty for Exploration in Preference-based Reinforcement Learningβ15May 26, 2022Updated 3 years ago
- NordVPN Threat Protection Proβ’ β’ AdTake your cybersecurity to the next level. Block phishing, malware, trackers, and ads. Lightweight app that works with all browsers.
- β13Jun 3, 2022Updated 3 years ago
- β20Jan 4, 2023Updated 3 years ago
- This repository contains the code for our ECCV 2022 paper on our "Non-isotropic Probabilistic Take on Proxy-based Deep Metric Learning".β12Dec 6, 2022Updated 3 years ago
- The official implementation of "Mind the Gap: Offline Policy Optimization for Imperfect Rewards" (ICLR2023)β16Mar 3, 2023Updated 3 years ago
- Official codebase for "B-Pref: Benchmarking Preference-BasedReinforcement Learning" contains scripts to reproduce experiments.β134Nov 3, 2021Updated 4 years ago
- Task Success is not Enough: Investigating the Use of Video-Language Models as Behavior Critics for Catching Undesirable Agent Behaviorsβ12Aug 11, 2024Updated last year
- A PyTorch implementation for the paper 'Extrapolating Beyond Suboptimal Demonstrations via Inverse Reinforcement Learning from Observatioβ¦β14Sep 22, 2021Updated 4 years ago
- A clojure wrapper for usearch, a fast open-source search & clustering engine for vectors.β23Nov 12, 2024Updated last year
- Sketch Driven Regular Expression Generation.β16Apr 26, 2023Updated 2 years ago
- DigitalOcean Gradient AI Platform β’ AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Multi-Objective Reinforcement Learning sandboxβ12Dec 20, 2021Updated 4 years ago
- Minimal Pairs scraping/anki deck creationβ10Dec 8, 2022Updated 3 years ago
- <κ°λ°μλ₯Ό μν νμ μν>(νλΉλ―Έλμ΄, 2024)μ μ½λ μ μ₯μβ18Jan 9, 2025Updated last year
- β11Jul 13, 2018Updated 7 years ago
- Quadruped Robot controller design and simulation on Webotsβ12Apr 28, 2020Updated 5 years ago
- A project copied from google-research which named motion-imitation was rewrited with PyTorchβ10Sep 30, 2022Updated 3 years ago
- Accompanying code for the RSS 2019 paper, "Learning Reward Functions by Integrating Human Demonstrations and Preferences"β12May 20, 2019Updated 6 years ago