Learn online intrinsic rewards from LLM feedback
☆45Dec 17, 2024Updated last year
Alternatives and similar repositories for oni
Users that are interested in oni are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [NeurIPS 2023] Official code release accompanying the paper "NetHack is Hard to Hack" (Piterbarg, Pinto, Fergus)☆13Oct 30, 2023Updated 2 years ago
- Intrinsic Motivation from Artificial Intelligence Feedback☆133Nov 7, 2023Updated 2 years ago
- The first place solution for the NeurIPS 2021 Nethack Challenge -- https://www.aicrowd.com/challenges/neurips-2021-the-nethack-challenge☆63Jan 3, 2023Updated 3 years ago
- ☆17Apr 23, 2026Updated last week
- ☆14Jun 8, 2023Updated 2 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- Reflect-RL: Two-Player Online RL Fine-Tuning for LMs☆18Jul 19, 2025Updated 9 months ago
- A videogame made with PyGame turned into an Open AI Gym Learning Environment for Reinforcement Learning agents.☆15Jan 3, 2023Updated 3 years ago
- ☆22Mar 28, 2025Updated last year
- Harness for running and evaluating AI agents against RL environments☆155Updated this week
- Official repo for the E3B algorithm described in the paper "Exploration via Elliptical Episodic Bonuses".☆87Mar 22, 2024Updated 2 years ago
- Nethack Learning Environment Wrapper for Language Interface☆42Sep 11, 2023Updated 2 years ago
- TaskMet Task-driven Metric Learning for Model Learning☆19Feb 9, 2024Updated 2 years ago
- Graph Learning with JAX☆14Jul 11, 2022Updated 3 years ago
- A framework for evaluating LLMs in Atari games☆15Apr 21, 2025Updated last year
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Code for "Learning Control-Oriented Dynamical Structure from Data" by Spencer M. Richards, Jean-Jacques Slotine, Navid Azizan, and Marco …☆16Oct 23, 2023Updated 2 years ago
- [NAACL 2021] Reading and Acting while Blindfolded: The Need for Semantics in Text Game Agents☆11May 31, 2021Updated 4 years ago
- ☆24Dec 9, 2020Updated 5 years ago
- Official implementation of "Latent Action Learning Requires Supervision in the Presence of Distractors", ICML 2025☆34Jul 8, 2025Updated 9 months ago
- A custom implementation of the malloc..etc☆15Jan 22, 2026Updated 3 months ago
- Code for the paper "Batch size invariance for policy optimization"☆60Apr 2, 2023Updated 3 years ago
- ☆18Feb 7, 2021Updated 5 years ago
- ☆25Apr 24, 2026Updated last week
- ☆15Oct 2, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆10Nov 6, 2024Updated last year
- Object Centric Atari games☆99Dec 5, 2025Updated 4 months ago
- TensorRT Accelerate Mask_RCNN☆11Nov 2, 2019Updated 6 years ago
- a benchmark to evaluate the situated inductive reasoning☆15Jan 7, 2025Updated last year
- Barebones Unity3D project, works as a template for a seated VR experience. Implements basic keyboard, menu, and IK functionality.☆29Jul 26, 2023Updated 2 years ago
- Code for the paper Generalization and Equilibrium in GANs☆16Aug 3, 2017Updated 8 years ago
- [ICLR 2022 Spotlight] Multi-Stage Episodic Control for Strategic Exploration in Text Games☆15Feb 8, 2026Updated 2 months ago
- A curated list for Efficient Large Language Models☆11Mar 25, 2024Updated 2 years ago
- Code for the ICLR 2024 spotlight paper: "Learning to Act without Actions" (introducing Latent Action Policies)☆140Jul 31, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Train an agent to play VizDoom with multi sensory inputs. Trained using sample factory☆14Jul 9, 2021Updated 4 years ago
- Code and data for Learning Rewards from Linguistic Feedback, AAAI '21☆11Dec 16, 2020Updated 5 years ago
- [ICML 2023] "Outline, Then Details: Syntactically Guided Coarse-To-Fine Code Generation", Wenqing Zheng, S P Sharan, Ajay Kumar Jaiswal, …☆43Nov 9, 2023Updated 2 years ago
- ☆23Feb 4, 2025Updated last year
- Submission code of UEFDRL team to NeurIPS 2019 MineRL challenge (5th place)☆13Nov 13, 2020Updated 5 years ago
- Benchmarking Agentic LLM and VLM Reasoning On Games☆247Apr 9, 2026Updated 3 weeks ago
- [ICLR 2025] "Training LMs on Synthetic Edit Sequences Improves Code Synthesis" (Piterbarg, Pinto, Fergus)☆19Feb 11, 2025Updated last year