Learn online intrinsic rewards from LLM feedback
☆45 · Dec 17, 2024 · Updated last year
Alternatives and similar repositories for oni
Users that are interested in oni are comparing it to the libraries listed below.
- [NeurIPS 2023] Official code release accompanying the paper "NetHack is Hard to Hack" (Piterbarg, Pinto, Fergus) ☆13 · Oct 30, 2023 · Updated 2 years ago
- [ICML 2024] Official code release accompanying the paper "diff History for Neural Language Agents" (Piterbarg, Pinto, Fergus) ☆20 · Aug 20, 2024 · Updated last year
- See https://github.com/cuda-mode/triton-index/ instead! ☆11 · May 8, 2024 · Updated 2 years ago
- Intrinsic Motivation from Artificial Intelligence Feedback ☆135 · Nov 7, 2023 · Updated 2 years ago
- The first place solution for the NeurIPS 2021 Nethack Challenge -- https://www.aicrowd.com/challenges/neurips-2021-the-nethack-challenge ☆62 · Jan 3, 2023 · Updated 3 years ago
- ☆17 · May 14, 2026 · Updated 3 weeks ago
- ☆14 · Jun 8, 2023 · Updated 3 years ago
- Reflect-RL: Two-Player Online RL Fine-Tuning for LMs ☆18 · Jul 19, 2025 · Updated 10 months ago
- A videogame made with PyGame turned into an OpenAI Gym Learning Environment for Reinforcement Learning agents. ☆14 · Jan 3, 2023 · Updated 3 years ago
- ☆22 · Mar 28, 2025 · Updated last year
- Official repo for the E3B algorithm described in the paper "Exploration via Elliptical Episodic Bonuses". ☆87 · Mar 22, 2024 · Updated 2 years ago
- Generate the WizardCoder Instruct from the CodeAlpaca ☆21 · Jun 27, 2023 · Updated 2 years ago
- Nethack Learning Environment Wrapper for Language Interface ☆42 · Sep 11, 2023 · Updated 2 years ago
- Jenkins Plugin for Sysdig Secure ☆15 · Jun 2, 2026 · Updated last week
- TaskMet Task-driven Metric Learning for Model Learning ☆20 · Feb 9, 2024 · Updated 2 years ago
- DEPRECATED! Please use Kinetic: https://developer.kin.org/docs/kinetic ☆11 · Aug 12, 2022 · Updated 3 years ago
- Harness for running and evaluating AI agents against RL environments ☆189 · Updated this week
- Graph Learning with JAX ☆14 · Jul 11, 2022 · Updated 3 years ago
- Spot Sim2Real Infrastructure ☆102 · May 27, 2025 · Updated last year
- A framework for evaluating LLMs in Atari games ☆15 · Apr 21, 2025 · Updated last year
- An ergonomic, opinionated memory interface for AI agents ☆39 · Dec 18, 2025 · Updated 5 months ago
- Code for "Learning Control-Oriented Dynamical Structure from Data" by Spencer M. Richards, Jean-Jacques Slotine, Navid Azizan, and Marco … ☆16 · Oct 23, 2023 · Updated 2 years ago
- [NAACL 2021] Reading and Acting while Blindfolded: The Need for Semantics in Text Game Agents ☆11 · May 31, 2021 · Updated 5 years ago
- ☆24 · Dec 9, 2020 · Updated 5 years ago
- Official implementation of "Latent Action Learning Requires Supervision in the Presence of Distractors", ICML 2025 ☆36 · Jul 8, 2025 · Updated 11 months ago
- ☆15 · Mar 21, 2024 · Updated 2 years ago
- A custom implementation of the malloc..etc ☆15 · Jan 22, 2026 · Updated 4 months ago
- ☆10 · Jun 8, 2024 · Updated 2 years ago
- Qt-like event loops, signals and slots for communication across threads and processes in Python ☆14 · Mar 26, 2024 · Updated 2 years ago
- OSWorld-Human: Benchmarking the Efficiency of Computer-Use Agents ☆25 · May 17, 2026 · Updated 3 weeks ago
- ☆10 · Nov 6, 2024 · Updated last year
- A lightweight computational physics framework, based on the organization of turboWAVE. Implements a "Simulation, PhysicsModule, ComputeTo… ☆12 · Apr 1, 2026 · Updated 2 months ago
- Rocky Linux images for Orange Pi ☆11 · Jul 30, 2023 · Updated 2 years ago
- Object Centric Atari games ☆100 · Dec 5, 2025 · Updated 6 months ago
- ☆10 · Mar 8, 2025 · Updated last year
- The asterai CLI and runtime for running WASM components bundled in environments. ☆20 · Mar 21, 2026 · Updated 2 months ago
- A benchmark to evaluate situated inductive reasoning ☆16 · Jan 7, 2025 · Updated last year
- ☆17 · May 15, 2025 · Updated last year
- Few-shot Bayesian Imitation Learning with Policies as Logic over Programs ☆21 · Oct 19, 2025 · Updated 7 months ago