Learn online intrinsic rewards from LLM feedback
☆45 · Dec 17, 2024 · Updated last year
Alternatives and similar repositories for oni
Users that are interested in oni are comparing it to the libraries listed below
- [NeurIPS 2023] Official code release accompanying the paper "NetHack is Hard to Hack" (Piterbarg, Pinto, Fergus) ☆12 · Oct 30, 2023 · Updated 2 years ago
- ☆15 · Updated this week
- See https://github.com/cuda-mode/triton-index/ instead! ☆11 · May 8, 2024 · Updated last year
- Intrinsic Motivation from Artificial Intelligence Feedback ☆134 · Nov 7, 2023 · Updated 2 years ago
- Reflect-RL: Two-Player Online RL Fine-Tuning for LMs ☆18 · Jul 19, 2025 · Updated 7 months ago
- Graph Learning with JAX ☆14 · Jul 11, 2022 · Updated 3 years ago
- [ICML 2024] Official code release accompanying the paper "diff History for Neural Language Agents" (Piterbarg, Pinto, Fergus) ☆20 · Aug 20, 2024 · Updated last year
- ☆18 · Feb 7, 2021 · Updated 5 years ago
- Harness for running and evaluating AI agents against RL environments ☆115 · Updated this week
- The first place solution for the NeurIPS 2021 Nethack Challenge: https://www.aicrowd.com/challenges/neurips-2021-the-nethack-challenge ☆62 · Jan 3, 2023 · Updated 3 years ago
- ☆19 · Sep 22, 2025 · Updated 5 months ago
- PowerBiMIP is an open-source, efficient bilevel mixed-integer programming (BiMIP) solver, with a special focus on applications in power a… ☆34 · Updated this week
- Neural Fixed-Point Acceleration for Convex Optimization ☆29 · Oct 6, 2022 · Updated 3 years ago
- Disciplined convex stochastic programming. For the cvxstoc home page, please see: ☆32 · Sep 15, 2020 · Updated 5 years ago
- Tools for visualizing and comparing data from vertebrate retinas ☆14 · Jan 20, 2025 · Updated last year
- Semi-Supervised Offline Reinforcement Learning with Action-Free Trajectories ☆42 · Jul 16, 2023 · Updated 2 years ago
- SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning. COLM 2024 Accepted Paper ☆32 · May 29, 2024 · Updated last year
- Official repo for the E3B algorithm described in the paper "Exploration via Elliptical Episodic Bonuses". ☆87 · Mar 22, 2024 · Updated last year
- Your command-line, context-aware chatbot for instant codebase insights & more ✨ ☆16 · May 30, 2024 · Updated last year
- Mod merging tool for The Witcher 3: Wild Hunt [C++, Qt5] ☆12 · Nov 4, 2016 · Updated 9 years ago
- This project is a Token Sale dApp that allows one to buy tokens and also displays recently minted tokens on the Solana blockchain using t… ☆11 · Jul 30, 2024 · Updated last year
- raytracer ☆10 · Jul 18, 2022 · Updated 3 years ago
- ☆13 · Nov 21, 2025 · Updated 3 months ago
- Code for the AAAI 2019 paper "Melding the Data-Decisions Pipeline: Decision-Focused Learning for Combinatorial Optimization" ☆35 · Feb 12, 2021 · Updated 5 years ago
- My CV ☆39 · Updated this week
- ☆50 · Oct 12, 2025 · Updated 4 months ago
- ☆14 · Mar 21, 2024 · Updated last year
- Visualize linear programming at https://lpviz.net ☆33 · Jan 20, 2026 · Updated last month
- Project template for STAT-4830 ☆19 · Feb 16, 2026 · Updated last week
- Kernel CLI ☆13 · Updated this week
- Simple framework for symbolic manipulation ☆11 · Nov 6, 2024 · Updated last year
- Kernel Playground - A playground to run large scale experiments on the Linux Kernel ☆17 · Nov 8, 2025 · Updated 3 months ago
- ☆14 · Jul 4, 2022 · Updated 3 years ago
- Source code for the paper titled: "Unlocking the full potential of smart charging: Addressing paused and delayed charging problems in ele… ☆11 · May 22, 2024 · Updated last year
- OSWorld-Human: Benchmarking the Efficiency of Computer-Use Agents ☆21 · Jan 6, 2026 · Updated last month
- Example Systems using PowerDynamics.jl ☆12 · Oct 10, 2022 · Updated 3 years ago
- Unofficial Knowledge Base for Novel AI ☆16 · Nov 8, 2025 · Updated 3 months ago
- Official codebase for "Context Aware Deep Learning for Multi Modal Depression Detection" [ICASSP 2019, Oral] ☆11 · Dec 26, 2024 · Updated last year
- Digital advertising is becoming increasingly important. At the same time, however, the problems of this type of marketing are becoming mo… ☆11 · Oct 4, 2022 · Updated 3 years ago