[ICML 2024] Official code release accompanying the paper "diff History for Neural Language Agents" (Piterbarg, Pinto, Fergus)
☆20Aug 20, 2024Updated last year
Alternatives and similar repositories for diff_history
Users that are interested in diff_history are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- [NeurIPS 2023] Official code release accompanying the paper "NetHack is Hard to Hack" (Piterbarg, Pinto, Fergus)☆13Oct 30, 2023Updated 2 years ago
- [ICLR 2025] "Training LMs on Synthetic Edit Sequences Improves Code Synthesis" (Piterbarg, Pinto, Fergus)☆19Feb 11, 2025Updated last year
- The first place solution for the NeurIPS 2021 Nethack Challenge -- https://www.aicrowd.com/challenges/neurips-2021-the-nethack-challenge☆63Jan 3, 2023Updated 3 years ago
- Nethack Learning Environment Wrapper for Language Interface☆42Sep 11, 2023Updated 2 years ago
- Learn online intrinsic rewards from LLM feedback☆45Dec 17, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A JAX-accelerated implementation of the Procedural Content Generation via Reinforcement Learning (PCGRL) framework. We train RL agents to…☆15Nov 26, 2025Updated 6 months ago
- Find out why your CoreML model isn't running on the Neural Engine!☆30Jun 18, 2024Updated last year
- The code for the paper "A Bayesian Approach to Online Planning" published in ICML 2024.☆13Jun 17, 2024Updated last year
- Python bindings for the Dynamic Animation and Robotics Toolkit☆16Oct 20, 2018Updated 7 years ago
- The NetHack Learning Environment☆132Updated this week
- A videogame made with PyGame turned into an Open AI Gym Learning Environment for Reinforcement Learning agents.☆14Jan 3, 2023Updated 3 years ago
- ☆14Jun 8, 2024Updated 2 years ago
- PACIFIC: Towards Proactive Conversational Question Answering over Tabular and Textual Data in Finance☆14May 15, 2024Updated 2 years ago
- Repository contains demo code for MTAnchor, an interactive, multilingual topic modeling system. The code accompanies the paper Multiling…☆12Jan 25, 2019Updated 7 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Code for paper "Hierarchically Decoupled Imitation for Morphological Transfer"☆17Mar 24, 2023Updated 3 years ago
- ☆20Nov 3, 2024Updated last year
- TiC: Exploring Vision Transformer in Convolution☆11Oct 24, 2023Updated 2 years ago
- Machine learning (and uncertainty quantification?) of climate model parameterizations using differentiable (and probabilistic?) programmi…☆22Feb 16, 2024Updated 2 years ago
- Code accompanying the paper "TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play" (AAMAS 2023) 足球游戏智能体☆14May 25, 2023Updated 3 years ago
- ☆15Dec 14, 2024Updated last year
- Implementations of Deep RL Algorithms in OpenAI Gym Environments☆15Dec 11, 2020Updated 5 years ago
- Official repo for the E3B algorithm described in the paper "Exploration via Elliptical Episodic Bonuses".☆87Mar 22, 2024Updated 2 years ago
- Dexterous teleoperation for the Stretch mobile manipulators from Hello Robot Inc.☆30Feb 14, 2025Updated last year
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Code for Environment Probing Interaction Policies [ICLR 2019]☆30Jun 17, 2019Updated 6 years ago
- Transformer Doctor: Diagnosing and Treating Vision Transformers☆11Jan 15, 2025Updated last year
- Code for the paper "Learning to Assist Humans without Inferring Rewards"☆20Jul 7, 2024Updated last year
- Download, parse, and filter data from Court Listener, part of the FreeLaw projects. Data-ready for The-Pile.☆16Jun 3, 2023Updated 3 years ago
- 🐝 Create powerful, collaborative AI applications.☆64Nov 7, 2024Updated last year
- Web version of “Neuroevolution of Self-Interpretable Agents” (https://arxiv.org/abs/2003.08165)☆22Jan 12, 2022Updated 4 years ago
- Benchmarking Agentic LLM and VLM Reasoning On Games☆255Apr 9, 2026Updated 2 months ago
- Accompanying Code for "Flipping Coins to Estimate Pseudocounts for Exploration in Reinforcement Learning", ICML 2023☆24Dec 29, 2023Updated 2 years ago
- Dataset for the paper: "A multi-task semi-supervised framework for Text2Graph & Graph2Text"☆25Feb 19, 2022Updated 4 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- MiniHack the Planet: A Sandbox for Open-Ended Reinforcement Learning Research☆44Jul 14, 2025Updated 10 months ago
- ALAS: Autonomous Learning Agent System☆18Aug 14, 2025Updated 9 months ago
- Convert GitHub PRs into Harbor tasks☆64Mar 10, 2026Updated 3 months ago
- Production-Grade Autoresearch. Ideal for GPU kernels, ML model development, feature engineering, prompt engineering, and other optimizabl…☆51Updated this week
- [ACL 2024] Masked Thought: Simply Masking Partial Reasoning Steps Can Improve Mathematical Reasoning Learning of Language Models☆27Jul 9, 2024Updated last year
- [EMNLP 2024] Ask-before-Plan: Proactive Language Agents for Real-World Planning☆23Jul 28, 2025Updated 10 months ago
- Repository containing python wrappers for NVIDIA Omniverse Isaac-Sim☆29May 12, 2021Updated 5 years ago