ai-agents-2030 / DistRL-open
☆22 · May 23, 2025 · Updated 8 months ago
Alternatives and similar repositories for DistRL-open
Users that are interested in DistRL-open are comparing it to the libraries listed below
- Official repo for paper DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning. ☆387 · Feb 22, 2025 · Updated 11 months ago
- Benchmarking Mobile Device Control Agents across Diverse Configurations (ICLR 2024 workshop GenAI4DM spotlight presentation; CoLLAs 2025) ☆35 · Jul 21, 2025 · Updated 6 months ago
- AndroidWorld is an environment and benchmark for autonomous agents ☆625 · Feb 3, 2026 · Updated last week
- Code repository for scenarios and environment setup as part of ITBench ☆15 · Updated this week
- Stochastic Markov Games ☆12 · Oct 5, 2017 · Updated 8 years ago
- A GPU (CUDA) accelerated set of tools for object detection using waldboost/LBP. ☆10 · May 25, 2015 · Updated 10 years ago
- A2C, ACKTR and A2T implementations for ViZDoom ☆10 · Dec 18, 2017 · Updated 8 years ago
- ☆12 · Feb 6, 2021 · Updated 5 years ago
- Maddpg_flight code ☆11 · Jul 4, 2018 · Updated 7 years ago
- DeepNC: Deep Generative Network Completion ☆10 · Dec 1, 2020 · Updated 5 years ago
- MADDPG agent with collaboration and competition ☆12 · Nov 9, 2018 · Updated 7 years ago
- Test scripts for exploring PyTorch JIT and quantization capability ☆11 · Mar 8, 2021 · Updated 4 years ago
- Code for L4DC 2022 paper: Joint Synthesis of Safety Certificate and Safe Control Policy Using Constrained Reinforcement Learning. ☆15 · Jul 31, 2023 · Updated 2 years ago
- Source code of “Reinforcement Learning with Token-level Feedback for Controllable Text Generation” (NAACL 2024) ☆17 · Dec 8, 2024 · Updated last year
- Plancraft is a Minecraft environment and agent suite to test planning capabilities in LLMs ☆26 · Nov 7, 2025 · Updated 3 months ago
- The official implementation of Instance As Identity: A Generic Online Paradigm for Video Instance Segmentation. ☆17 · Sep 19, 2022 · Updated 3 years ago
- WIP implementation of "The Predictron: End-To-End Learning and Planning" (http://arxiv.org/abs/1612.08810) in Chainer ☆11 · Dec 31, 2016 · Updated 9 years ago
- ScreenExplorer: Training a Vision-Language Model for Diverse Exploration in Open GUI World ☆24 · Jun 17, 2025 · Updated 7 months ago
- An open-source framework to benchmark and assess safety specifications of Reinforcement Learning problems. ☆14 · Aug 25, 2023 · Updated 2 years ago
- Feasibility Consistent Representation Learning for Safe Reinforcement Learning (ICML 2024). Current SOTA model-free safe RL algorithm on … ☆13 · Jul 12, 2024 · Updated last year
- Implementation for ACER in tensorflow and sonnet by deepmind ☆11 · Aug 28, 2017 · Updated 8 years ago
- ☆15 · Jan 7, 2022 · Updated 4 years ago
- A Universal Platform for Training and Evaluation of Mobile Interaction ☆60 · Sep 24, 2025 · Updated 4 months ago
- ☆16 · Oct 3, 2022 · Updated 3 years ago
- The solution to https://openai.com/requests-for-research ☆12 · Mar 23, 2017 · Updated 8 years ago
- Environments with IC3Net paper ☆14 · Jan 8, 2019 · Updated 7 years ago
- discrete gate sizing ☆14 · Nov 23, 2020 · Updated 5 years ago
- Python version of "Fast Training of Triplet-based Deep Binary Embedding Networks" by Zhuang et al. ☆12 · Sep 8, 2016 · Updated 9 years ago
- Progressive Attention Networks ☆12 · Oct 25, 2016 · Updated 9 years ago
- Build TVM docker image for production compilation deployments ☆12 · Sep 7, 2021 · Updated 4 years ago
- Code release for "Generating Code World Models with Large Language Models Guided by Monte Carlo Tree Search" published at NeurIPS '24. ☆17 · Feb 21, 2025 · Updated 11 months ago
- ☆14 · Jun 21, 2016 · Updated 9 years ago
- High granularity and accuracy Starcraft replay data extractor which outputs to a database ☆14 · Feb 18, 2022 · Updated 3 years ago
- Very very simple run on sumo ☆13 · May 14, 2018 · Updated 7 years ago
- Learning to play Atari games with reinforcement learning ☆10 · Jan 4, 2016 · Updated 10 years ago
- ☆12 · Jul 31, 2025 · Updated 6 months ago
- Official implementation for "You Only Look at Screens: Multimodal Chain-of-Action Agents" (Findings of ACL 2024) ☆255 · Jul 16, 2024 · Updated last year
- Towards a Mechanistic Interpretation of Multi-Step Reasoning Capabilities of Language Models ☆15 · Nov 4, 2023 · Updated 2 years ago
- Intrinsic Motivation and Automatic Curricula via Asymmetric Self-Play ☆14 · May 1, 2018 · Updated 7 years ago