☆16Feb 23, 2024Updated 2 years ago
Alternatives and similar repositories for instruct-rl
Users that are interested in instruct-rl are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Implementation of the Off Belief Learning algorithm.☆49Aug 18, 2022Updated 3 years ago
- ☆10Apr 23, 2021Updated 5 years ago
- Codebase for BRDiv: Diverse teammate generation for ad hoc teamwork☆13May 2, 2024Updated 2 years ago
- ☆13Feb 25, 2025Updated last year
- Code accompanying our NeurIPS 2021 traffic4cast challenge☆27Sep 16, 2022Updated 3 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Code for "Training Adversarially Robust Sparse Networks via Bayesian Connectivity Sampling" [ICML 2021]☆10Mar 14, 2022Updated 4 years ago
- ☆10Oct 11, 2022Updated 3 years ago
- ☆20Jun 21, 2025Updated 10 months ago
- Code repository for "N-agent Ad Hoc Teamwork" paper (Wang et al., Neurips 2024).☆26Oct 2, 2025Updated 7 months ago
- 亚马逊棋冠军程序细节☆14Jan 7, 2026Updated 4 months ago
- Deep RL Code for XDO: A Double Oracle Algorithm for Extensive-Form Games☆40Aug 27, 2021Updated 4 years ago
- 微软创新杯参赛作品,用C#语言,Unity 3D游戏引擎和Vuforia AR引擎制作的一款解密类AR小游戏☆13Mar 13, 2018Updated 8 years ago
- Various explorations into the game of Poker using MCTS, NFSP, and image-recognition/web-scraping☆13Oct 23, 2020Updated 5 years ago
- 青锋-springboot2.6.x+vue3-antdesign-vite开源架构,实现了系统管理模块、权限控制模块(菜单权限、功能按钮权限、数据权限)、代码生成器(单表、树表)、quartz动态定时器等功能。☆11Apr 30, 2022Updated 4 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- NJU程设实验项目三:爱因斯坦棋☆10May 24, 2019Updated 6 years ago
- ☆17Nov 24, 2025Updated 5 months ago
- SkillHack: A Benchmark for Skill Transfer in Open-Ended Reinforcement Learning☆17Oct 23, 2022Updated 3 years ago
- AMG-RAG (Agentic Medical Graph-RAG) is a comprehensive framework that automates the construction and continuous updating of Medical Knowl…☆30Feb 5, 2026Updated 3 months ago
- ☆12Jan 30, 2021Updated 5 years ago
- ☆223Jun 6, 2023Updated 2 years ago
- More efficient exploration for reinforcement learning in two-player, zero-sum game☆21Jul 30, 2024Updated last year
- This is the official code repository for the paper "Decoding Global Preferences: Temporal and Cooperative Dependency Modeling in Multi-Ag…☆12Apr 9, 2026Updated last month
- Code for Continual Learning of Control Primitives☆18Nov 11, 2020Updated 5 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- illustrate Neural Network (python+LaTeX)☆14Oct 4, 2020Updated 5 years ago
- ☆10Aug 8, 2021Updated 4 years ago
- ☆38Mar 11, 2025Updated last year
- A new dataset of difficult graduate-level applied mathematics problems; evaluations demonstrate that leading LLMs currently exhibit low a…☆28Feb 14, 2025Updated last year
- Codebase for the Graph-based Policy Learning algorithm, which is designed for learning policies to solve the open ad hoc teamwork problem…☆35Mar 31, 2021Updated 5 years ago
- AdaRefiner: Refining Decisions of Language Models with Adaptive Feedback (NAACL 2024)☆19Aug 9, 2024Updated last year
- Code for Policy Consolidation for Continual Reinforcement Learning☆10May 12, 2019Updated 6 years ago
- Exploring techniques to generate diverse conventions in multi-agent settings☆15Nov 14, 2023Updated 2 years ago
- Code to accompany the paper "The Information Geometry of Unsupervised Reinforcement Learning"☆20Oct 6, 2021Updated 4 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Twitter-NFT sales bot that tweets individual and sweep sales with images from Opensea, Looksrare, X2Y2, and Blur using Opensea/Looksrare …☆13Jul 27, 2023Updated 2 years ago
- 服外比赛项目,设计MasterGo插件,通过解析网页生成MasterGo设计稿捏☆15Feb 19, 2023Updated 3 years ago
- Simplified Action Decoder for Deep Multi-Agent Reinforcement Learning☆102Jun 22, 2022Updated 3 years ago
- A Simulated Optimal Intrusion Response Game☆21Apr 3, 2022Updated 4 years ago
- ☆16Apr 6, 2023Updated 3 years ago
- ☆14Nov 26, 2022Updated 3 years ago
- Research code implementing the search AI agent for Hanabi, as well as a web server so people can play against it☆129Jul 18, 2023Updated 2 years ago