☆25Jun 30, 2022Updated 3 years ago
Alternatives and similar repositories for Competition_Olympics-Integrated
Users that are interested in Competition_Olympics-Integrated are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A simple 2D ball collision engine.☆12Jun 15, 2023Updated 2 years ago
- ☆14Jul 5, 2021Updated 4 years ago
- Codes accompanying the paper "Offline Reinforcement Learning with Value-Based Episodic Memory" (ICLR 2022 https://arxiv.org/abs/2110.0979…☆15Mar 9, 2022Updated 4 years ago
- Code repository for On the interaction between supervision and self-play in emergent communication (ICLR 2020)☆15Feb 4, 2020Updated 6 years ago
- Population-Based Training in Python☆18Oct 8, 2018Updated 7 years ago
- ☆74Feb 4, 2024Updated 2 years ago
- Scalable Multi-Agent Reinforcement Learning☆15Dec 25, 2021Updated 4 years ago
- Additional environments compatible with OpenAI gym☆23Mar 11, 2021Updated 5 years ago
- Logarithmic Reinforcement Learning☆28Apr 7, 2023Updated 2 years ago
- ☆12Apr 1, 2025Updated 11 months ago
- Extreme Q-Learning: Max Entropy RL without Entropy☆87Feb 14, 2023Updated 3 years ago
- 该项目可以根据用户给出的上文自动生成下文 该项目是本人的本科毕业设计。项目主要基于GPT-2 Chinese实现。本人的工作主要是用新的语料库进行了几次训练,得出来了一个还凑合的模型。该项目已经初步完成,不再进行进一步的更新。☆12Jun 9, 2020Updated 5 years ago
- PLM: Efficient Peripheral Language Models Hardware-Co-Designed for Ubiquitous Computing☆20Mar 18, 2025Updated last year
- ☆174Oct 9, 2023Updated 2 years ago
- Offline RL experiments☆15Oct 1, 2022Updated 3 years ago
- Pytorch implementation of "Maximum a Posteriori Policy Optimization" with Retrace for Discrete gym environments☆29Sep 10, 2020Updated 5 years ago
- Unity RhythmTimeline☆12Dec 14, 2021Updated 4 years ago
- Code accompanying the paper "Noise Contrastive Alignment of Language Models with Explicit Rewards" (NeurIPS 2024)☆58Nov 8, 2024Updated last year
- ☆13Jul 25, 2023Updated 2 years ago
- We investigate the effect of populations on finding good solutions to the robust MDP☆28Mar 27, 2021Updated 4 years ago
- ☆19Sep 20, 2024Updated last year
- ☆17Aug 3, 2022Updated 3 years ago
- [RAL 2023] transformer + reinforcement learning for navigation + POMPD☆15Jul 19, 2023Updated 2 years ago
- Image Denoising Using Anisotropic Diffusion☆12Jun 15, 2016Updated 9 years ago
- Jax implementation of VIT-VQGAN☆10Jan 25, 2024Updated 2 years ago
- Official implementation for "PEAC: Unsupervised Pre-training for Cross-Embodiment Reinforcement Learning" (NeurIPS 2024)☆19Oct 13, 2024Updated last year
- A Qwen .5B reasoning model trained on OpenR1-Math-220k☆14Oct 11, 2025Updated 5 months ago
- A set of competitive environments for Reinforcement Learning research.☆30Dec 1, 2022Updated 3 years ago
- ☆33Jul 30, 2024Updated last year
- DiWA: Diverse Weight Averaging for Out-of-Distribution Generalization☆31Jan 31, 2023Updated 3 years ago
- Official Codebase for "Aligning Diffusion Behaviors with Q-functions for Efficient Continuous Control" (NeurIPS 2024)☆15Oct 29, 2024Updated last year
- Random parameter environments using gym 0.7.4 and mujoco-py 0.5.7☆20Feb 14, 2019Updated 7 years ago
- ☆13Jul 9, 2021Updated 4 years ago
- Applying PBT optimization technique to different domains☆10Oct 16, 2019Updated 6 years ago
- PyTorch implementation of Count-Based Exploration with Neural Density Models☆10Mar 22, 2018Updated 8 years ago
- Official codebase for Adaptive Online Planning for Continual Lifelong Learning.☆17Mar 26, 2020Updated 5 years ago
- Official repo for vidar and vidarc: video foundation model for robotics.☆40Dec 22, 2025Updated 3 months ago
- A Simple, Distributed and Asynchronous Multi-Agent Reinforcement Learning Framework for Google Research Football AI.☆115Jan 16, 2024Updated 2 years ago
- MVE: model-based value estimation☆11Jul 30, 2018Updated 7 years ago