☆25Jun 30, 2022Updated 4 years ago
Alternatives and similar repositories for Competition_Olympics-Integrated
Users that are interested in Competition_Olympics-Integrated are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆14Jul 5, 2021Updated 4 years ago
- Codes accompanying the paper "Offline Reinforcement Learning with Value-Based Episodic Memory" (ICLR 2022 https://arxiv.org/abs/2110.0979…☆15Mar 9, 2022Updated 4 years ago
- Code repository for On the interaction between supervision and self-play in emergent communication (ICLR 2020)☆15Feb 4, 2020Updated 6 years ago
- Population-Based Training in Python☆18Oct 8, 2018Updated 7 years ago
- ☆73Feb 4, 2024Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Scalable Multi-Agent Reinforcement Learning☆15Dec 25, 2021Updated 4 years ago
- Additional environments compatible with OpenAI gym☆23Mar 11, 2021Updated 5 years ago
- ☆12Apr 1, 2025Updated last year
- original source code of the ASE 2019 paper: Wuji: Automatic Online Combat Game Testing Using Evolutionary Deep Reinforcement Learning☆28Jun 8, 2020Updated 6 years ago
- Extreme Q-Learning: Max Entropy RL without Entropy☆88Feb 14, 2023Updated 3 years ago
- 该项目可以根据用户给出的上文自动生成下文 该项目是本人的本科毕业设计。项目主要基于GPT-2 Chinese实现。本人的工作主要是用新的语料库进行了几次训练,得出来了一个还凑合的模型。该项目已经初步完成,不再进行进一步的更新。☆12Jun 9, 2020Updated 6 years ago
- Making maps from DOOM in Rust☆11Mar 12, 2018Updated 8 years ago
- ☆174Oct 9, 2023Updated 2 years ago
- Offline RL experiments☆15Oct 1, 2022Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- [TPAMI] "Symbolic Visual Reinforcement Learning: A Scalable Framework with Object-Level Abstraction and Differentiable Expression Search"…☆18Jan 4, 2023Updated 3 years ago
- Generating Counterfactual Explanation Images through Generative Adversarial Learning☆12Jul 1, 2021Updated 5 years ago
- PLM: Efficient Peripheral Language Models Hardware-Co-Designed for Ubiquitous Computing☆21Mar 18, 2025Updated last year
- Pytorch implementation of "Maximum a Posteriori Policy Optimization" with Retrace for Discrete gym environments☆29Sep 10, 2020Updated 5 years ago
- Unity RhythmTimeline☆13Dec 14, 2021Updated 4 years ago
- Code accompanying the paper "Noise Contrastive Alignment of Language Models with Explicit Rewards" (NeurIPS 2024)☆58Nov 8, 2024Updated last year
- ☆13Jul 25, 2023Updated 2 years ago
- We investigate the effect of populations on finding good solutions to the robust MDP☆29Mar 27, 2021Updated 5 years ago
- Code for "Goal-Conditioned Predictive Coding for Offline Reinforcement Learning" (NeurIPS 2023)☆14Dec 8, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Level-based Foraging (LBF): A multi-agent environment for RL☆211Sep 15, 2024Updated last year
- Jax implementation of VIT-VQGAN☆10Jan 25, 2024Updated 2 years ago
- Official implementation for "PEAC: Unsupervised Pre-training for Cross-Embodiment Reinforcement Learning" (NeurIPS 2024)☆19Oct 13, 2024Updated last year
- ☆33Jul 30, 2024Updated last year
- DiWA: Diverse Weight Averaging for Out-of-Distribution Generalization☆31Jan 31, 2023Updated 3 years ago
- Random parameter environments using gym 0.7.4 and mujoco-py 0.5.7☆20Feb 14, 2019Updated 7 years ago
- Official Codebase for "Aligning Diffusion Behaviors with Q-functions for Efficient Continuous Control" (NeurIPS 2024)☆15Oct 29, 2024Updated last year
- A set of competitive environments for Reinforcement Learning research.☆31Dec 1, 2022Updated 3 years ago
- 中文版hf-alignment-handbook,大模型全套sft、dpo、orpo、cpt训练教程.☆15Aug 25, 2024Updated last year
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- 统计微信朋友圈送出的赞票与得到的赞票人员比例☆11May 3, 2016Updated 10 years ago
- ☆12May 12, 2026Updated last month
- ☆13Jul 9, 2021Updated 4 years ago
- Applying PBT optimization technique to different domains☆10Oct 16, 2019Updated 6 years ago
- PyTorch implementation of Count-Based Exploration with Neural Density Models☆10Mar 22, 2018Updated 8 years ago
- Official codebase for Adaptive Online Planning for Continual Lifelong Learning.☆17Mar 26, 2020Updated 6 years ago
- 这是一个适配7.0模型的pyqt界面2☆18May 5, 2023Updated 3 years ago