Implementing game AI with OpenAI Gym
☆16Oct 6, 2017Updated 8 years ago
Alternatives and similar repositories for Reinforcement-learning-demos-annotated
Users that are interested in Reinforcement-learning-demos-annotated are comparing it to the libraries listed below.
- Training Dou Dizhu with reinforcement learning / doudizhu AI using reinforcement learning.☆19Sep 19, 2019Updated 6 years ago
- This is the official code for my master's thesis☆10Jan 17, 2024Updated 2 years ago
- Developing a FlappyBird game AI based on deep reinforcement learning (DQN)☆16Aug 12, 2019Updated 6 years ago
- [CVPR 2023] The official Pytorch implementation of Re-GAN☆20Jul 11, 2023Updated 2 years ago
- Catch game example translated to TensorFlow☆16May 8, 2017Updated 8 years ago
- Implicit Motion Function - (unofficial) Microsoft recreation☆29Nov 19, 2024Updated last year
- 2048 environment for Reinforcement Learning and DQN algorithm☆40May 27, 2022Updated 3 years ago
- Dense Interspecies Face Embedding (NeurIPS 2022)☆25May 16, 2023Updated 2 years ago
- I2Q: A Fully Decentralized Q-Learning Algorithm☆19Nov 10, 2022Updated 3 years ago
- A curling game example based on reinforcement learning (RL); gradient-descent Sarsa(lambda) + non-uniform radial basis feature representation☆21Jul 5, 2020Updated 5 years ago
- Interface definitions for the Compute@Edge platform in witx.☆15Feb 11, 2022Updated 4 years ago
- Deep Q Learning playing breakout on OpenGym☆23Jun 11, 2018Updated 7 years ago
- 星辰 (Xingchen) newcomer training repository☆24Mar 11, 2017Updated 9 years ago
- Code for paper "Learning Semantic Latent Directions for Accurate and Controllable Human Motion Prediction" (ECCV 2024)☆32Jul 31, 2024Updated last year
- ☆11Jan 3, 2024Updated 2 years ago
- Benchmarks for CNTK and other toolkits.☆44Dec 12, 2015Updated 10 years ago
- HAProxy combined with confd for HTTP load balancing with SSL offloading☆10Feb 5, 2017Updated 9 years ago
- A repository for a Deep Q-Learning approach to intrusion detection for networks cyber-attacks.☆10Sep 3, 2021Updated 4 years ago
- An open-source non-official community implementation of the model from the paper: Surgical Robot Transformer (SRT): Imitation Learning fo…☆12Apr 20, 2026Updated 2 weeks ago
- Multilingual Pre-training with Language and Task Adaptation for Multilingual Text Style Transfer (ACL 2022)☆10Sep 22, 2022Updated 3 years ago
- Official PyTorch implementation of the paper "Neural Congealing: Aligning Images to a Joint Semantic Atlas" (CVPR 2023)☆49Aug 14, 2023Updated 2 years ago
- A fork of the Linux kernel for NVMEoF target driver using PCI P2P capabilities for full I/O path offloading.☆15Jun 20, 2021Updated 4 years ago
- Play flappy bird with DQN, a demo for reinforcement learning, implemented using PyTorch☆71May 2, 2017Updated 9 years ago
- ☆11Jan 11, 2022Updated 4 years ago
- ☆10Dec 10, 2021Updated 4 years ago
- An open source community that focuses on developing and publishing elegant algorithms, models and tools for science big data mining and kn…☆11Jul 27, 2019Updated 6 years ago
- My undergraduate final project - Modeling and control of a distillation column using neural networks and reinforcement learning.☆12Apr 28, 2020Updated 6 years ago
- Reproduction of Curiosity-driven Exploration by Self-supervised Prediction in PyTorch☆13Jun 10, 2019Updated 6 years ago
- ☆10Oct 20, 2022Updated 3 years ago
- Add function calling to text-generation-inference☆13Oct 10, 2023Updated 2 years ago
- Jest preset for Fastly Compute@Edge☆11Mar 12, 2025Updated last year
- Protect workers with TensorFlow Hard Hat object detection model on a Jetson Nano☆10Sep 27, 2022Updated 3 years ago
- Neural network reinforcement Q-learning for an avoidance game☆10Aug 21, 2017Updated 8 years ago
- 🌟 SwarmAgent: A framework for simulating social group dynamics using multi-agent collaboration, aiding insights into collective behavior…☆13Dec 5, 2023Updated 2 years ago
- Various DQN methods with CartPole☆11May 30, 2018Updated 7 years ago
- Official implementation of the UMDQN algorithm presented in the scientific research paper entitled "Distributional Reinforcement Learning…☆11Jun 3, 2022Updated 3 years ago
- Authors' official PyTorch implementation of "Finding Directions in GAN’s Latent Space for Neural Face Reenactment" [BMVC 2022].☆52Sep 26, 2023Updated 2 years ago
- Code for our TSD paper "TOKEN is a MASK: Few-shot Named Entity Recognition with Pre-trained Language Models"☆14Aug 19, 2022Updated 3 years ago
- Thesis in Federated Learning using an Edge/Cloud Computing architecture☆10Feb 26, 2021Updated 5 years ago