The purpose of this project is to research Artificial Intelligence and Reinforcement Learning. In the AI Arena, multiple agents can interact with a single environment. After sending its action, each agent receives a reward. This allows agents to learn, improve their behavior, and adapt to each other. Interesting phenomena can arise…
☆36Oct 31, 2017Updated 8 years ago
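The interaction pattern described above (several agents act in one shared environment, and each receives its own reward) can be sketched roughly as follows. This is a minimal illustration, not the AI_Arena API: `CoordinationEnv`, `EpsilonGreedyAgent`, and `run` are hypothetical names, and the coordination-game reward rule is an assumption chosen only to keep the example small.

```python
import random


class CoordinationEnv:
    """Hypothetical shared environment (not part of AI_Arena).

    Each agent's reward is the fraction of the other agents
    that chose the same action in this step.
    """

    def __init__(self, n_agents):
        if n_agents < 2:
            raise ValueError("need at least two agents")
        self.n_agents = n_agents

    def step(self, actions):
        if len(actions) != self.n_agents:
            raise ValueError("expected one action per agent")
        rewards = []
        for i, action in enumerate(actions):
            others = [a for j, a in enumerate(actions) if j != i]
            matches = sum(1 for a in others if a == action)
            rewards.append(matches / len(others))
        return rewards


class EpsilonGreedyAgent:
    """Hypothetical learner that keeps a running mean reward per action."""

    def __init__(self, n_actions, epsilon, rng):
        self.values = [0.0] * n_actions
        self.counts = [0] * n_actions
        self.epsilon = epsilon
        self.rng = rng

    def act(self):
        # Explore with probability epsilon, otherwise pick the best-known action.
        if self.rng.random() < self.epsilon:
            return self.rng.randrange(len(self.values))
        return max(range(len(self.values)), key=self.values.__getitem__)

    def learn(self, action, reward):
        # Incremental update of the mean reward observed for this action.
        self.counts[action] += 1
        self.values[action] += (reward - self.values[action]) / self.counts[action]


def run(n_agents=2, steps=500, seed=0):
    rng = random.Random(seed)
    env = CoordinationEnv(n_agents)
    agents = [EpsilonGreedyAgent(2, 0.1, rng) for _ in range(n_agents)]
    history = []
    for _ in range(steps):
        actions = [agent.act() for agent in agents]   # every agent sends an action
        rewards = env.step(actions)                   # one shared environment step
        for agent, action, reward in zip(agents, actions, rewards):
            agent.learn(action, reward)               # each agent gets its own reward
        history.append(rewards)
    return history
```

Because every agent's reward depends on what the others chose, each learner is effectively adapting to the others' changing behavior, which is the setting the description refers to.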
Alternatives and similar repositories for AI_Arena
Users that are interested in AI_Arena are comparing it to the libraries listed below.
- Feasible target propagation code for the paper "Deep Learning as a Mixed Convex-Combinatorial Optimization Problem" by Friesen & Domingos…☆28Apr 12, 2018Updated 7 years ago
- A Lisp bytecode interpreter for ZX-Spectrum☆15Jul 3, 2018Updated 7 years ago
- This is the official implementation of the ICML 2023 paper - Can Forward Gradient Match Backpropagation ?☆13May 31, 2023Updated 2 years ago
- CS 294: Deep Reinforcement Learning, Spring 2017 Berkeley☆11Feb 19, 2017Updated 9 years ago
- Workshop on the future of gradient-based machine learning software, NIPS 2017, 2016☆15Jan 8, 2018Updated 8 years ago
- Train I3D on NTU-RGB+D dataset in keras☆11Feb 5, 2019Updated 7 years ago
- [INACTIVE] A bunch of artificial intelligence algorithms☆12May 14, 2016Updated 9 years ago
- Introduction to Reinforcement Learning in Python☆13Oct 17, 2018Updated 7 years ago
- Code for ICLR 2019 paper "Efficient Augmentation via Data Subsampling"☆15Feb 20, 2019Updated 7 years ago
- Model-Free Episodic Control☆14Jan 12, 2017Updated 9 years ago
- A comparison of RCN/CNN/SVM/KNN on EMNIST-letters dataset☆10Dec 18, 2017Updated 8 years ago
- This is a sample implementation of "TIMERS: Error-Bounded SVD Restart on Dynamic Networks"(AAAI 2018).☆12Jul 4, 2018Updated 7 years ago
- Modelling SQL Injection Using Reinforcement Learning☆20Oct 13, 2021Updated 4 years ago
- Badminton court and players detection using OpenCV.☆14Feb 21, 2018Updated 8 years ago
- Python package for virtual screening of generated molecules using autodock-vina and tensorflow☆14Mar 22, 2021Updated 5 years ago
- coding examples to Intro to RL☆13Apr 30, 2018Updated 7 years ago
- Keras implementation of guide actor-critic for continuous control☆11Mar 12, 2018Updated 8 years ago
- Tools to help execute perfect maneuvers☆19May 25, 2021Updated 4 years ago
- MIT CADR original verilog and simulator☆17Jan 2, 2016Updated 10 years ago
- growing interpretable part graphs on convnets via multi-shot learning, in AAAI 2017☆16May 28, 2017Updated 8 years ago
- Repository for code experimenting with RL and Solar Tracking☆13Apr 24, 2018Updated 7 years ago
- Reproducing the reinforcement learning models used in "Emergence of Linguistic Communication from Referential Games with Symbolic and Pix…☆12Jun 23, 2018Updated 7 years ago
- Official gym API for game FightingICE.☆15Feb 17, 2024Updated 2 years ago
- Comparison between Sarsa and Q-Learning algorithms on risk handling☆17Jul 10, 2017Updated 8 years ago
- Ancient two-player strategy race board game☆12Mar 19, 2024Updated 2 years ago
- Source Code for 'Deep Reinforcement Learning in Unity' by Abhilash Majumder☆18Feb 24, 2021Updated 5 years ago
- a motion detector for video; written with OpenCV☆12Nov 3, 2022Updated 3 years ago
- ☆18Jan 19, 2019Updated 7 years ago
- Website of tracking.js library☆25Jan 18, 2019Updated 7 years ago
- Code for "Boosted Generative Models", AAAI 2018.☆20Dec 26, 2017Updated 8 years ago
- ☆69Sep 21, 2020Updated 5 years ago
- Source code for the following paper(arXiv link): Improved Actor Relation Graph based Group Activity Recognition Zijian Kuang, Xinran Tie☆15Jan 19, 2022Updated 4 years ago
- A document database which stores documents as YAML files. Update, add, remove and view database items by editing files.☆18Feb 25, 2016Updated 10 years ago
- ☆14May 24, 2018Updated 7 years ago
- Notes for Deep Learning Papers☆19Sep 21, 2018Updated 7 years ago
- Forward- and Reverse-Mode Automatic Differentiation for Scala☆21Apr 30, 2011Updated 14 years ago
- Code for a multi-agent particle environment used in the paper "Multi-Agent Actor-Critic for Mixed Cooperative-Competitive Environments"☆2,740Apr 9, 2024Updated last year
- quaprogIP solver for Non-Convex quadratic programs☆11Jun 28, 2019Updated 6 years ago
- Zalt is a home brew Z80 computer with a modern(isch) chipset.☆16Sep 17, 2024Updated last year