The purpose of this project is to research Artificial Intelligence and Reinforcement Learning. In the AI Arena, multiple agents can interact with a single environment. After sending its action, each each agent will receive a reward. This allows agents to learn, improve their behavior and to adapt to each other. Interesting phenomena can arise..…
☆39Oct 31, 2017Updated 8 years ago
Alternatives and similar repositories for AI_Arena
Users that are interested in AI_Arena are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Tweaks to Flash ver. of Notch's Infinite Mario Bros: Working score, levels that stay after death, etc☆12Jul 27, 2021Updated 4 years ago
- An experiment with Thompson sampling and TD(0) on a grid world variant☆17Nov 8, 2013Updated 12 years ago
- A Lisp bytecode interpreter for ZX-Spectrum☆16Jul 3, 2018Updated 7 years ago
- IPython Magic Functions☆16Aug 14, 2017Updated 8 years ago
- CS 294: Deep Reinforcement Learning, Spring 2017 Berkeley☆11Feb 19, 2017Updated 9 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Reinforcement Learning Algorithm for Packet Routing☆12Aug 20, 2020Updated 5 years ago
- Train I3D on NTU-RGB+D dataset in keras☆11Feb 5, 2019Updated 7 years ago
- Code for ICLR 2019 paper "Efficient Augmentation via Data Subsampling"☆15Feb 20, 2019Updated 7 years ago
- DataBright: Towards a Global Exchange for Decentralized Data Ownership and Trusted Computation☆13Jun 28, 2018Updated 7 years ago
- Model-Free Episodic Control☆14Jan 12, 2017Updated 9 years ago
- ☆11Feb 22, 2018Updated 8 years ago
- A comparison of RCN/CNN/SVM/KNN on EMNIST-letters dataset☆10Dec 18, 2017Updated 8 years ago
- Python package for virtual screening of generated molecules using autodock-vina and tensorflow☆14Mar 22, 2021Updated 5 years ago
- homework for shenlan's "Motion Planning For Mobile Robots "☆15May 14, 2020Updated 6 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- coding examples to Intro to RL☆13Apr 30, 2018Updated 8 years ago
- Keras implementation of guide actor-critic for continuous control☆11Mar 12, 2018Updated 8 years ago
- Matlab code for learning doubly sparse dictionary on synthetic data. Details can be found in the paper "A Provable Approach for Double-Sp…☆11Mar 5, 2018Updated 8 years ago
- MIT CADR original verilog and simulator☆18Jan 2, 2016Updated 10 years ago
- Train and Visualize Binary Neural Networks (Code for: The High-Dimensional Geometry of Binary Neural Networks)☆13Jan 31, 2018Updated 8 years ago
- Basic assembler for the basic CPU at https://embeddedmicro.com/tutorials/lucid/basic-cpu☆15Aug 19, 2015Updated 10 years ago
- growing interpretable part graphs on convnets via multi-shot learning, in AAAI 2017☆16May 28, 2017Updated 9 years ago
- This repository contains implementations of the paper VUSFA☆14Mar 31, 2021Updated 5 years ago
- Repository for code experimenting with RL and Solar Tracking☆13Apr 24, 2018Updated 8 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Reproducing the reinforcement learning models used in "Emergence of Linguistic Communication from Referential Games with Symbolic and Pix…☆12Jun 23, 2018Updated 7 years ago
- Official gym API for game FightingICE.☆15Feb 17, 2024Updated 2 years ago
- Comparison between Sarsa and Q-Learning algorithms on risk handling☆17Jul 10, 2017Updated 8 years ago
- Code to automate multiple-camera calibration (2D or 3D) with oriented chessboard using AprilTag. Either for static or eye-in-hand applica…☆14Jun 22, 2022Updated 3 years ago
- a motion detector for video; written with OpenCV☆12Nov 3, 2022Updated 3 years ago
- Re-Implementation of Gaussian Process Latent Variable Model algorithm & performance assessment against Kernel-PCA☆15Oct 9, 2024Updated last year
- Source Code for 'Deep Reinforcement Learning in Unity' by Abhilash Majumder☆18Feb 24, 2021Updated 5 years ago
- Balance chemical equations☆10May 26, 2022Updated 4 years ago
- This repo is for reproducing our results in “Activation Maximization Generative Adversarial Nets”.☆11Sep 26, 2020Updated 5 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Contains an implementation of "Imitation Learning via Kernel Mean Embedding (2018, AAAI)"☆11Oct 2, 2018Updated 7 years ago
- ☆14May 24, 2018Updated 8 years ago
- Recources to build the MFOS - Noise Toaster Synth by Ray Wilson☆17Mar 25, 2024Updated 2 years ago
- Sinclair ZX Spectrum 48 emulator in Java☆18Oct 26, 2012Updated 13 years ago
- Rework of PUBG game into Minecraft. Includes custom minigames such as Domination, Deathmatch and of course Battle Royale☆10May 11, 2025Updated last year
- Notes for Deep Learning Papers☆19Sep 21, 2018Updated 7 years ago
- quaprogIP solver for Non-Convex quadratic programs☆11Jun 28, 2019Updated 6 years ago