Soccer toy example simulator used in Reinforcement Learning
☆12Mar 11, 2018Updated 8 years ago
Alternatives and similar repositories for ml_soccer
Users that are interested in ml_soccer are comparing it to the libraries listed below.
- Tutorials for the GA Tech OMSCS RLDM class.☆22May 20, 2018Updated 7 years ago
- Avoiding catastrophic failures in reinforcement learning by learning to shape rewards.☆10Nov 13, 2017Updated 8 years ago
- Experimenting with meta-learning approaches to opponent modelling in MARL. Building upon previous public implementations of MADDPG and M3…☆14Apr 26, 2022Updated 3 years ago
- WebTerm is a Terminal emulator that runs in the browser. It uses v86 to create a virtual linux via WebAssembly and xterm.js as the termin…☆17Apr 28, 2021Updated 4 years ago
- Implementation of Robust Adversarial Reinforcement Learning☆14Nov 27, 2017Updated 8 years ago
- Reference implementation for the paper titled "Improving Model-Based Reinforcement Learning with Internal State Representations through S…☆12Feb 10, 2021Updated 5 years ago
- Implementation of the vanilla stochastic (categorical) policy gradient algorithm to play CartPole.☆16Apr 1, 2021Updated 4 years ago
- Random parameter environments using gym 0.7.4 and mujoco-py 0.5.7☆20Feb 14, 2019Updated 7 years ago
- Official implementation for "Towards Safe Reinforcement Learning via Constraining Conditional Value at Risk" (IJCAI 2022)☆26Aug 29, 2024Updated last year
- Self-implemented code for Model-Based Meta-Reinforcement Learning☆17Apr 28, 2019Updated 6 years ago
- General purpose, statically typed, functional programming language☆14Dec 6, 2025Updated 3 months ago
- Fundamentals of AI course focusing on search, multi-agent systems, MDPs, and reinforcement learning algorithms.☆13Oct 29, 2022Updated 3 years ago
- Implementing different learning algorithms and analyzing their performance in a Markov game model called the Soccer Game☆23Jan 29, 2023Updated 3 years ago
- ☆23Mar 17, 2025Updated last year
- [ICLRW'26] EoRA: Fine-tuning-free Compensation for Compressed LLM with Eigenspace Low-Rank Approximation☆29Mar 16, 2026Updated last week
- Deep PILCO PyTorch Implementation☆15Mar 25, 2023Updated 3 years ago
- EEG emotion recognition☆16Dec 9, 2020Updated 5 years ago
- A library to implement counterfactual regret minimization on various abstract strategy games☆18Jan 1, 2025Updated last year
- (Personal project) Pruning algorithm for DNNs using "lottery ticket" pruning☆10Dec 8, 2022Updated 3 years ago
- I am open-sourcing the boilerplate code necessary for Assignment 4 so we can focus on the analysis instead.☆49Apr 30, 2017Updated 8 years ago
- Emacs mode for Pig☆25Mar 8, 2023Updated 3 years ago
- ICLR 2019 Paper, "Characterizing Audio Adversarial Examples using Temporal Dependency".☆12Apr 3, 2019Updated 6 years ago
- Recognizing common speech commands using Keras and Tensorflow.☆10Dec 17, 2018Updated 7 years ago
- Javacomplete is a plugin for vim for java completion.☆21Jul 31, 2012Updated 13 years ago
- Preparing continuous features for neural networks with GaussRank☆45Jan 22, 2018Updated 8 years ago
- This image is the zookeeper base. It comes from rawmind/alpine-jvm8.☆16Sep 24, 2018Updated 7 years ago
- ☆33Apr 29, 2023Updated 2 years ago
- Applying DeepMind's MuZero algorithm to the cart pole environment in gym☆21May 6, 2023Updated 2 years ago
- A tool for calling (and calling out to) large language models.☆16Aug 13, 2024Updated last year
- PyTorch code for CVIU paper "AC-VRNN: Attentive Conditional-VRNN for Multi-Future Trajectory Prediction"☆26Jul 8, 2021Updated 4 years ago
- Course code for Machine Learning with Scala Packt Publishing Course☆15Apr 13, 2016Updated 9 years ago
- Code for paper "Model-based Adversarial Meta-Reinforcement Learning" (https://arxiv.org/abs/2006.08875)☆35Mar 6, 2021Updated 5 years ago
- The official code of "Parameter-Efficient Learning for Text-to-Speech Accent Adaptation"☆13Aug 29, 2023Updated 2 years ago
- Maximum Entropy Population Based Training for Zero-Shot Human-AI Coordination☆26Nov 29, 2022Updated 3 years ago
- Code for the paper "FastAdaSP: An Efficient Multitask Inference Framework for Large Speech Language Models". @ EMNLP'24(Oral)☆17Nov 14, 2024Updated last year
- ☆15Jul 4, 2024Updated last year
- The MIMIC Algorithm Implemented in Python☆24Oct 11, 2016Updated 9 years ago
- ☆17Jul 22, 2024Updated last year
- STRODE: Stochastic Boundary Ordinary Differential Equation☆13Jul 20, 2021Updated 4 years ago