Implementation of self-play based reinforcement learning for Checkers based on the AlphaGo Zero methods.
☆19May 8, 2018Updated 8 years ago
Alternatives and similar repositories for alpha-nagibator
Users that are interested in alpha-nagibator are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Implementation of Alpha Go Zero algorithm for the game of tic-tac-toe☆16Nov 4, 2017Updated 8 years ago
- An implementation of the Sequence to Sequence model using the Lasagne library (WIP)☆12Aug 11, 2016Updated 9 years ago
- TC-bot using Attention-based Recurrent Neural Network (NLU) and SC-LSTM (NLG)☆14Jan 17, 2018Updated 8 years ago
- Deep reinforcement learning of mahjong self-play☆17Aug 1, 2018Updated 7 years ago
- An implementation of the AlphaZero algorithm for adversarial games to be used with the machine learning framework of your choice☆12Aug 30, 2020Updated 5 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Connect4 reinforcement learning by AlphaGo Zero methods.☆113Apr 5, 2021Updated 5 years ago
- Tools for checking if code is ready for python3☆10Sep 18, 2020Updated 5 years ago
- Amacımız Türkçe NLP literatüründeki birçok farklı sorunu bir arada çözebilen, eşsiz yaklaşımlar öne süren ve literatürdeki çalışmaların e…☆210Oct 28, 2023Updated 2 years ago
- Data Science Cheat Sheet is help to remind code with in minute and also useful to recall the code.Collecting at one place so everyone can…☆26Oct 21, 2017Updated 8 years ago
- Collection of lines of code for basics of clean plots in Plotly and Matplotlib☆14Feb 5, 2021Updated 5 years ago
- Code used to run experiments for the ICLR 2023 paper "Computational Language Acquisition with Theory of Mind".☆15Apr 27, 2023Updated 3 years ago
- Contrastive UCB: Provably Efficient Contrastive Self-Supervised Learning in Online Reinforcement Learning☆11Jun 16, 2022Updated 3 years ago
- ☆12Jul 24, 2024Updated last year
- Minimalistic Google Docs based workflow for Distill.pub☆10Jun 14, 2023Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ☆17Oct 12, 2023Updated 2 years ago
- 9기 운영진을 위한 repo입니다.☆12Sep 22, 2024Updated last year
- Adaptable Agent Populations via a Generative Model of Policies☆12Oct 14, 2021Updated 4 years ago
- ☆25Sep 30, 2020Updated 5 years ago
- A leaderboard of human and machine performance on the Arcade Learning Environment (ALE).☆22Aug 27, 2018Updated 7 years ago
- ICLR Reproducibility Challenge for Discriminator-Actor-Critic☆20Jan 7, 2019Updated 7 years ago
- A Tensorflow Implementation of the FastSpeech 2: Fast and High-Quality End-to-End Text to Speech☆11Aug 12, 2020Updated 5 years ago
- A Python library that tees the standard output & standard error from the current process to files on disk, while preserving terminal sema…☆16Aug 17, 2023Updated 2 years ago
- ☆15Jun 2, 2024Updated last year
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- GitHub star history graph☆12Jan 29, 2026Updated 3 months ago
- High level Lean 4 FFI for Rust☆14Mar 16, 2024Updated 2 years ago
- ☆10Oct 11, 2022Updated 3 years ago
- Codebase for BRDiv: Diverse teammate generation for ad hoc teamwork☆13May 2, 2024Updated 2 years ago
- Complexity analysis in Lean☆10Feb 5, 2024Updated 2 years ago
- ☆15Sep 7, 2022Updated 3 years ago
- Lean proof that a normed vector space with compact unit ball is finite dimensional☆11Dec 7, 2019Updated 6 years ago
- Companion repository to "Prompt Compression and Contrastive Conditioning for Controllability and Toxicity Reduction in Language Models"☆14May 31, 2023Updated 2 years ago
- Recognize Dog Deep Learning Training Set☆12Mar 12, 2016Updated 10 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Official gym API for game FightingICE.☆12Jun 27, 2019Updated 6 years ago
- Official code for "A General Learning Framework for Open Ad Hoc Teamwork Using Graph-based Policy Learning"☆17Mar 1, 2023Updated 3 years ago
- Tutorials on how to use EAGERx☆16Aug 14, 2025Updated 8 months ago
- Interesting ATP Proofs☆13Sep 3, 2021Updated 4 years ago
- Learning Task Embeddings for Teamwork Adaptation in Multi-Agent Reinforcement Learning☆15Apr 25, 2024Updated 2 years ago
- Repository for opt-out requests.☆10Mar 25, 2024Updated 2 years ago
- A clean implementation based on AlphaZero for any game in any framework + tutorial + Othello/Gobang/TicTacToe/Connect4 and more☆4,434Jan 1, 2025Updated last year