akshaykhadse / reinforcement-learning
Implementations of basic concepts under the Reinforcement Learning umbrella. This project is a collection of assignments from CS747: Foundations of Intelligent and Learning Agents (Autumn 2017) at IIT Bombay.
☆16 · May 21, 2018 · Updated 7 years ago
Alternatives and similar repositories for reinforcement-learning
Users who are interested in reinforcement-learning are comparing it to the repositories listed below.
- Thompson Sampling for Bandits using UCB policy ☆10 · Jul 29, 2017 · Updated 8 years ago
- ☆11 · Sep 18, 2025 · Updated 4 months ago
- A lightweight packet-level OMNeT++ simulator designed to simulate large FatTree data center networks. ☆11 · Nov 19, 2013 · Updated 12 years ago
- rtsp stream to hls ☆10 · Mar 3, 2020 · Updated 5 years ago
- solver for discrete Mixed Observable Markov Decision Processes ☆11 · Oct 30, 2020 · Updated 5 years ago
- AMP active measurement client software ☆10 · Jul 6, 2025 · Updated 7 months ago
- Julia implementations of temporal difference Reinforcement Learning algorithms like Q-Learning and SARSA ☆13 · Nov 16, 2025 · Updated 2 months ago
- import library.zip in memory, with the interface the same as zipimport. ☆10 · Jan 10, 2026 · Updated last month
- ☆12 · Jun 18, 2023 · Updated 2 years ago
- Codes for Evolving Plastic ANNs ☆14 · Dec 18, 2022 · Updated 3 years ago
- hls player, flv player, simple MSE player (hls & fmp4 & flv live), functional style ☆10 · May 2, 2020 · Updated 5 years ago
- Implementation of the X-armed Bandits algorithm, as detailed in the paper, "X-armed Bandits", Bubeck et al., 2011. ☆11 · Jul 12, 2018 · Updated 7 years ago
- 3rd placed submission to the NeurIPS MineRL competition 2019 ☆10 · Mar 24, 2023 · Updated 2 years ago
- ns2 repository snagged from the debian git repo, with van jacobson and kathie nichol's codel and fq_codel implementations ☆12 · Aug 24, 2012 · Updated 13 years ago
- An open source reinforcement learning codebase with a variety of intrinsic exploration methods implemented in PyTorch. ☆11 · Feb 6, 2023 · Updated 3 years ago
- Automatic code generator for training Reinforcement Learning policies ☆11 · Jan 3, 2021 · Updated 5 years ago
- Multi-path UDP protocol - an example implementation ☆10 · Jul 6, 2015 · Updated 10 years ago
- Integration and automation of NS-3 network simulator and Linux Containers ☆12 · Nov 12, 2019 · Updated 6 years ago
- Synthesize bio-plausible neural networks for cognitive tasks, mimicking brain architecture ☆11 · Apr 14, 2021 · Updated 4 years ago
- Free, fast web-conferencing and webrtc-as-a-service. ☆10 · Feb 9, 2022 · Updated 4 years ago
- Caffe implementation of "Two-Stage Convolutional Network for Image Super-Resolution" (ICPR 2018) ☆10 · Dec 4, 2018 · Updated 7 years ago
- ☆11 · May 27, 2023 · Updated 2 years ago
- Contrastive UCB: Provably Efficient Contrastive Self-Supervised Learning in Online Reinforcement Learning ☆11 · Jun 16, 2022 · Updated 3 years ago
- This is a project using Pytorch to fulfill reinforcement learning on a simple game - Gridworld ☆13 · Jul 13, 2020 · Updated 5 years ago
- ☆12 · Aug 12, 2022 · Updated 3 years ago
- Contextual Combinatorial Cascading Bandits ☆10 · Jun 30, 2016 · Updated 9 years ago
- Code for "Dream and Search to Control: Latent Space Planning for Continuous Control" ☆12 · Jul 12, 2021 · Updated 4 years ago
- PyTorch implementation of Optimistic Adam proposed in Training GANs with Optimism (https://arxiv.org/pdf/1711.00141.pdf) ☆20 · Jan 16, 2021 · Updated 5 years ago
- Python Bindings to the Lean Theorem Prover http://leanprover.github.io/ ☆13 · Sep 12, 2017 · Updated 8 years ago
- OpenAI Gym compatible reinforcement learning environment for Space Fortress https://arxiv.org/abs/1809.02206 ☆11 · Aug 30, 2024 · Updated last year
- Minimalistic port of NanoGUI claim works with SDL API w/o external dependencies. ☆12 · Sep 4, 2019 · Updated 6 years ago
- Very dirty approach to boot U-Boot on Samsung I9300. NOTE: It does NOT boot kernel and will eat your cat. For an actually working bootloa… ☆14 · Aug 24, 2013 · Updated 12 years ago
- Routing with reinforcement learning ☆10 · Apr 9, 2022 · Updated 3 years ago
- Heuristic Dynamic Programming with Python ☆14 · Jul 28, 2014 · Updated 11 years ago
- This is a project based on OpenAI's multi-agent-emergence-environments (Emergent Tool Use from Multi-Agent Autocurricula, Baker et al.), … ☆13 · Jan 5, 2021 · Updated 5 years ago
- A C++ implementation of Network Simplex Algorithm ☆11 · Nov 12, 2018 · Updated 7 years ago
- This repository contains the open source code used to generate the simulation results shown in the manuscript "Jaeyoung Lee and Richard S… ☆12 · May 21, 2021 · Updated 4 years ago
- a little library to help me with things involving Koopman operators ☆12 · Mar 3, 2022 · Updated 3 years ago
- Landing a rocket in unity3d simulation using python ☆14 · Jul 18, 2021 · Updated 4 years ago