This is a pip package implementing Reinforcement Learning algorithms in non-stationary environments supported by the OpenAI Gym toolkit.
☆33Jun 5, 2019Updated 6 years ago
Alternatives and similar repositories for dyna-gym
Users that are interested in dyna-gym are comparing it to the libraries listed below
Sorting:
- Continual Reinforcement Learning in 3D Non-stationary Environments☆39Jun 16, 2019Updated 6 years ago
- ☆119Jul 17, 2024Updated last year
- Finetuning InstructLLaMA on consumer hardware (copy from https://github.com/tloen/alpaca-lora)☆11Mar 17, 2023Updated 2 years ago
- Bayesian Uncertainty Exploration in Deep Reinforcement Learning☆18Jul 12, 2017Updated 8 years ago
- BBRL is a C++ open-source library used to compare Bayesian reinforcement learning algorithms☆34Feb 18, 2016Updated 10 years ago
- NHS England PhD Internship Projects Pages☆19Oct 3, 2025Updated 5 months ago
- Asynchronous Advantage Actor Critic☆20Aug 15, 2016Updated 9 years ago
- My personal web page☆11Feb 17, 2026Updated 2 weeks ago
- Drift detection module for machine learning pipelines.☆24Jun 21, 2023Updated 2 years ago
- A Deep Reinforcement Learning neural net for an original Multi-Dimensional Pairs Trading strategy is proposed☆21Dec 11, 2018Updated 7 years ago
- The World's Most Difficult video game☆32Dec 24, 2025Updated 2 months ago
- Labels calculation&visualisation - comes with a small BTC/USDT database. Part of my research. Integral part of: https://arxiv.org/abs/201…☆27Aug 5, 2022Updated 3 years ago
- A script for collecting the PubMed Central dataset in a language modelling friendly format.☆25Feb 16, 2021Updated 5 years ago
- A multi-agent mind implemented using LLMs engaged in ongoing conversation☆25Mar 1, 2023Updated 3 years ago
- ChatGPT Participates in a Computer Science Exam (2023)☆31Mar 21, 2023Updated 2 years ago
- WeatherFusionNet - our solution to the NeurIPS 2022 Weather4cast competition☆33Nov 30, 2023Updated 2 years ago
- SafeLife: safety benchmarks for reinforcement learning agents☆61May 13, 2021Updated 4 years ago
- StepCoder: Improve Code Generation with Reinforcement Learning from Compiler Feedback☆74Aug 31, 2024Updated last year
- Program and links to the material for the GloBIAS Training School 2025, Kobe, Japan.☆22Oct 27, 2025Updated 4 months ago
- Talk to your CSV: how to Visualize Your Data with Langchain and Streamlit☆29Aug 26, 2023Updated 2 years ago
- JAX-LOB: A GPU-Accelerated limit order book simulator to unlock large scale reinforcement learning for trading☆44Oct 22, 2023Updated 2 years ago
- Financial Analysis and Algorithmic Trading Strategies in Python☆11Feb 16, 2023Updated 3 years ago
- A sample Java gRPC client for the Salesforce Pub/Sub API☆12Oct 9, 2024Updated last year
- LLM-powered Q/A over arXiv preprints☆32Apr 5, 2023Updated 2 years ago
- Value iteration, policy iteration, and Q-Learning in a grid-world MDP.☆28Dec 12, 2023Updated 2 years ago
- Implementation of Log Gaussian Cox Process in Python for Changepoint Detection using GPFlow☆38Mar 24, 2023Updated 2 years ago
- RL algorithm for stock trading with multiple reward functions☆11Apr 21, 2024Updated last year
- Basic operations prototype/syntax for developers☆12Mar 12, 2023Updated 2 years ago
- A suite of open-ended, non-imitative tasks involving generalizable skills for large language model chatbots and agents to enable bootstra…☆44Jan 31, 2025Updated last year
- ☆36Mar 5, 2025Updated last year
- Implementation of the model from "Faster sorting algorithms discovered using deep reinforcement learning" that discovered an all-new ult…☆11Aug 29, 2023Updated 2 years ago
- ☆11Sep 17, 2024Updated last year
- Here, I provided the solution for exercises of IBM Quantum Challenge 2020☆10Oct 27, 2020Updated 5 years ago
- Neural Error Mitigation of Near-Term Quantum Simulations (arXiv:2105.08086)☆10Jul 6, 2022Updated 3 years ago
- This is a demo project to compare two web scrapping frameworks, Playwright and Selenium and using the new Pipelining tool Dagster☆15Sep 9, 2021Updated 4 years ago
- MirMachine, a command line tool to detect microRNA homologs in genome sequences.☆13Dec 3, 2025Updated 3 months ago
- ☆34Jul 17, 2020Updated 5 years ago
- FinanceGPT-B☆10Mar 26, 2024Updated last year
- Material for the course Theories of Quantum Matter at the University of Cambridge☆11Jan 20, 2023Updated 3 years ago