A python implementation of the concepts in the book "Reinforcement Learning: An Introduction" by R.S. Sutton and A. G. Barto.
☆21Jul 13, 2020Updated 5 years ago
Alternatives and similar repositories for reinforcement-learning-an-introduction
Users that are interested in reinforcement-learning-an-introduction are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆13Sep 8, 2024Updated last year
- Working Memory Attack on LLMs☆18May 27, 2025Updated last year
- ☆15Jul 8, 2023Updated 2 years ago
- The code for experiments conducted to verify the correctness of mirror learning.☆11Jun 3, 2022Updated 4 years ago
- Control of 2D Rayleigh Benard Convection using Deep Reinforcement Learning with Tensorforce and Shenfun.☆22Jul 5, 2023Updated 2 years ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- Our Sentimental LIAR dataset is a modified and further extended version of the LIAR extension introduced by Kirilin et al. In our dataset…☆16Mar 31, 2022Updated 4 years ago
- Unofficial implementation of "Backdooring Instruction-Tuned Large Language Models with Virtual Prompt Injection"☆27Jul 6, 2024Updated last year
- ConvNet Implementation: An Object Oriented Approach using Keras API.☆23Jan 17, 2020Updated 6 years ago
- Simple single file implementations of Reinforcement Learning algorithms in Julia☆23Feb 15, 2025Updated last year
- Source Code for KDD'19 paper "SurfCon: Synonym Discovery on Privacy-Aware Clinical Data"☆10Apr 10, 2020Updated 6 years ago
- Code for the competition "CCKS 2020: 面向中文短文本的实体链指任务" , see https://www.biendata.xyz/competition/ccks_2020_el/☆14Dec 1, 2020Updated 5 years ago
- ☆12Oct 29, 2022Updated 3 years ago
- Various reinforcement learning algorithms written in Jax + Flax☆26Jun 24, 2023Updated 2 years ago
- ☆48Sep 29, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Reinforcement Learning inside a 3D soccer simulation☆38Sep 15, 2024Updated last year
- ☆15May 28, 2020Updated 6 years ago
- A list of research resources that I've appreciated.☆12Dec 10, 2019Updated 6 years ago
- The repo for ACL2021 findings paper - Don't Miss the Labels: Label-semantic Argumented Meta-Learner for Few-Shot Text Classification☆15Mar 24, 2022Updated 4 years ago
- Paper List for Dialogue and Interactive Systems☆15Jun 5, 2020Updated 6 years ago
- AGC 5차 대회 소스 코드 저장소입니다.☆10Jun 6, 2023Updated 3 years ago
- Official implementation of the NeurIPS 2023 paper "Discovering General Reinforcement Learning Algorithms with Adversarial Environment Des…☆36Jun 28, 2024Updated last year
- Dockerfile to create an image with OpenFOAM-plus and PyTorch support☆45Oct 19, 2024Updated last year
- Predicting wave propagation on shallow water with deep neural networks☆23Oct 3, 2023Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- ☆16Nov 20, 2023Updated 2 years ago
- Simple VectorBT Streamlit Backtesting App☆23Jan 29, 2024Updated 2 years ago
- ☆10Oct 12, 2021Updated 4 years ago
- Adversarial Example Attacks on Policy Learners☆40Jul 23, 2020Updated 5 years ago
- Application and blog explaining my interpretations of In-run Data Shapley☆31Jan 30, 2025Updated last year
- Novelty Detection with Reconstruction along Projection Pathway☆10May 10, 2021Updated 5 years ago
- This repo consists of the code as discussed in the Medium blog.☆17Sep 10, 2023Updated 2 years ago
- Machine learning, Deep Learning, CNN with PyTorch☆81Apr 23, 2020Updated 6 years ago
- Finding of ACL2023: Clustering-Aware Negative Sampling for Unsupervised Sentence Representation☆13Oct 16, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Scalable Detection of Concept Drifts on Data Streams with Parallel Adaptive Windowing☆10Dec 14, 2018Updated 7 years ago
- Stock closing and opening forecasting using Deep neural network and LSTM(technical indicators included)☆19Oct 22, 2017Updated 8 years ago
- ACL 2021 - Defense against Adversarial Attacks in NLP via Dirichlet Neighborhood Ensemble☆17Jun 12, 2023Updated 3 years ago
- A hydra integrated template for LG-Dacon Competetion☆13Jul 20, 2021Updated 4 years ago
- A Survey of Neural Dialogue Systems☆19Dec 31, 2021Updated 4 years ago
- LocalStack website☆12Nov 21, 2023Updated 2 years ago
- ☆11Sep 17, 2020Updated 5 years ago