This repo contains a set of notebooks to reproduce reinforcement learning algorithms.
☆16Nov 21, 2022Updated 3 years ago
Alternatives and similar repositories for rl-implementations
Users who are interested in rl-implementations are comparing it to the libraries listed below.
- Applying Reinforcement Learning from Human Feedback to language models to teach them to write short story responses to writing prompts.☆13May 5, 2022Updated 3 years ago
- Implementation of "Analysing Mathematical Reasoning Abilities of Neural Models"☆30Mar 25, 2023Updated 3 years ago
- ☆17Dec 15, 2023Updated 2 years ago
- Using a shared file to exchange data between Unity and Python☆13Oct 30, 2021Updated 4 years ago
- An open-source implementation of Kanerva coding for use in reinforcement learning research☆11Mar 28, 2026Updated last week
- ☆10Nov 23, 2020Updated 5 years ago
- Go tool for converting PDF to Markdown along with images.☆16Feb 15, 2025Updated last year
- Postgres protocol support for finagle☆36Sep 4, 2013Updated 12 years ago
- 📽 Python package to live stream ML-Agents training process from Google Colab to Twitch/YouTube server.☆14Mar 27, 2022Updated 4 years ago
- Recovered from https://archive.softwareheritage.org/browse/origin/directory/?origin_url=https://github.com/uktrade/sqlite-s3vfs☆40Dec 30, 2025Updated 3 months ago
- A Deep Generative Distance-Based Classifier for Out-of-Domain Detection with Mahalanobis Space☆12Jun 21, 2021Updated 4 years ago
- ~ Just Another Persian Compiler☆12Updated this week
- A Unity project to manage multiple runs of the Unity Machine Learning program☆16Jul 22, 2019Updated 6 years ago
- A simple template for TensorFlow's highly efficient CudnnLSTM module☆11Jun 8, 2018Updated 7 years ago
- Continual Learning with Gated Incremental Memories for Sequential Data Processing. IJCNN 2020. Continual Learning with Recurrent Neural N…☆15Oct 13, 2021Updated 4 years ago
- Nuance Dragon Mobile SDK and ObjectAL☆10Sep 22, 2018Updated 7 years ago
- DOTS compatible version of ML-Agents☆20Oct 4, 2021Updated 4 years ago
- Neural Turing Machine☆13Jun 18, 2018Updated 7 years ago
- Reinforcement learning hover bike race in Unity☆17Mar 22, 2021Updated 5 years ago
- A bot for automatically completing the KAIST safety course☆10Aug 29, 2023Updated 2 years ago
- Find the posterior decoding of a long sequence of observations.☆17Jul 29, 2010Updated 15 years ago
- UCI Chess Engine Protocol☆11Aug 11, 2021Updated 4 years ago
- Easy-to-understand applications in Rust, written just for fun☆13Aug 25, 2025Updated 7 months ago
- class and sample code for Kitronik Pico Motor Driver - 5331☆10Dec 1, 2022Updated 3 years ago
- Implementation of Proximal Policy Optimization algorithm on a custom Unity environment.☆17Feb 3, 2022Updated 4 years ago
- Code for reproducing our paper "Low Rank Adapting Models for Sparse Autoencoder Features"☆17Mar 31, 2025Updated last year
- Trains Sparse Autoencoders based on outputs from language models☆11Oct 7, 2024Updated last year
- A Verifier for JVM byte code that you can run off-line with detailed error reporting. Great for compiler writers. Useless for everyone e…☆16Jun 7, 2010Updated 15 years ago
- A Python implementation of tabular MuZero for educational purposes☆21Dec 11, 2019Updated 6 years ago
- Prediction by Partial Matching☆16Apr 3, 2020Updated 6 years ago
- Get rid of AUT login on servers☆14Dec 8, 2018Updated 7 years ago
- ACL 2021 - Defense against Adversarial Attacks in NLP via Dirichlet Neighborhood Ensemble☆17Jun 12, 2023Updated 2 years ago
- ☆25Jul 19, 2014Updated 11 years ago
- Homogeneous Transformation Matrices and Quaternions☆14Apr 24, 2023Updated 2 years ago
- Style transfer in text using cycle-consistent WGANs☆17Jul 11, 2018Updated 7 years ago
- ☆10Mar 13, 2023Updated 3 years ago
- Discriminative Nearest Neighbor Few-Shot Intent Detection by Transferring Natural Language Inference☆51May 1, 2025Updated 11 months ago
- For Certified Robustness to Text Adversarial Attacks by Randomized [MASK]☆17Oct 8, 2024Updated last year
- My PhD thesis. I defended on the 30th of October, 2020! See https://github.com/eleurent/phd-defense/☆16Sep 21, 2021Updated 4 years ago