This repo contains a set of notebooks to reproduce reinforcement learning algorithms.
☆16Nov 21, 2022Updated 3 years ago
Alternatives and similar repositories for rl-implementations
Users who are interested in rl-implementations are comparing it to the libraries listed below.
- Applying Reinforcement Learning from Human Feedback to language models to teach them to write short story responses to writing prompts.☆13May 5, 2022Updated 3 years ago
- Implementation of "Analysing Mathematical Reasoning Abilities of Neural Models"☆30Mar 25, 2023Updated 3 years ago
- An open-source implementation of Kanerva coding for use in reinforcement learning research☆11Mar 28, 2026Updated last month
- Repository of examples for the drones demystified! educational project☆11Jun 12, 2017Updated 8 years ago
- ☆10Nov 23, 2020Updated 5 years ago
- Go tool for converting PDF to Markdown along with images.☆17Feb 15, 2025Updated last year
- ☆13Mar 12, 2021Updated 5 years ago
- ~ Just Another Persian Compiler☆12Apr 9, 2026Updated 3 weeks ago
- PrivacyGLUE: A Benchmark Dataset for General Language Understanding in Privacy Policies☆18Sep 5, 2023Updated 2 years ago
- A Unity project to manage multiple runs of the Unity Machine Learning program☆16Jul 22, 2019Updated 6 years ago
- A simple template for TensorFlow's highly efficient CudnnLSTM module☆11Jun 8, 2018Updated 7 years ago
- ☆11Mar 6, 2022Updated 4 years ago
- ☆12May 18, 2022Updated 3 years ago
- Source code for ScaleGrad☆19Dec 28, 2021Updated 4 years ago
- DOTS compatible version of ML-Agents☆20Oct 4, 2021Updated 4 years ago
- Fibonacci sequence in every language by every algorithm☆16Apr 19, 2020Updated 6 years ago
- Reinforcement learning hover bike race in Unity☆17Mar 22, 2021Updated 5 years ago
- Tab component states in the browser's URL.☆14Feb 28, 2024Updated 2 years ago
- Paraphrase Generation Using Deep Reinforcement Learning - MSc Thesis☆18Jun 10, 2020Updated 5 years ago
- Learn Generalized Representations of Video Games from Pixels | New Sports10 Dataset (175 games)☆20Jun 21, 2021Updated 4 years ago
- Easy-to-understand applications in Rust, just for fun☆13Aug 25, 2025Updated 8 months ago
- ☆18Oct 18, 2021Updated 4 years ago
- Directed masked autoencoders☆14Mar 25, 2026Updated last month
- JAX implementation of Large Language Models. You can train a GPT-2-like model with 青空文庫 (aozora bunko-clean dataset) or any other text dat…☆13Aug 5, 2024Updated last year
- Implementing an agent for Tetris (GB) using a genetic algorithm☆19Jul 25, 2024Updated last year
- Code for reproducing our paper "Low Rank Adapting Models for Sparse Autoencoder Features"☆17Mar 31, 2025Updated last year
- Trains Sparse Autoencoders based on outputs from language models☆11Oct 7, 2024Updated last year
- An open source implementation of CLIP☆22Nov 6, 2024Updated last year
- A Python implementation of tabular MuZero for educational purposes☆21Dec 11, 2019Updated 6 years ago
- Get rid of AUT login on servers☆14Dec 8, 2018Updated 7 years ago
- Materials for "Transformers from the Ground Up" at PyData Jeddah on August 5, 2021☆20Aug 5, 2021Updated 4 years ago
- Style transfer in text using cycle-consistent WGANs☆17Jul 11, 2018Updated 7 years ago
- 📊 Soothing pastel theme for sc-im☆26Mar 30, 2025Updated last year
- Imagination Augmented Agents in TensorFlow☆20Oct 21, 2018Updated 7 years ago
- Official code for the paper "Context-Aware Language Modeling for Goal-Oriented Dialogue Systems"☆34Dec 9, 2022Updated 3 years ago
- ☆10Mar 13, 2023Updated 3 years ago
- Discriminative Nearest Neighbor Few-Shot Intent Detection by Transferring Natural Language Inference☆51May 1, 2025Updated last year
- Versatile, flexible and dynamic launch configurations for the Robot Operating System (ROS 1) using Python (v2 & 3)☆12Apr 12, 2023Updated 3 years ago
- ~ Implementation of LSTM ANN in FPGA with VHDL☆10Apr 9, 2026Updated 3 weeks ago