This repo contains a set of notebooks to reproduce reinforcement learning algorithms.
☆16Nov 21, 2022Updated 3 years ago
Alternatives and similar repositories for rl-implementations
Users that are interested in rl-implementations are comparing it to the libraries listed below
Sorting:
- Applying Reinforcement Learning from Human Feedback to language models to teach them to write short story responses to writing prompts.☆14May 5, 2022Updated 3 years ago
- This package aims to make development with ML-Agents quicker and easier.☆10Oct 31, 2019Updated 6 years ago
- Using a shared file to exchange data between Unity and Python☆13Oct 30, 2021Updated 4 years ago
- Postgres protocol support for finagle☆36Sep 4, 2013Updated 12 years ago
- Translate - a PyTorch Language Library☆10Mar 14, 2019Updated 7 years ago
- A Deep Generative Distance-Based Classifier for Out-of-Domain Detection with Mahalanobis Space☆12Jun 21, 2021Updated 4 years ago
- PrivacyGLUE: A Benchmark Dataset for General Language Understanding in Privacy Policies☆17Sep 5, 2023Updated 2 years ago
- ☆11Mar 6, 2022Updated 4 years ago
- ☆12May 18, 2022Updated 3 years ago
- A simple yet fairly fast scheme byte code interpreter written in ANSI C.☆14Mar 28, 2021Updated 4 years ago
- Nuance Dragon Mobile SDK and ObjectAL☆10Sep 22, 2018Updated 7 years ago
- Generate random playable mazes☆19Aug 12, 2017Updated 8 years ago
- Tab component states in the browser's URL.☆14Feb 28, 2024Updated 2 years ago
- Find the posterior decoding of a long sequence of observations.☆17Jul 29, 2010Updated 15 years ago
- UCI Chess Engine Protocol☆11Aug 11, 2021Updated 4 years ago
- Cray Chapel scheduler for Apache Mesos☆22Mar 3, 2014Updated 12 years ago
- Directed masked autoencoders☆14Updated this week
- class and sample code for Kitronik Pico Motor Driver - 5331☆10Dec 1, 2022Updated 3 years ago
- https://pypi.org/project/intent-suggestions/☆10Sep 6, 2022Updated 3 years ago
- Implementation of Proximal Policy Optimization algorithm on a custom Unity environment.☆17Feb 3, 2022Updated 4 years ago
- Code for reproducing our paper "Low Rank Adapting Models for Sparse Autoencoder Features"☆17Mar 31, 2025Updated 11 months ago
- various web scrapers as examples☆17Oct 10, 2020Updated 5 years ago
- Trains Sparse Autoencoders based on outputs from language models☆11Oct 7, 2024Updated last year
- A Verifier for JVM byte code that you can run off-line with detailed error reporting. Great for compiler writers. Useless for everyone e…☆16Jun 7, 2010Updated 15 years ago
- A python implemenation of tabular MuZero for educational purposes☆21Dec 11, 2019Updated 6 years ago
- A Maze Game Using HTML5 Canvas☆11Nov 30, 2015Updated 10 years ago
- ACL 2021 - Defense against Adversarial Attacks in NLP via Dirichlet Neighborhood Ensemble☆18Jun 12, 2023Updated 2 years ago
- Leaning hard attention model by policy gradient with rewards based on active inference.☆22Sep 9, 2017Updated 8 years ago
- 📊 Soothing pastel theme for sc-im☆26Mar 30, 2025Updated 11 months ago
- Prevent tailscale from using specific network interfaces☆17Feb 4, 2026Updated last month
- Official code for the paper "Context-Aware Language Modeling for Goal-Oriented Dialogue Systems"☆34Dec 9, 2022Updated 3 years ago
- ☆11Mar 13, 2023Updated 3 years ago
- Discriminative Nearest Neighbor Few-Shot Intent Detection by Transferring Natural Language Inference☆51May 1, 2025Updated 10 months ago
- Recurrent Neural Network in Go☆14Jun 11, 2016Updated 9 years ago
- This repo contains the examples for my article about SVG generation on the server via JS☆24May 21, 2014Updated 11 years ago
- ☆14Aug 18, 2024Updated last year
- This is a disciplined Python implementation of the Recursive Least Squares Method☆21Mar 28, 2025Updated 11 months ago
- this is perhaps the most disgusting piece of code I've ever written. vanilla js, not even jquery baby, almost all one file.☆30Aug 12, 2024Updated last year
- ☆14Jun 16, 2023Updated 2 years ago