This repo contains a set of notebooks to reproduce reinforcement learning algorithms.
☆16Nov 21, 2022Updated 3 years ago
Alternatives and similar repositories for rl-implementations
Users that are interested in rl-implementations are comparing it to the libraries listed below
Sorting:
- Applying Reinforcement Learning from Human Feedback to language models to teach them to write short story responses to writing prompts.☆14May 5, 2022Updated 3 years ago
- https://pypi.org/project/intent-suggestions/☆10Sep 6, 2022Updated 3 years ago
- Official code for the paper "Context-Aware Language Modeling for Goal-Oriented Dialogue Systems"☆34Dec 9, 2022Updated 3 years ago
- Pequenos projetos e testes simples em linguagem Python.☆11Jan 28, 2018Updated 8 years ago
- A Maze Game Using HTML5 Canvas☆11Nov 30, 2015Updated 10 years ago
- FactNews is the first dataset to predict sentence-level factuality of news reporting. Furthemore, we provide baseline results for sentenc…☆11Jun 12, 2025Updated 8 months ago
- Directed masked autoencoders☆14Feb 20, 2026Updated last week
- web programming course (COMPSCI 326, UMass Amherst)☆14Sep 13, 2022Updated 3 years ago
- Gerador de texto treinado nas obras de João Guimarães Rosa☆11Jul 14, 2021Updated 4 years ago
- The official implementation of the paper "Text Classification in the Wild: a Large-scale Long-tailed Name Normalization Dataset"(ICASSP 2…☆12Feb 19, 2023Updated 3 years ago
- Main repository for Stateoftheart AI's development module.☆44May 11, 2021Updated 4 years ago
- NLPBench: Evaluating NLP-Related Problem-solving Ability in Large Language Models☆10Oct 27, 2023Updated 2 years ago
- SentiStorm - Real-time Twitter Sentiment Classification based on Apache Storm☆10May 22, 2018Updated 7 years ago
- Code for the EMNLP 2022 Findings short paper "SAT: Improving Semi-Supervised Text Classification with Simple Instance-Adaptive Self-Train…☆13Feb 25, 2023Updated 3 years ago
- Neural Turing Machine☆13Jun 18, 2018Updated 7 years ago
- Code for Paper "Effective Multi-agent Reinforcement Learning Control with Relative Entropy Regularization".☆13Sep 27, 2023Updated 2 years ago
- T5Patches is a set of tools for fast and targeted editing of generative language models built with T5X.☆12May 31, 2024Updated last year
- ☆10Jun 16, 2021Updated 4 years ago
- ☆10Oct 3, 2023Updated 2 years ago
- Deep Boltzmann Machines in R^N dimensions☆11May 14, 2014Updated 11 years ago
- Source code for NeurIPS 2020 paper "Node Classification on Graphs with Few-Shot Novel Labels via Meta Transformed Network Embedding"☆10Nov 17, 2020Updated 5 years ago
- Code for EMNLP'24 paper - On Diversified Preferences of Large Language Model Alignment☆16Aug 6, 2024Updated last year
- ☆11Nov 27, 2022Updated 3 years ago
- ~ Implementation of LSTM ANN in FPGA with VHDL☆10Oct 7, 2019Updated 6 years ago
- A simple chatbot sample on chatbase☆11May 18, 2020Updated 5 years ago
- Background materials for the article "Productivity Assessment of Neural Code Completion"☆13Jul 11, 2023Updated 2 years ago
- ~ Just Another Persian Compiler☆12Nov 23, 2021Updated 4 years ago
- ☆11Mar 13, 2023Updated 2 years ago
- ☆14Sep 30, 2022Updated 3 years ago
- ☆11Updated this week
- ☆11Sep 22, 2019Updated 6 years ago
- ☆10Nov 30, 2022Updated 3 years ago
- Pretrained segmenter models for Portuguese legislative text.☆13Oct 13, 2024Updated last year
- AssistBot - Chatbots in JavaScript☆12Dec 25, 2020Updated 5 years ago
- UCI Chess Engine Protocol☆11Aug 11, 2021Updated 4 years ago
- xlvector's solution of github contest☆33Aug 30, 2009Updated 16 years ago
- Google Research☆46Oct 29, 2022Updated 3 years ago
- ☆14Jun 16, 2023Updated 2 years ago
- ☆10Jun 28, 2015Updated 10 years ago