lvwerra / rl-implementations
This repo contains a set of notebooks to reproduce reinforcement learning algorithms.
☆15 · Updated 2 years ago
Alternatives and similar repositories for rl-implementations
Users who are interested in rl-implementations are comparing it to the libraries listed below.
- Official code for the paper "Context-Aware Language Modeling for Goal-Oriented Dialogue Systems" ☆34 · Updated 2 years ago
- ☆31 · Updated 2 years ago
- This repo contains data and code for the paper "Reasoning over Public and Private Data in Retrieval-Based Systems." ☆46 · Updated last year
- Shakespeare transformer fine-tuned to generate positive sentiment samples using RLHF ☆9 · Updated 2 years ago
- Implementation of "Analysing Mathematical Reasoning Abilities of Neural Models" ☆30 · Updated 2 years ago
- Parallel data preprocessing for NLP and ML. ☆34 · Updated 8 months ago
- Evaluation suite for large-scale language models. ☆126 · Updated 3 years ago
- Google Research ☆46 · Updated 2 years ago
- ☆13 · Updated 6 years ago
- A Toolkit for Distributional Control of Generative Models ☆73 · Updated last year
- Amos optimizer with JEstimator lib. ☆82 · Updated last year
- Functional local implementations of main model parallelism approaches ☆95 · Updated 2 years ago
- A lightweight PyTorch implementation of the Transformer-XL architecture proposed by Dai et al. (2019) ☆37 · Updated 2 years ago
- Train very large language models in Jax. ☆204 · Updated last year
- Agents that build knowledge graphs and explore textual worlds by asking questions ☆79 · Updated last year
- One stop shop for all things carp ☆59 · Updated 2 years ago
- For experiments involving instruct gpt. Currently used for documenting open research questions. ☆71 · Updated 2 years ago
- Experiments on GPT-3's ability to fit numerical models in-context. ☆14 · Updated 2 years ago
- Official repository for the paper "Can You Learn an Algorithm? Generalizing from Easy to Hard Problems with Recurrent Networks" ☆59 · Updated 3 years ago
- Official code release for the paper Coder Reviewer Reranking for Code Generation. ☆45 · Updated 2 years ago
- A case study of efficient training of large language models using commodity hardware. ☆68 · Updated 2 years ago
- An attempt to merge ESBN with Transformers, to endow Transformers with the ability to emergently bind symbols ☆16 · Updated 3 years ago
- ☆41 · Updated 10 months ago
- HomebrewNLP in JAX flavour for maintainable TPU-Training ☆50 · Updated last year
- Code repository for the NAACL 2022 paper "ExSum: From Local Explanations to Model Understanding" ☆64 · Updated 3 years ago
- Implementation of the specific Transformer architecture from PaLM - Scaling Language Modeling with Pathways - in Jax (Equinox framework) ☆187 · Updated 3 years ago
- Automatically generate simple meta-learning tasks from a very large space ☆15 · Updated last year
- Codes and files for the paper Are Emergent Abilities in Large Language Models just In-Context Learning ☆33 · Updated 6 months ago
- Code for running the experiments in Deep Subjecthood: Higher Order Grammatical Features in Multilingual BERT ☆17 · Updated last year
- Collaborative inference of latent diffusion via hivemind ☆12 · Updated 2 years ago