MichaelEinhorn / trl-textworldLinks
☆12Updated 2 years ago
Alternatives and similar repositories for trl-textworld
Users that are interested in trl-textworld are comparing it to the libraries listed below
Sorting:
- Code accompanying the paper Pretraining Language Models with Human Preferences☆181Updated last year
- datasets from the paper "Towards Understanding Sycophancy in Language Models"☆86Updated last year
- ☆34Updated last year
- ☆27Updated last year
- ☆180Updated 2 years ago
- Open-source Human Feedback Library☆11Updated last year
- Official Code for M-RᴇᴡᴀʀᴅBᴇɴᴄʜ: Evaluating Reward Models in Multilingual Settings (ACL 2025 Main)☆33Updated 2 months ago
- ☆13Updated 2 years ago
- ☆22Updated last year
- Code and Data Repo for the CoNLL Paper -- Future Lens: Anticipating Subsequent Tokens from a Single Hidden State☆18Updated last year
- ☆32Updated 3 weeks ago
- ☆159Updated 2 years ago
- ☆72Updated 2 years ago
- Utilities for the HuggingFace transformers library☆70Updated 2 years ago
- ☆29Updated last year
- A repository for transformer critique learning and generation☆90Updated last year
- A library for efficient patching and automatic circuit discovery.☆74Updated 3 weeks ago
- The official code of EMNLP 2022, "SCROLLS: Standardized CompaRison Over Long Language Sequences".☆70Updated last year
- ☆39Updated last year
- The official code of LM-Debugger, an interactive tool for inspection and intervention in transformer-based language models.☆178Updated 3 years ago
- Official code from the paper "Offline RL for Natural Language Generation with Implicit Language Q Learning"☆209Updated 2 years ago
- ☆21Updated last year
- The GitHub repo for Goal Driven Discovery of Distributional Differences via Language Descriptions☆70Updated 2 years ago
- ☆34Updated 2 years ago
- Materials for "Prompting is not a substitute for probability measurements in large language models" (EMNLP 2023)☆24Updated last year
- A library to create and manage configuration files, especially for machine learning projects.☆79Updated 3 years ago
- ☆75Updated last year
- ☆289Updated last year
- Repository for the code of the "PPL-MCTS: Constrained Textual Generation Through Discriminator-Guided Decoding" paper, NAACL'22☆66Updated 2 years ago
- Keeping language models honest by directly eliciting knowledge encoded in their activations.☆209Updated this week