anshradh / trl_customLinks
Applying Reinforcement Learning from Human Feedback to language models to teach them to write short story responses to writing prompts.
☆14Updated 3 years ago
Alternatives and similar repositories for trl_custom
Users that are interested in trl_custom are comparing it to the libraries listed below
Sorting:
- Google Research☆46Updated 3 years ago
- ☆14Updated last year
- Code for Stage-wise Fine-tuning for Graph-to-Text Generation☆26Updated 2 years ago
- ☆44Updated last year
- This repo contains data and code for the paper "Reasoning over Public and Private Data in Retrieval-Based Systems."☆46Updated last year
- Ranking of fine-tuned HF models as base models.☆36Updated 3 months ago
- Finding semantically meaningful and accurate prompts.☆48Updated 2 years ago
- Code for paper "Do Language Models Have Beliefs? Methods for Detecting, Updating, and Visualizing Model Beliefs"☆28Updated 3 years ago
- Embedding Recycling for Language models☆38Updated 2 years ago
- Agents that build knowledge graphs and explore textual worlds by asking questions☆79Updated 2 years ago
- Documentation effort for the BookCorpus dataset☆34Updated 4 years ago
- RATransformers 🐭- Make your transformer (like BERT, RoBERTa, GPT-2 and T5) Relation Aware!☆42Updated 3 years ago
- diagNNose is a Python library that facilitates a broad set of tools for analysing hidden activations of neural models.☆82Updated 2 years ago
- SMASHED is a toolkit designed to apply transformations to samples in datasets, such as fields extraction, tokenization, prompting, batchi…☆35Updated last year
- 🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.☆81Updated 3 years ago
- This is the official PyTorch repo for "UNIREX: A Unified Learning Framework for Language Model Rationale Extraction" (ICML 2022).☆26Updated 2 years ago
- Codes and files for the paper Are Emergent Abilities in Large Language Models just In-Context Learning☆33Updated last year
- Code for Relevance-guided Supervision for OpenQA with ColBERT (TACL'21)☆40Updated 4 years ago
- The repository for the paper "When Do You Need Billions of Words of Pretraining Data?"☆21Updated 5 years ago
- A Benchmark Dataset for Understanding Disfluencies in Question Answering☆64Updated 4 years ago
- ☆13Updated 3 years ago
- Steering Vector Repo from "Extracting Latent Steering Vectors from Pretrained Language Models" - ACL2022 Findings☆11Updated 3 years ago
- Converter from UD-trees to BART representation☆36Updated last year
- ☆40Updated 2 years ago
- ☆30Updated 4 years ago
- Interpretable and efficient predictors using pre-trained language models. Scikit-learn compatible.☆44Updated 2 months ago
- A Toolkit for Distributional Control of Generative Models☆74Updated last month
- Apps built using Inspired Cognition's Critique.☆57Updated 2 years ago
- A diff tool for language models☆44Updated 2 years ago
- This repo contains code to reproduce some of the results presented in the paper "SentenceMIM: A Latent Variable Language Model"☆28Updated 3 years ago