anshradh / trl_custom
Applying Reinforcement Learning from Human Feedback to language models to teach them to write short story responses to writing prompts.
☆14 · Updated 2 years ago
Related projects:
- Ranking of fine-tuned HF models as base models. ☆35 · Updated last year
- Training and evaluation code for the paper "Headless Language Models: Learning without Predicting with Contrastive Weight Tying" (https:/… ☆22 · Updated 5 months ago
- Embedding Recycling for Language models ☆38 · Updated last year
- SMASHED is a toolkit designed to apply transformations to samples in datasets, such as fields extraction, tokenization, prompting, batchi… ☆30 · Updated 3 months ago
- Training a model without a dataset for natural language inference (NLI)