Applying Reinforcement Learning from Human Feedback to language models to teach them to write short story responses to writing prompts.
☆13May 5, 2022Updated 4 years ago
Alternatives and similar repositories for trl_custom
Users that are interested in trl_custom are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This repo contains a set of notebooks to reproduce reinforcement learning algorithms.☆16Nov 21, 2022Updated 3 years ago
- MLE-Guided Parameter Search (AAAI 2021)☆12Sep 16, 2021Updated 4 years ago
- ☆44Nov 17, 2024Updated last year
- ☆10Jun 10, 2016Updated 9 years ago
- This is a joint project between Helmholtz Imaging (located at DKFZ) and Lin Yang and Otmar Schmid (Helmholtz Munich).☆13Nov 6, 2024Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Humans understand novel sentences by composing meanings and roles of core language components. In contrast, neural network models for nat…☆27Apr 23, 2020Updated 6 years ago
- The Return of Lexical Dependencies: Neural Lexicalized PCFGs (TACL)☆33Sep 22, 2025Updated 7 months ago
- DKPro C4CorpusTools is a collection of tools for processing CommonCrawl corpus, including Creative Commons license detection, boilerplate…☆52Jun 12, 2020Updated 5 years ago
- MapReduce FFmpeg☆24Aug 12, 2013Updated 12 years ago
- A software for transferring pre-trained English models to foreign languages☆19Mar 20, 2023Updated 3 years ago
- This is a flexible, twig based, all cmodel, tabular data to islandora Object importer with optional ZeroMQ processing☆16Nov 29, 2020Updated 5 years ago
- Python bot framework for Lexemes on Wikidata☆19Feb 6, 2021Updated 5 years ago
- A java application that downloads GitHub issues to a csv file☆27Aug 1, 2018Updated 7 years ago
- https://icml.cc/virtual/2023/poster/24354☆10Aug 15, 2023Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- A metric learning method to learn a provably robust Mahalanobis distance☆10Jan 29, 2022Updated 4 years ago
- A large database of artificial neural network statistics during training☆15Dec 8, 2020Updated 5 years ago
- Code for reproducing our paper "Low Rank Adapting Models for Sparse Autoencoder Features"☆17Mar 31, 2025Updated last year
- various web scrapers as examples☆17Oct 10, 2020Updated 5 years ago
- Trains Sparse Autoencoders based on outputs from language models☆11Oct 7, 2024Updated last year
- Interactive and static 3D visualisation for functional brain mapping☆17Sep 25, 2024Updated last year
- Discontinuous Hamiltonian Monte Carlo in JAX☆42Feb 24, 2020Updated 6 years ago
- Fooling neural based speech recognition systems.☆14Jun 9, 2017Updated 8 years ago
- This work corroborates a run-time Trojan detection method exploiting STRong Intentional Perturbation of inputs, is a multi-domain Trojan …☆10Mar 7, 2021Updated 5 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Tamil TTS system in PHP☆42Aug 26, 2017Updated 8 years ago
- ☆13Aug 31, 2024Updated last year
- Repository for code from "On Adversarial Removal of Hypothesis-only Bias in Natural Language Inference" (StarSem 2019) and "Don’t Take th…☆15Apr 6, 2020Updated 6 years ago
- ☆10Mar 13, 2023Updated 3 years ago
- ☆38Apr 17, 2024Updated 2 years ago
- Collections-Tamil-Tanslation☆26Apr 5, 2023Updated 3 years ago
- ☆24Nov 19, 2024Updated last year
- ☆23Jan 25, 2023Updated 3 years ago
- ☆14Sep 30, 2022Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Knowledge Infused Decoding☆70Dec 31, 2023Updated 2 years ago
- ☆13Jan 30, 2021Updated 5 years ago
- ☆11Sep 22, 2019Updated 6 years ago
- xlvector's solution of github contest☆33Aug 30, 2009Updated 16 years ago
- Open python perfusion tool for CTP and MRP ||| MRI DSC CT toolbox toolkit |||☆49Nov 20, 2025Updated 5 months ago
- Source code for ACL2020: On the Robustness of Language Encoders against Grammatical Errors☆10Jul 6, 2023Updated 2 years ago
- ☆31Apr 10, 2023Updated 3 years ago