anshradh/trl_custom

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/anshradh/trl_custom)

anshradh / trl_custom

Applying Reinforcement Learning from Human Feedback to language models to teach them to write short story responses to writing prompts.

☆13

Alternatives and similar repositories for trl_custom

Users that are interested in trl_custom are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

lvwerra / rl-implementations
View on GitHub
This repo contains a set of notebooks to reproduce reinforcement learning algorithms.
☆17Nov 21, 2022Updated 3 years ago
web-archive-group / heritrix-walkthrough
View on GitHub
☆10Jun 10, 2016Updated 10 years ago
EleutherAI / semantic-memorization
View on GitHub
☆44Nov 17, 2024Updated last year
facebookresearch / Permutation-Equivariant-Seq2Seq
View on GitHub
Humans understand novel sentences by composing meanings and roles of core language components. In contrast, neural network models for nat…
☆27Apr 23, 2020Updated 6 years ago
neulab / neural-lpcfg
View on GitHub
The Return of Lexical Dependencies: Neural Lexicalized PCFGs (TACL)
☆33Sep 22, 2025Updated 10 months ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
samuki / reinforce-joey
View on GitHub
This is a fork of the awesome Joey-NMT with Reinforcement Learning algorithms like Policy Gradient, MRT and Advantage Actor Critic.
☆27Feb 10, 2023Updated 3 years ago
cmsj / ApplePrivateHeaders
View on GitHub
A smattering of header files dumped using classdump-dyld
☆14Apr 28, 2021Updated 5 years ago
jebivid / adversarial-nmt
View on GitHub
☆10Jun 23, 2018Updated 8 years ago
mnylc / islandora_multi_importer
View on GitHub
This is a flexible, twig based, all cmodel, tabular data to islandora Object importer with optional ZeroMQ processing
☆16Nov 29, 2020Updated 5 years ago
inspire-group / robustness-via-transport
View on GitHub
☆12Sep 26, 2019Updated 6 years ago
HamedMinaeizaeim / twitter_scraper_without_API
View on GitHub
☆11Aug 9, 2022Updated 3 years ago
nmrksic / LEAR
View on GitHub
Specialising Word Vectors for Lexical Entailment
☆29Sep 13, 2018Updated 7 years ago
zh1yu4nyu / CodeIPPrompt
View on GitHub
https://icml.cc/virtual/2023/poster/24354
☆10Aug 15, 2023Updated 2 years ago
haixuanTao / bert-onnx-rs-pipeline
View on GitHub
This demo showcase the use of onnxruntime-rs with a GPU on CUDA 11 to run Bert in a data pipeline with Rust.
☆16Feb 7, 2022Updated 4 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
alexa / ramen
View on GitHub
A software for transferring pre-trained English models to foreign languages
☆20Mar 20, 2023Updated 3 years ago
fpigerre / IssuesDownload
View on GitHub
A java application that downloads GitHub issues to a csv file
☆27Aug 1, 2018Updated 7 years ago
csmutz / pdfrate
View on GitHub
☆13Aug 31, 2024Updated last year
aypan17 / reward-misspecification
View on GitHub
☆10Mar 13, 2023Updated 3 years ago
bghojogh / Generalized-Eigenvalue-Problem
View on GitHub
The code for generalized eigenvalue problem
☆10Jun 11, 2020Updated 6 years ago
JaderDias / download-from-common-crawl
View on GitHub
☆26Mar 20, 2024Updated 2 years ago
dmvaldman / SelectionSearch
View on GitHub
Chrome Extension to find related web links by selecting text on a website and doing semantic search.
☆16Nov 5, 2025Updated 8 months ago
RemiLeblond / SeaRNN-open
View on GitHub
Open source implementation of SeaRNN (ICLR 2018, https://openreview.net/forum?id=HkUR_y-RZ)
☆48Jul 4, 2018Updated 8 years ago
allenai / show-your-work
View on GitHub
Relevant code for the "Show Your Work" paper, EMNLP 2019.
☆18Sep 9, 2019Updated 6 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
allenai / bff
View on GitHub
☆39Apr 17, 2024Updated 2 years ago
liujch1998 / memo-trap
View on GitHub
☆23Jan 25, 2023Updated 3 years ago
ayuLiao / export-yuque
View on GitHub
☆14Sep 30, 2022Updated 3 years ago
zhuotongchen / Towards-Robust-Neural-Networks-via-Close-loop-Control
View on GitHub
☆13Jan 30, 2021Updated 5 years ago
uclanlp / ProbeGrammarRobustness
View on GitHub
Source code for ACL2020: On the Robustness of Language Encoders against Grammatical Errors
☆10Jul 6, 2023Updated 3 years ago
llan-ml / MetaTNE
View on GitHub
Source code for NeurIPS 2020 paper "Node Classification on Graphs with Few-Shot Novel Labels via Meta Transformed Network Embedding"
☆10Nov 17, 2020Updated 5 years ago
lucky-bai / kaggle-speech-recognition
View on GitHub
TensorFlow Speech Recognition Challenge (Top 15%)
☆14Jan 16, 2018Updated 8 years ago
Brenden2008 / taxy-ai-light
View on GitHub
☆16Apr 3, 2023Updated 3 years ago
Manem-Lab / Lung-DDPM
View on GitHub
☆16Aug 29, 2025Updated 10 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
swgillespie / pine
View on GitHub
An ML-like language featuring native compilation.
☆18Jul 6, 2015Updated 11 years ago
bangyen / esolangs
View on GitHub
Interpreters and compilers for esoteric programming languages, from stack-based to modern register-based systems
☆12Sep 18, 2025Updated 10 months ago
OmarMohammed88 / AR-Emotion-Recognition
View on GitHub
An implementation of the paper titled "Arabic Speech Emotion Recognition Employing Wav2vec2.0 and HuBERT Based on BAVED Dataset" https://…
☆16Feb 17, 2022Updated 4 years ago
usaito / kdd2022-tutorial
View on GitHub
☆12Aug 13, 2022Updated 3 years ago
LinxinS97 / NLPBench
View on GitHub
NLPBench: Evaluating NLP-Related Problem-solving Ability in Large Language Models
☆10Oct 27, 2023Updated 2 years ago
bortlip / search-helper
View on GitHub
☆23Apr 2, 2023Updated 3 years ago
yikee / Knowledge_Conflict
View on GitHub
Resolving Knowledge Conflicts in Large Language Models, COLM 2024
☆18Oct 7, 2025Updated 9 months ago