PrasannS/rlhf-length-biases

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/PrasannS/rlhf-length-biases)

PrasannS / rlhf-length-biases

☆27

Alternatives and similar repositories for rlhf-length-biases

Users that are interested in rlhf-length-biases are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

eth-lre / LLM_ICL
View on GitHub
ACL24
☆11Jun 7, 2024Updated 2 years ago
tml-epfl / long-is-more-for-alignment
View on GitHub
Long Is More for Alignment: A Simple but Tough-to-Beat Baseline for Instruction Fine-Tuning [ICML 2024]
☆21May 2, 2024Updated 2 years ago
Data-Intelligence-Lab / DEFT-korean-alpaca
View on GitHub
☆23Oct 30, 2023Updated 2 years ago
openkorpos / model-mecab
View on GitHub
MeCab model trained with OpenKorPos.
☆23Jun 19, 2022Updated 4 years ago
teddysum / korean_evaluation
View on GitHub
☆11Jun 5, 2025Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
saffronh / ccai
View on GitHub
Data processing for the Collective Constitutional AI project (a collaboration between The Collective Intelligence Project & Anthropic)
☆26Oct 17, 2023Updated 2 years ago
3DLLM-Mem / 3DLLM-Mem
View on GitHub
☆27Jun 5, 2025Updated last year
HeegyuKim / korouge
View on GitHub
Google 공식 Rouge Implementation을 한국어에서 사용할 수 있도록 처리
☆17Jan 3, 2024Updated 2 years ago
MrBananaHuman / PangyoCorpora
View on GitHub
☆38Oct 4, 2023Updated 2 years ago
teddysum / Korean_SC_2023
View on GitHub
☆10Oct 28, 2024Updated last year
snunlp / KR-ELECTRA
View on GitHub
KoRean based ELECTRA pre-trained models (KR-ELECTRA) for Tensorflow and PyTorch
☆15Feb 13, 2022Updated 4 years ago
cosmoquester / transformers-bart-pretrain
View on GitHub
Script to pre-train hugginface transformers BART with Tensorflow 2
☆35Apr 13, 2023Updated 3 years ago
TianHongZXY / qaap
View on GitHub
[EMNLP 2023] Question Answering as Programming for Solving Time-Sensitive Questions
☆12Dec 18, 2023Updated 2 years ago
formidable-stella / ShareGPT-translation
View on GitHub
☆21May 24, 2023Updated 3 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
OpenBMB / UltraFeedback
View on GitHub
A large-scale, fine-grained, diverse preference dataset (and models).
☆368Dec 29, 2023Updated 2 years ago
tml-epfl / icl-alignment
View on GitHub
Is In-Context Learning Sufficient for Instruction Following in LLMs? [ICLR 2025]
☆33Jan 23, 2025Updated last year
comet-ml / blog-serving-hugging-face-models
View on GitHub
☆20Apr 28, 2021Updated 5 years ago
ssu-humane / K-HATERS
View on GitHub
Hate speech detection corpus in Korean, shared with EMNLP 2023 paper
☆17Apr 19, 2024Updated 2 years ago
martin-wey / CodeUltraFeedback
View on GitHub
CodeUltraFeedback: aligning large language models to coding preferences (TOSEM 2025)
☆76Jun 25, 2024Updated 2 years ago
ChengpengLi1003 / DotaMath
View on GitHub
☆30Dec 27, 2024Updated last year
nishadsinghi / sc-genrm-scaling
View on GitHub
[COLM 2025] Official code for "When To Solve, When To Verify: Compute-Optimal Problem Solving and Generative Verification for LLM Reasoni…
☆15Oct 31, 2025Updated 8 months ago
songys / huggingface_KoreanDataset
View on GitHub
huggingface에 있는 한국어 데이터 세트
☆37Oct 10, 2024Updated last year
Columbia-NLP-Lab / LionAlignment
View on GitHub
☆12Aug 6, 2024Updated last year
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
prakharguptaz / EDGE-exemplars
View on GitHub
Code for the paper - Controlling Dialogue Generation with Semantic Exemplars (Naacl 2021) A semantic exemplar based retrieve-refine appro…
☆18Mar 26, 2021Updated 5 years ago
nawnoes / NamuwikiExtractor
View on GitHub
나무위키덤프에서 정제된 텍스트를 얻기 위한 NamuwikiExtractor
☆20Feb 27, 2022Updated 4 years ago
openfeedback / superhf
View on GitHub
Open-source Human Feedback Library
☆11Oct 25, 2023Updated 2 years ago
tunib-ai / artwork_captions
View on GitHub
Machine Generated Captions for Best Artworks
☆22Sep 21, 2022Updated 3 years ago
XuchanBao / behavioral-self-awareness
View on GitHub
☆37Feb 20, 2025Updated last year
princeton-nlp / EvalConvQA
View on GitHub
[ACL 2022] Ditch the Gold Standard: Re-evaluating Conversational Question Answering
☆43Jun 18, 2022Updated 4 years ago
Genius1237 / TyDiP
View on GitHub
TyDiP Multilingual Politeness dataset and code
☆12Oct 15, 2023Updated 2 years ago
chemicaltree / tetra
View on GitHub
☆10Sep 14, 2022Updated 3 years ago
CosineAI / experiments
View on GitHub
Open sourced predictions, execution logs, trajectories, and results from model inference + evaluation runs on the SWE-bench task.
☆14Sep 4, 2024Updated last year
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
rosieyzh / openrlhf-pretrain
View on GitHub
Code for "Echo Chamber: RL Post-training Amplifies Behaviors Learned in Pretraining"
☆29Oct 14, 2025Updated 9 months ago
utrerf / robust_transfer_learning
View on GitHub
Accelerating Transfer Learning with Robust Neural Nets
☆11Oct 2, 2020Updated 5 years ago
EIT-NLP / BLEUless_DocMT
View on GitHub
☆14Nov 19, 2024Updated last year
google / wmt-mqm-human-evaluation
View on GitHub
☆100Sep 25, 2025Updated 10 months ago
EIT-NLP / AccuracyParadox-RLHF
View on GitHub
[EMNLP 2024 Main] Official implementation of the paper "The Accuracy Paradox in RLHF: When Better Reward Models Don't Yield Better Langua…
☆12Nov 11, 2024Updated last year
sail-sg / lm-random-memory-access
View on GitHub
☆15Mar 12, 2024Updated 2 years ago
pHaeusler / tic_tac_transformer
View on GitHub
☆11Sep 26, 2023Updated 2 years ago