PrasannS / rlhf-length-biases
☆27Mar 13, 2024Updated last year
Alternatives and similar repositories for rlhf-length-biases
Users that are interested in rlhf-length-biases are comparing it to the libraries listed below
- Long Is More for Alignment: A Simple but Tough-to-Beat Baseline for Instruction Fine-Tuning [ICML 2024]☆21May 2, 2024Updated last year
- ☆23Oct 30, 2023Updated 2 years ago
- MeCab model trained with OpenKorPos.☆23Jun 19, 2022Updated 3 years ago
- ACL24☆11Jun 7, 2024Updated last year
- ☆10Jun 5, 2025Updated 8 months ago
- ☆10Oct 28, 2024Updated last year
- KoRean based ELECTRA pre-trained models (KR-ELECTRA) for Tensorflow and PyTorch☆15Feb 13, 2022Updated 4 years ago
- Directional Preference Alignment☆58Sep 23, 2024Updated last year
- Script to pre-train Hugging Face transformers BART with TensorFlow 2☆35Apr 13, 2023Updated 2 years ago
- Hate speech detection corpus in Korean, shared with EMNLP 2023 paper☆17Apr 19, 2024Updated last year
- Code for "Echo Chamber: RL Post-training Amplifies Behaviors Learned in Pretraining"☆26Oct 14, 2025Updated 4 months ago
- Data processing for the Collective Constitutional AI project (a collaboration between The Collective Intelligence Project & Anthropic)☆26Oct 17, 2023Updated 2 years ago
- ☆20Apr 28, 2021Updated 4 years ago
- ☆19Sep 20, 2022Updated 3 years ago
- Machine Generated Captions for Best Artworks☆22Sep 21, 2022Updated 3 years ago
- Google's Conceptual Captions Dataset translated into Korean☆23Aug 28, 2022Updated 3 years ago
- ☆11Mar 13, 2025Updated 11 months ago
- Is In-Context Learning Sufficient for Instruction Following in LLMs? [ICLR 2025]☆32Jan 23, 2025Updated last year
- Character-level Korean ELECTRA Model (syllable-level Korean ELECTRA)☆54Jun 12, 2023Updated 2 years ago
- CodeUltraFeedback: aligning large language models to coding preferences (TOSEM 2025)☆73Jun 25, 2024Updated last year
- TyDiP Multilingual Politeness dataset and code☆12Oct 15, 2023Updated 2 years ago
- Bias, Hate classification with KoELECTRA 👿☆27Jun 12, 2023Updated 2 years ago
- 🚀 Implementation of easy-to-use 3D parallelism based on Huggingface Transformers & Microsoft DeepSpeed☆31Feb 5, 2022Updated 4 years ago
- ☆30Feb 16, 2024Updated 2 years ago
- T5-base model for Korean☆27May 20, 2021Updated 4 years ago
- Korean datasets available on Hugging Face☆36Oct 10, 2024Updated last year
- SyMuRBench: Benchmark for symbolic music representations☆17Nov 6, 2025Updated 3 months ago
- Code for the paper "Distinguishing the Knowable from the Unknowable with Language Models"☆11Apr 15, 2024Updated last year
- Korean psychological counseling dataset☆81Jun 20, 2023Updated 2 years ago
- APEACH: Attacking Pejorative Expressions with Analysis on Crowd-generated Hate Speech Evaluation Datasets☆77Feb 5, 2023Updated 3 years ago
- A utility for storing and reading files for Korean LM training 💾☆35Oct 15, 2025Updated 4 months ago
- Create paraphrased Korean sentences with GPT-3☆34Jan 30, 2023Updated 3 years ago
- Mirror-based reflection for Objective-C☆10Jan 18, 2016Updated 10 years ago
- ☆14Nov 19, 2024Updated last year
- ☆10Oct 11, 2022Updated 3 years ago
- ☆37Nov 20, 2021Updated 4 years ago
- ☆30May 18, 2014Updated 11 years ago
- PyTorch Implementation for the paper "Let Me Help You! Neuro-Symbolic Short-Context Action Anticipation" accepted to RA-L'24.☆12Nov 27, 2024Updated last year
- ☆12Aug 6, 2024Updated last year