EasyRLHF aims to provide an easy and minimal interface to train aligned language models, using off-the-shelf solutions and datasets
☆10Dec 12, 2023Updated 2 years ago
Alternatives and similar repositories for EasyRLHF
Users that are interested in EasyRLHF are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆14Dec 9, 2021Updated 4 years ago
- Korean Benchmark for Korean Legal Language Understanding☆18Nov 16, 2024Updated last year
- Code for paper Target-Guided Dialogue Response Generation Using Commonsense and Data Augmentation☆14Jun 10, 2022Updated 3 years ago
- Google 공식 Rouge Implementation을 한국어에서 사용할 수 있도록 처리☆18Jan 3, 2024Updated 2 years ago
- SKT'22 AI Fellowship, 딥러닝 기반 흑백 이미지 컬러화 기술 개발☆13Jun 7, 2023Updated 2 years ago
- 문장단위로 분절된 나무위키 데이터셋. Releases에서 다운로드 받거나, tfds-korean을 통해 다운로드 받으세요.☆19Jun 16, 2021Updated 4 years ago
- Code for "Goodtriever: Toxicity Mitigation with Retrieval-augmented Language Models"☆25May 30, 2024Updated last year
- An empathetic counselling chatbot. Retrieval-based, uses finetuned LMs for emotion identification and to boost empathy, novelty and fluen…☆17Jun 8, 2023Updated 2 years ago
- code for our EACL 2021 paper: "Challenges in Automated Debiasing for Toxic Language Detection" by Xuhui Zhou, Maarten Sap, Swabha Swayamd…☆19Aug 20, 2021Updated 4 years ago
- ↔️ T5 Machine Translation from English to Korean☆18Aug 11, 2022Updated 3 years ago
- Mini Model Daemon☆12Nov 9, 2024Updated last year
- The source code of the paper 'Dynamic Knowledge Routing Network For Target-Guided Open-Domain Conversation'☆24Mar 24, 2023Updated 3 years ago
- ☆20Apr 1, 2022Updated 3 years ago
- Collection of academic and pseudo-academic events, publications and web sites that spam me☆21Mar 16, 2026Updated last week
- Framework for Algorithmic Correctness Testing of Operators☆16Mar 9, 2026Updated 2 weeks ago
- ☆12Dec 14, 2024Updated last year
- ☆11Feb 25, 2025Updated last year
- test images with not appropriate labels in MNIST dataset☆10Mar 3, 2018Updated 8 years ago
- Collection of apple-native tools for the model context protocol.☆18Apr 2, 2025Updated 11 months ago
- GoldFinch and other hybrid transformer components☆12Dec 9, 2025Updated 3 months ago
- Recording and processing Tobii eyeX and 4C with the standard SDK☆14Apr 12, 2018Updated 7 years ago
- Training a reward model for RLHF using RWKV.☆15Jun 5, 2023Updated 2 years ago
- ☆39Feb 25, 2026Updated 3 weeks ago
- <혼자 만들면서 공부하는 파이썬> 책의 깃허브 자료실☆15Jan 14, 2026Updated 2 months ago
- Python FastApi "Circuit Breaker" implementation☆13Mar 14, 2025Updated last year
- This repository contains all code and experiments for competitive policy gradient (CoPG) algorithm.☆24Aug 1, 2020Updated 5 years ago
- RWKV-7 mini☆12Mar 29, 2025Updated 11 months ago
- Direct Preference Optimization for RWKV, aiming for RWKV-5 and 6.