hyunwoongko/nanoRLHF

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/hyunwoongko/nanoRLHF)

hyunwoongko / nanoRLHF

nanoRLHF: from-scratch journey into how LLMs and RLHF really work.

☆194

Alternatives and similar repositories for nanoRLHF

Users that are interested in nanoRLHF are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Zerohertz / Instruct_KR_2025_Summer_Meetup_vLLM
View on GitHub
🎹 Instruct.KR 2025 Summer Meetup: 오픈소스 LLM, vLLM으로 Production까지 🎹
☆23Aug 2, 2025Updated 11 months ago
jason9693 / oslo-kogpt-finetunig
View on GitHub
kogpt를 oslo로 파인튜닝하는 예제.
☆23Aug 26, 2022Updated 3 years ago
js-lee-AI / awesome-llm-agent-papers
View on GitHub
A curated, continuously updated reading list of 200+ papers on LLM agents: planning, memory, tool use, multi-agent, evaluation & safety. …
☆36Updated this week
instructkr / LogicKor
View on GitHub
한국어 언어모델 다분야 사고력 벤치마크
☆209Oct 17, 2024Updated last year
J-Seo / KoCommonGEN-V2
View on GitHub
KoCommonGEN v2: A Benchmark for Navigating Korean Commonsense Reasoning Challenges in Large Language Models
☆25Aug 24, 2024Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
qwopqwop200 / ko-arena-hard-auto
View on GitHub
Ko-Arena-Hard-Auto: An automatic LLM benchmark for Korean
☆22Apr 23, 2025Updated last year
StableFluffy / EasyLLMFeaturePorter
View on GitHub
1-Click is all you need.
☆63Apr 29, 2024Updated 2 years ago
kakao / kanana-2
View on GitHub
☆23Jun 30, 2026Updated 3 weeks ago
Atipico1 / Kor-IR
View on GitHub
Kor-IR: Korean Information Retrieval Benchmark
☆87Jul 3, 2024Updated 2 years ago
overfit-brothers / KRX-2024
View on GitHub
☆12Dec 20, 2024Updated last year
HeegyuKim / open-korean-instructions
View on GitHub
언어모델을 학습하기 위한 공개 한국어 instruction dataset들을 모아두었습니다.
☆469Apr 13, 2025Updated last year
Hugging-Face-KREW / Ko-AgentBench
View on GitHub
☆66Feb 6, 2026Updated 5 months ago
daekeun-ml / evaluate-llm-on-korean-dataset
View on GitHub
Performs benchmarking on two Korean datasets with minimal time and effort.
☆45Jan 22, 2026Updated 5 months ago
human-rights-corpus / HRC
View on GitHub
#인권코퍼스
☆31Oct 6, 2023Updated 2 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
kakao / FunctionChat-Bench
View on GitHub
☆119Feb 25, 2026Updated 4 months ago
dhk1349 / MERLIN_text_to_video_search
View on GitHub
[EMNLP 2024 Industry track] MERLIN : Multimodal Embedding Refinement via LLM-based Iterative Navigation for Text-Video Retrieval-Rerank P…
☆14Mar 4, 2025Updated last year
EleutherAI / hae-rae
View on GitHub
☆33Aug 30, 2023Updated 2 years ago
hyunwoongko / beyond-lm
View on GitHub
Beyond LM: How can language model go forward in the future?
☆15Apr 30, 2023Updated 3 years ago
Marker-Inc-Korea / COT_steering
View on GitHub
This repository aims to develop CoT Steering based on CoT without Prompting. It focuses on enhancing the model’s latent reasoning capabil…
☆116Jun 25, 2025Updated last year
MLP-Lab / KORMo-tutorial
View on GitHub
☆116Oct 13, 2025Updated 9 months ago
jason9693 / polyglot-finetuning-oslo
View on GitHub
☆19Sep 20, 2022Updated 3 years ago
Ouro-labs / ourocode
View on GitHub
ouroboros native cli with seamless mcp orchestration
☆15Jun 14, 2026Updated last month
LG-AI-EXAONE / KoMT-Bench
View on GitHub
Official repository for KoMT-Bench built by LG AI Research
☆73Aug 8, 2024Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
hyunwoongko / solar-vs-glm-vs-phi
View on GitHub
Solar vs GLM vs Phi
☆100Jan 2, 2026Updated 6 months ago
MrBananaHuman / PangyoCorpora
View on GitHub
☆36Oct 4, 2023Updated 2 years ago
yjoonjang / rebuttal-skills
View on GitHub
Draft grounded rebuttals to your paper's reviews, with the experiments actually run in your workspace
☆16Updated this week
lassl / lassl
View on GitHub
Easy Language Model Pretraining leveraging Huggingface's Transformers and Datasets
☆130Nov 12, 2022Updated 3 years ago
workdd / LLM_Foreign_Block
View on GitHub
LLM 모델의 외국어 토큰 생성을 막는 코드 구현
☆87Aug 7, 2025Updated 11 months ago
kakao / OrchestrationBench
View on GitHub
☆48Apr 17, 2026Updated 3 months ago
EleutherAI / polyglot-data
View on GitHub
data related codebase for polyglot project
☆19Mar 30, 2023Updated 3 years ago
Marker-Inc-Korea / AutoRAG-example-korean-embedding-benchmark
View on GitHub
AutoRAG example about benchmarking Korean embeddings.
☆45Oct 2, 2024Updated last year
kakao / kanana
View on GitHub
Kanana: Compute-efficient Bilingual Language Models
☆280Jul 23, 2025Updated 11 months ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
instructkr / reranker-simple-benchmark
View on GitHub
Make running benchmark simple yet maintainable, again. Now only supports Korean-based cross-encoder.
☆35Dec 2, 2025Updated 7 months ago
nlpai-lab / KURE
View on GitHub
KURE: 고려대학교에서 개발한, 한국어 검색에 특화된 임베딩 모델
☆224Apr 14, 2026Updated 3 months ago
sionic-ai / Llama4-Token-Editor
View on GitHub
☆64Jul 21, 2025Updated last year
alohays / openai-tool2mcp
View on GitHub
mcp wrapper for openai built-in tools
☆12Mar 13, 2025Updated last year
hyunwoongko / stop-sequencer
View on GitHub
Implementation of stop sequencer for Huggingface Transformers
☆16Jun 6, 2023Updated 3 years ago
realsigridjin / crisp-py
View on GitHub
The Python Implementation of CRISP: Clustering Multi-Vector Representations for Denoising and Pruning
☆27Jul 27, 2025Updated 11 months ago
kakao / diatool-dpo
View on GitHub
☆15Aug 25, 2025Updated 10 months ago