mzbac/llama2-fine-tune

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/mzbac/llama2-fine-tune)

mzbac / llama2-fine-tune

Scripts for fine-tuning Llama2 via SFT and DPO.

☆207

Alternatives and similar repositories for llama2-fine-tune

Users that are interested in llama2-fine-tune are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

KyujinHan / Sakura-SOLAR-DPO
View on GitHub
Sakura-SOLAR-DPO: Merge, SFT, and DPO
☆116Dec 30, 2023Updated 2 years ago
kh-kim / nlp-express-practice
View on GitHub
☆10Jan 20, 2024Updated 2 years ago
ko-nlp / moducorpus-sanitizer
View on GitHub
모두의 말뭉치 데이터를 분석에 편리한 형태로 변환하는 기능을 제공합니다.
☆11Mar 2, 2022Updated 4 years ago
songys / 2021Langcon
View on GitHub
☆11Oct 3, 2021Updated 4 years ago
isle-dev / MetricEval
View on GitHub
MetricEval: A framework that conceptualizes and operationalizes four main components of metric evaluation, in terms of reliability and va…
☆12Nov 6, 2023Updated 2 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
gauss5930 / iDUS
View on GitHub
An unofficial implementation of SOLAR-10.7B model and the newly proposed interlocked-DUS(iDUS) implementation and experiment details.
☆14Mar 20, 2024Updated 2 years ago
DAMO-NLP-SG / AdamergeX
View on GitHub
☆11Apr 2, 2024Updated 2 years ago
upskyy / kf-deberta-multitask
View on GitHub
금융 도메인에 특화된 한국어 임베딩 모델
☆23Aug 8, 2024Updated last year
terminal-agent / reptile
View on GitHub
💻 Terminal-Agent with Human-in-the-Loop Learning
☆40Jan 16, 2026Updated 5 months ago
JoJo0217 / rlhf_korean_dataset
View on GitHub
For the rlhf learning environment of Koreans
☆25Sep 25, 2023Updated 2 years ago
metterian / korean_bert_score
View on GitHub
BERT score for text generation
☆12Jan 15, 2025Updated last year
eric-mitchell / direct-preference-optimization
View on GitHub
Reference implementation for DPO (Direct Preference Optimization)
☆2,894Aug 11, 2024Updated last year
dadelani / sib-200
View on GitHub
SIB-200: A Simple, Inclusive, and Big Evaluation Dataset for Topic Classification in 200+ Languages and Dialects
☆26May 20, 2026Updated last month
LG-AI-EXAONE / KoMT-Bench
View on GitHub
Official repository for KoMT-Bench built by LG AI Research
☆73Aug 8, 2024Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
korean-named-entity / konne
View on GitHub
Korean Nested Named Entity Corpus
☆20May 13, 2023Updated 3 years ago
DSBA-Lab / CodeLab
View on GitHub
DSBA code study
☆30Nov 7, 2023Updated 2 years ago
LeonEricsson / llmjudge
View on GitHub
Exploring limitations of LLM-as-a-judge
☆20Aug 17, 2024Updated last year
microsoft / nlu-incremental-symbol-learning
View on GitHub
incremental symbol learning for natural language understanding
☆10Jun 12, 2023Updated 3 years ago
sionic-ai / webgpu-llm-loader
View on GitHub
A loader that lets you try running LLMs built for WebGPU.
☆29Dec 20, 2023Updated 2 years ago
YongWookHa / kor-text-preprocess
View on GitHub
Korean text data preprocess toolkit for NLP
☆18Jun 11, 2019Updated 7 years ago
JongyoonSong / K-StereoSet
View on GitHub
☆31Oct 15, 2021Updated 4 years ago
TrustAIRLab / VoiceJailbreakAttack
View on GitHub
Code for Voice Jailbreak Attacks Against GPT-4o.
☆38May 31, 2024Updated 2 years ago
SempraETY / Pruning-via-Merging
View on GitHub
☆23Nov 26, 2024Updated last year
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
krafton-ai / MPC
View on GitHub
The git repository of Modular Prompted Chatbot paper
☆35May 24, 2023Updated 3 years ago
lih0905 / WSD_kor
View on GitHub
한국어 어휘 의미 분석 모델
☆25Apr 4, 2022Updated 4 years ago
huggingface / alignment-handbook
View on GitHub
Robust recipes to align language models with human and AI preferences
☆5,623May 26, 2026Updated last month
likenneth / persona_drift
View on GitHub
Measuring and Controlling Persona Drift in Language Model Dialogs
☆25Feb 26, 2024Updated 2 years ago
yihedeng9 / DuoGuard
View on GitHub
DuoGuard: A Two-Player RL-Driven Framework for Multilingual LLM Guardrails
☆34Feb 26, 2025Updated last year
jooinjang / Ko-ATOMIC
View on GitHub
Korean Commonsense Knowledge Graph
☆15Dec 23, 2022Updated 3 years ago
mharrend / GPU-Monitoring-Slack-Mattermost
View on GitHub
Monitoring of a GPU system sending either Slack or Mattermost messages via webhooks
☆12Jul 20, 2017Updated 8 years ago
kipi-ai / korpatbert
View on GitHub
특허분야 특화된 한국어 AI언어모델 KorPatBERT
☆70Jan 31, 2024Updated 2 years ago
ZurichNLP / swissbert
View on GitHub
The multilingual language model for Switzerland
☆29Jan 19, 2024Updated 2 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
sail-sg / dice
View on GitHub
Official implementation of Bootstrapping Language Models via DPO Implicit Rewards
☆47Apr 15, 2025Updated last year
joowon-dm-snu / fastcampus-chatgpt-intro-frameworks
View on GitHub
☆19Nov 7, 2023Updated 2 years ago
pacman100 / accelerate-deepspeed-test
View on GitHub
Testing DeepSpeed integration in 🤗 Accelerate
☆11Jun 28, 2022Updated 4 years ago
VITA-Group / Junk_DNA_Hypothesis
View on GitHub
[ICML 2024] Junk DNA Hypothesis: A Task-Centric Angle of LLM Pre-trained Weights through Sparsity; Lu Yin*, Ajay Jaiswal*, Shiwei Liu, So…
☆16Apr 21, 2025Updated last year
Aitchson-Hwang / MNet
View on GitHub
[Neural Networks 2025] The official code for the paper "MNet: A Multi-Scale Network for Visible Watermark Removal."
☆17Jun 16, 2025Updated last year
sooftware / nlp-tasks
View on GitHub
Natural Language Processing Tasks and Examples.
☆62Aug 17, 2022Updated 3 years ago
alexa / places
View on GitHub
This is the code for our paper: PLACES: Prompting Language Models for Social Conversation Synthesis
☆11Feb 17, 2023Updated 3 years ago