Scripts for fine-tuning Llama2 via SFT and DPO.
☆206Aug 14, 2023Updated 2 years ago
Alternatives and similar repositories for llama2-fine-tune
Users that are interested in llama2-fine-tune are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆10Jan 20, 2024Updated 2 years ago
- 모두의 말뭉치 데이터를 분석에 편리한 형태로 변환하는 기능을 제공합니다.☆11Mar 2, 2022Updated 4 years ago
- ☆11Oct 3, 2021Updated 4 years ago
- MetricEval: A framework that conceptualizes and operationalizes four main components of metric evaluation, in terms of reliability and va…☆12Nov 6, 2023Updated 2 years ago
- An unofficial implementation of SOLAR-10.7B model and the newly proposed interlocked-DUS(iDUS) implementation and experiment details.☆14Mar 20, 2024Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆11Apr 2, 2024Updated 2 years ago
- 금융 도메인에 특화된 한국어 임베딩 모델☆22Aug 8, 2024Updated last year
- 💻 Terminal-Agent with Human-in-the-Loop Learning☆39Jan 16, 2026Updated 3 months ago
- For the rlhf learning environment of Koreans☆25Sep 25, 2023Updated 2 years ago
- BERT score for text generation☆12Jan 15, 2025Updated last year
- SIB-200: A Simple, Inclusive, and Big Evaluation Dataset for Topic Classification in 200+ Languages and Dialects☆23Jan 26, 2025Updated last year
- Official repository for KoMT-Bench built by LG AI Research☆71Aug 8, 2024Updated last year
- Korean Nested Named Entity Corpus☆20May 13, 2023Updated 2 years ago
- Official implementation of SIGIR 2022 Paper "Task-Oriented Dialogue System as Natural Language Generation".☆14Apr 6, 2022Updated 4 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- DSBA code study☆30Nov 7, 2023Updated 2 years ago
- Exploring limitations of LLM-as-a-judge☆20Aug 17, 2024Updated last year
- incremental symbol learning for natural language understanding☆10Jun 12, 2023Updated 2 years ago
- Korean text data preprocess toolkit for NLP☆18Jun 11, 2019Updated 6 years ago
- ☆31Oct 15, 2021Updated 4 years ago
- Fine-tuning, DPO, RLHF, RLAIF on LLMs - Qwen3, Zephyr 7B GPTQ with 4-Bit Quantization, Mistral-7B-GPTQ☆15Jul 5, 2025Updated 9 months ago
- ☆23Nov 26, 2024Updated last year
- The git repository of Modular Prompted Chatbot paper☆35May 24, 2023Updated 2 years ago
- Code for the experiments in the ACL 2020 paper "Estimating predictive uncertainty for rumour verification models"☆11May 15, 2020Updated 5 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- 한국어 어휘 의미 분석 모델☆23Apr 4, 2022Updated 4 years ago
- Robust recipes to align language models with human and AI preferences☆5,587Apr 8, 2026Updated 3 weeks ago
- Korean Named Entity Corpus☆25May 12, 2023Updated 2 years ago
- DuoGuard: A Two-Player RL-Driven Framework for Multilingual LLM Guardrails☆32Feb 26, 2025Updated last year
- Monitoring of a GPU system sending either Slack or Mattermost messages via webhooks☆12Jul 20, 2017Updated 8 years ago
- 특허분야 특화된 한국어 AI언어모델 KorPatBERT☆67Jan 31, 2024Updated 2 years ago
- The multilingual language model for Switzerland☆28Jan 19, 2024Updated 2 years ago
- Simple extension for text-generation-webui that injects recent conversation history into the negative prompt with the goal of minimizing …☆32Nov 20, 2023Updated 2 years ago
- ☆19Nov 7, 2023Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Official implementation of Bootstrapping Language Models via DPO Implicit Rewards☆47Apr 15, 2025Updated last year
- Testing DeepSpeed integration in 🤗 Accelerate☆11Jun 28, 2022Updated 3 years ago
- [Neural Networks 2025] The official code for the paper "MNet: A Multi-Scale Network for Visible Watermark Removal."☆17Jun 16, 2025Updated 10 months ago
- This repository contains papers for a comprehensive survey on accelerated generation techniques in Large Language Models (LLMs).☆11May 24, 2024Updated last year
- [ICML 2024] Junk DNA Hypothesis: A Task-Centric Angle of LLM Pre-trained Weights through Sparsity; Lu Yin*, Ajay Jaiswal*, Shiwei Liu, So…☆16Apr 21, 2025Updated last year
- Natural Language Processing Tasks and Examples.☆62Aug 17, 2022Updated 3 years ago
- This is the code for our paper: PLACES: Prompting Language Models for Social Conversation Synthesis☆11Feb 17, 2023Updated 3 years ago