Mr-Tieguigui/LLM-Post-Training

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Mr-Tieguigui/LLM-Post-Training)

Mr-Tieguigui / LLM-Post-Training

☆96

Alternatives and similar repositories for LLM-Post-Training

Users that are interested in LLM-Post-Training are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

fplaunchpad / cs3100_m25
View on GitHub
IITM Paradigms of Programming -- Monsoon 2025
☆18Nov 17, 2025Updated 7 months ago
inclusionAI / MoBE
View on GitHub
Mixture-of-Basis-Experts for Compressing MoE-based LLMs
☆35Dec 24, 2025Updated 6 months ago
FlyTune / MASPO-RL
View on GitHub
Mass-Adaptive Soft Policy Optimization (MASPO) - Official Implementation
☆58Apr 27, 2026Updated 2 months ago
FlyTune / DGPO-RL
View on GitHub
Decoupled Gradient Policy Optimization (DGPO) - Official Implementation
☆48Apr 22, 2026Updated 2 months ago
adrialopezescoriza / demo3
View on GitHub
Official implementation of DEMO3
☆68Jul 29, 2025Updated 10 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
haowei-freesky / HERMES
View on GitHub
Official Repository for paper "HERMES: KV Cache as Hierarchical Memory for Efficient Streaming Video Understanding" [ACL 2026]
☆90May 8, 2026Updated last month
kxfan2002 / Reagent
View on GitHub
Agent-RRM: Exploring Reasoning Reward Model for Agents
☆70Mar 17, 2026Updated 3 months ago
zhouhuan-hust / DiCNet
View on GitHub
This repository provides the PyTorch implementation of the paper: Anomaly Discovery in Semantic Segmentation via Distillation Comparison …
☆15Apr 18, 2023Updated 3 years ago
ChanLiang / ORIG
View on GitHub
[ACL 2023 findings] Towards Robust Personalized Dialogue Generation via Order-Insensitive Representation Regularization
☆17Aug 26, 2023Updated 2 years ago
bigai-nlco / RuleReasoner
View on GitHub
[ICLR 2026] RuleReasoner: Reinforced Rule-based Reasoning via Domain-aware Dynamic Sampling
☆38Feb 25, 2026Updated 4 months ago
nauyan / PyTorch-Distributed-Tutorials
View on GitHub
Concise tutorials for distributed training using PyTorch
☆10Apr 18, 2023Updated 3 years ago
Aitchson-Hwang / adversarial_visible_watermarking
View on GitHub
[TIFS 2025] The official code for TIFS paper "New Visible Watermark Protection Mechanism Based on Information Hiding"
☆13Oct 27, 2025Updated 8 months ago
sygi / vic-tensorflow
View on GitHub
Implementation of Variational Intrinsic Control in tensorflow
☆11Apr 5, 2017Updated 9 years ago
snap-stanford / optimas
View on GitHub
(ICLR 2026) Optimas: Optimizing Compound AI Systems
☆80Feb 6, 2026Updated 4 months ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
iwangjian / TRIP
View on GitHub
[TOIS 2024] Target-constrained Bidirectional Planning for Generation of Target-oriented Proactive Dialogue
☆13Oct 18, 2025Updated 8 months ago
Steven-Ho / VALOR
View on GitHub
Implementation of VALOR (Variational Option Discovery Algorithms)
☆10Jun 28, 2019Updated 7 years ago
olehb / pytorch_ddp_tutorial
View on GitHub
A step-by-step tutorial about how to use Distributed Data Parallel feature of PyTorch
☆16Nov 20, 2020Updated 5 years ago
iLearn-Lab / ACL25-PTQ1.61
View on GitHub
☆15Apr 6, 2026Updated 2 months ago
liyongqi2002 / Email_Client
View on GitHub
从socket开始实现pop3和smtp客户端，实现邮件编写、发送、接收、阅读、删除等基本功能。并实现简单界面（PyQt5）Start from socket to implement pop3 and smtp clients, to realize the basic …
☆12Dec 24, 2023Updated 2 years ago
shoaibahmed / llm_depth_pruning
View on GitHub
Official implementation of the paper: "A deeper look at depth pruning of LLMs"
☆15Jul 24, 2024Updated last year
mzmmoazam / kashmiri_dataset
View on GitHub
Data and tool to fetch kashmiri text
☆16Aug 2, 2020Updated 5 years ago
LanceZPF / OpenING
View on GitHub
Official Implementation of OpenING: A Comprehensive Benchmark for Judging Open-ended Interleaved Image-Text Generation
☆64Jul 5, 2025Updated 11 months ago
WangWenhao0716 / PDF-Embedding
View on GitHub
[NeurIPS 2024] The official implementation of "Image Copy Detection for Diffusion Models"
☆18Oct 1, 2024Updated last year
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
JiaChangGit / Embedded-operating-systems-NCKU
View on GitHub
嵌入式作業系統分析與實作 ANALYSIS AND IMPLEMENTATION OF EMBEDDED OPERATING SYSTEMS, 張大緯
☆15Updated this week
AbhilashaRavichander / HALoGEN
View on GitHub
Code for the paper "HALoGEN: Fantastic LLM Hallucinations and Where To Find Them"
☆25May 18, 2025Updated last year
zzasdf / VietASR
View on GitHub
☆51Sep 3, 2025Updated 9 months ago
linjh1118 / WisdoMentor
View on GitHub
WisdoMentor - Series: A LLM for undergraduates | 博导智言(辅助大学生学习)
☆13May 9, 2024Updated 2 years ago
linjh1118 / Llama3-Chinese-ORPO
View on GitHub
基于Llama3，通过进一步CPT，SFT，ORPO得到的中文版Llama3
☆16Apr 24, 2024Updated 2 years ago
naivoder / MCTSr
View on GitHub
Monte Carlo Tree Search Self-Refine (MCTSr)
☆22Jul 6, 2024Updated last year
oezyurty / REPLM
View on GitHub
Document-Level In-Context Few-Shot Relation Extraction via Pre-Trained Language Models
☆12May 15, 2024Updated 2 years ago
git-disl / Lisa
View on GitHub
This is the official code for the paper "Lazy Safety Alignment for Large Language Models against Harmful Fine-tuning" (NeurIPS2024)
☆28Sep 10, 2024Updated last year
ocademy-ai / open-learning-resources
View on GitHub
A curated list of free/open source resources for you to learn Computer Science.
☆21Jul 4, 2023Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
linkedin / ControlLLM
View on GitHub
Control LLM
☆23Apr 6, 2025Updated last year
sylviayuan-sy / LARM
View on GitHub
☆35Jan 27, 2026Updated 5 months ago
GuanghaoYe / Emergence-of-Thinking
View on GitHub
☆55Feb 11, 2025Updated last year
nicolaus625 / CMI-bench
View on GitHub
☆18Jun 24, 2025Updated last year
JingMog / THOR
View on GitHub
[ICLR-2026] Official Implementation of our paper "THOR: Tool-Integrated Hierarchical Optimization via RL for Mathematical Reasoning".
☆32Feb 26, 2026Updated 4 months ago
Columbia-NLP-Lab / LionAlignment
View on GitHub
☆12Aug 6, 2024Updated last year
YJiangcm / BMC
View on GitHub
[ICLR 2025] Bridging and Modeling Correlations in Pairwise Data for Direct Preference Optimization
☆12Jan 26, 2025Updated last year