☆96May 23, 2025Updated last year
Alternatives and similar repositories for LLM-Post-Training
Users that are interested in LLM-Post-Training are comparing it to the libraries listed below.
- IITM Paradigms of Programming -- Monsoon 2025☆18Nov 17, 2025Updated 7 months ago
- Mixture-of-Basis-Experts for Compressing MoE-based LLMs☆35Dec 24, 2025Updated 6 months ago
- Mass-Adaptive Soft Policy Optimization (MASPO) - Official Implementation☆58Apr 27, 2026Updated 2 months ago
- Decoupled Gradient Policy Optimization (DGPO) - Official Implementation☆48Apr 22, 2026Updated 2 months ago
- Official implementation of DEMO3☆68Jul 29, 2025Updated 10 months ago
- Official Repository for paper "HERMES: KV Cache as Hierarchical Memory for Efficient Streaming Video Understanding" [ACL 2026]☆90May 8, 2026Updated last month
- Agent-RRM: Exploring Reasoning Reward Model for Agents☆70Mar 17, 2026Updated 3 months ago
- This repository provides the PyTorch implementation of the paper: Anomaly Discovery in Semantic Segmentation via Distillation Comparison …☆15Apr 18, 2023Updated 3 years ago
- [ACL 2023 findings] Towards Robust Personalized Dialogue Generation via Order-Insensitive Representation Regularization☆17Aug 26, 2023Updated 2 years ago
- [ICLR 2026] RuleReasoner: Reinforced Rule-based Reasoning via Domain-aware Dynamic Sampling☆38Feb 25, 2026Updated 4 months ago
- Concise tutorials for distributed training using PyTorch☆10Apr 18, 2023Updated 3 years ago
- [TIFS 2025] The official code for TIFS paper "New Visible Watermark Protection Mechanism Based on Information Hiding"☆13Oct 27, 2025Updated 8 months ago
- Implementation of Variational Intrinsic Control in tensorflow☆11Apr 5, 2017Updated 9 years ago
- (ICLR 2026) Optimas: Optimizing Compound AI Systems☆80Feb 6, 2026Updated 4 months ago
- [TOIS 2024] Target-constrained Bidirectional Planning for Generation of Target-oriented Proactive Dialogue☆13Oct 18, 2025Updated 8 months ago
- Implementation of VALOR (Variational Option Discovery Algorithms)☆10Jun 28, 2019Updated 7 years ago
- A step-by-step tutorial about how to use Distributed Data Parallel feature of PyTorch☆16Nov 20, 2020Updated 5 years ago
- ☆15Apr 6, 2026Updated 2 months ago
- POP3 and SMTP clients implemented from raw sockets, supporting basic functions such as composing, sending, receiving, reading, and deleting mail, with a simple PyQt5 interface.☆12Dec 24, 2023Updated 2 years ago
- Official implementation of the paper: "A deeper look at depth pruning of LLMs"☆15Jul 24, 2024Updated last year
- Data and tool to fetch kashmiri text☆16Aug 2, 2020Updated 5 years ago
- Official Implementation of OpenING: A Comprehensive Benchmark for Judging Open-ended Interleaved Image-Text Generation☆64Jul 5, 2025Updated 11 months ago
- [NeurIPS 2024] The official implementation of "Image Copy Detection for Diffusion Models"☆18Oct 1, 2024Updated last year
- Analysis and Implementation of Embedded Operating Systems, 張大緯☆15Updated this week
- Code for the paper "HALoGEN: Fantastic LLM Hallucinations and Where To Find Them"☆25May 18, 2025Updated last year
- ☆51Sep 3, 2025Updated 9 months ago
- WisdoMentor - Series: An LLM for undergraduates | 博导智言 (a study aid for university students)☆13May 9, 2024Updated 2 years ago
- A Chinese version of Llama3, obtained from Llama3 through further CPT, SFT, and ORPO☆16Apr 24, 2024Updated 2 years ago
- Monte Carlo Tree Search Self-Refine (MCTSr)☆22Jul 6, 2024Updated last year
- Document-Level In-Context Few-Shot Relation Extraction via Pre-Trained Language Models☆12May 15, 2024Updated 2 years ago
- This is the official code for the paper "Lazy Safety Alignment for Large Language Models against Harmful Fine-tuning" (NeurIPS2024)☆28Sep 10, 2024Updated last year
- A curated list of free/open source resources for you to learn Computer Science.☆21Jul 4, 2023Updated 2 years ago
- Control LLM☆23Apr 6, 2025Updated last year
- ☆35Jan 27, 2026Updated 5 months ago
- ☆55Feb 11, 2025Updated last year
- ☆18Jun 24, 2025Updated last year
- [ICLR-2026] Official Implementation of our paper "THOR: Tool-Integrated Hierarchical Optimization via RL for Mathematical Reasoning".☆32Feb 26, 2026Updated 4 months ago
- ☆12Aug 6, 2024Updated last year
- [ICLR 2025] Bridging and Modeling Correlations in Pairwise Data for Direct Preference Optimization☆12Jan 26, 2025Updated last year