☆94May 23, 2025Updated last year
Alternatives and similar repositories for LLM-Post-Training
Users that are interested in LLM-Post-Training are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Mixture-of-Basis-Experts for Compressing MoE-based LLMs☆34Dec 24, 2025Updated 5 months ago
- Mass-Adaptive Soft Policy Optimization (MASPO) - Official Implementation☆56Apr 27, 2026Updated last month
- Official implementation of DEMO3☆67Jul 29, 2025Updated 10 months ago
- Official Repository for paper "HERMES: KV Cache as Hierarchical Memory for Efficient Streaming Video Understanding" [ACL 2026]☆86May 8, 2026Updated last month
- Agent-RRM: Exploring Reasoning Reward Model for Agents☆67Mar 17, 2026Updated 2 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- This repository provides the PyTorch implementation of the paper: Anomaly Discovery in Semantic Segmentation via Distillation Comparison …☆15Apr 18, 2023Updated 3 years ago
- [CVPR 2024] Adapting Short-Term Transformers for Action Detection in Untrimmed Videos☆11Jun 11, 2024Updated last year
- [ACL 2023 findings] Towards Robust Personalized Dialogue Generation via Order-Insensitive Representation Regularization☆17Aug 26, 2023Updated 2 years ago
- Concise tutorials for distributed training using PyTorch☆10Apr 18, 2023Updated 3 years ago
- [TOIS 2024] Target-constrained Bidirectional Planning for Generation of Target-oriented Proactive Dialogue☆13Oct 18, 2025Updated 7 months ago
- Implementation of VALOR (Variational Option Discovery Algorithms)☆10Jun 28, 2019Updated 6 years ago
- Alignment-Free methods Adjusted by Neural Network☆13Feb 20, 2020Updated 6 years ago
- A step-by-step tutorial about how to use Distributed Data Parallel feature of PyTorch☆16Nov 20, 2020Updated 5 years ago
- Official Implementation of OpenING: A Comprehensive Benchmark for Judging Open-ended Interleaved Image-Text Generation☆41Jul 5, 2025Updated 11 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Repository having the code and models from the paper: data2vec-aqc: Search for the right Teaching Assistant in the Teacher-Student traini…☆13Mar 18, 2024Updated 2 years ago
- ☆15Apr 6, 2026Updated 2 months ago
- 从socket开始实现pop3和smtp客户端,实现邮件编写、发送、接收、阅读、删除等基本功能。并实现简单界面(PyQt5)Start from socket to implement pop3 and smtp clients, to realize the basic …☆12Dec 24, 2023Updated 2 years ago
- Code of LVAgent: Long Video Understanding by Multi-Round Dynamical Collaboration of MLLM Agents☆34Nov 24, 2025Updated 6 months ago
- Official implementation of the paper: "A deeper look at depth pruning of LLMs"☆15Jul 24, 2024Updated last year
- Unofficial implementation of the Ask-LLM paper 'How to Train Data-Efficient LLMs', arXiv:2402.09668.☆12Jun 19, 2024Updated last year
- ☆33Aug 21, 2025Updated 9 months ago
- EVOLVE-VLA: Test-Time Training from Environment Feedback for Vision-Language-Action Models☆86Dec 17, 2025Updated 5 months ago
- Seeing from Another Perspective: Evaluating Multi-View Understanding in MLLMs☆68Mar 22, 2026Updated 2 months ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- 嵌入式作業系統分析與實作 ANALYSIS AND IMPLEMENTATION OF EMBEDDED OPERATING SYSTEMS, 張大緯☆15Jun 22, 2024Updated last year
- 华中科技大学课程作业:华中科技大学电信系微机原理实验代码☆23May 16, 2021Updated 5 years ago
- ☆51Sep 3, 2025Updated 9 months ago
- 🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.☆14Sep 1, 2025Updated 9 months ago
- Cue-CoT: Chain-of-thought Prompting for Responding to In-depth Dialogue Questions with LLMs [EMNLP 2023 Findings]☆24Nov 18, 2023Updated 2 years ago
- WisdoMentor - Series: A LLM for undergraduates | 博导智言(辅助大学生 学习)☆13May 9, 2024Updated 2 years ago
- ☆19Jan 8, 2026Updated 4 months ago
- KnowLA: Enhancing Parameter-efficient Finetuning with Knowledgeable Adaptation, NAACL 2024☆16Jul 29, 2024Updated last year
- 基于Llama3,通过进一步CPT,SFT,ORPO得到的中文版Llama3☆16Apr 24, 2024Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Monte Carlo Tree Search Self-Refine (MCTSr)☆22Jul 6, 2024Updated last year
- This repo is reproduction resources for linear alignment paper, still working☆18May 19, 2024Updated 2 years ago
- Contrastive Learning for Improving ASR Robustness in Spoken Language Understanding☆11May 19, 2023Updated 3 years ago
- Control LLM☆23Apr 6, 2025Updated last year
- ☆35Jan 27, 2026Updated 4 months ago
- ☆55Feb 11, 2025Updated last year
- [ICLR-2026] Official Implementation of our paper "THOR: Tool-Integrated Hierarchical Optimization via RL for Mathematical Reasoning".☆32Feb 26, 2026Updated 3 months ago