☆92May 23, 2025Updated 11 months ago
Alternatives and similar repositories for LLM-Post-Training
Users that are interested in LLM-Post-Training are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- AutoHub: A Personal Browser Automation Assistant☆25Jul 30, 2025Updated 9 months ago
- Ensemble Neural Representation Networks☆12Jan 5, 2022Updated 4 years ago
- Official implementation of DEMO3☆67Jul 29, 2025Updated 9 months ago
- Agent-RRM: Exploring Reasoning Reward Model for Agents☆64Mar 17, 2026Updated 2 months ago
- [ACL 2023 findings] Towards Robust Personalized Dialogue Generation via Order-Insensitive Representation Regularization☆17Aug 26, 2023Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- [ICLR 2026] RuleReasoner: Reinforced Rule-based Reasoning via Domain-aware Dynamic Sampling☆38Feb 25, 2026Updated 2 months ago
- Concise tutorials for distributed training using PyTorch☆10Apr 18, 2023Updated 3 years ago
- [TOIS 2024] Target-constrained Bidirectional Planning for Generation of Target-oriented Proactive Dialogue☆13Oct 18, 2025Updated 7 months ago
- pydantic-ai 介紹教學☆16Aug 17, 2025Updated 9 months ago
- Official Implementation of OpenING: A Comprehensive Benchmark for Judging Open-ended Interleaved Image-Text Generation☆41Jul 5, 2025Updated 10 months ago
- Repository having the code and models from the paper: data2vec-aqc: Search for the right Teaching Assistant in the Teacher-Student traini…☆13Mar 18, 2024Updated 2 years ago
- ☆15Apr 6, 2026Updated last month
- 从socket开始实现pop3和smtp客户端,实现邮件编写、发送、接收、阅读、删除等基本功能。并实现简单界面(PyQt5)Start from socket to implement pop3 and smtp clients, to realize the basic …☆12Dec 24, 2023Updated 2 years ago
- Official implementation of the paper: "A deeper look at depth pruning of LLMs"☆15Jul 24, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- [NeurIPS 2024] The official implementation of "Image Copy Detection for Diffusion Models"☆18Oct 1, 2024Updated last year
- 嵌入式作業系統分析與實作 ANALYSIS AND IMPLEMENTATION OF EMBEDDED OPERATING SYSTEMS, 張大緯☆15Jun 22, 2024Updated last year
- Minimal codes for "Task-Oriented Dexterous Hand Pose Synthesis Using Differentiable Grasp Wrench Boundary Estimator [IROS 2024]"☆15Feb 12, 2025Updated last year
- ☆51Sep 3, 2025Updated 8 months ago
- 🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.☆14Sep 1, 2025Updated 8 months ago
- Cue-CoT: Chain-of-thought Prompting for Responding to In-depth Dialogue Questions with LLMs [EMNLP 2023 Findings]☆24Nov 18, 2023Updated 2 years ago
- ☆19Jan 8, 2026Updated 4 months ago
- KnowLA: Enhancing Parameter-efficient Finetuning with Knowledgeable Adaptation, NAACL 2024☆16Jul 29, 2024Updated last year
- 基于Llama3,通过进一步CPT,SFT,ORPO得到的中文版Llama3☆16Apr 24, 2024Updated 2 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- ☆42May 9, 2024Updated 2 years ago
- This is the official code for the paper "Lazy Safety Alignment for Large Language Models against Harmful Fine-tuning" (NeurIPS2024)☆27Sep 10, 2024Updated last year
- Document-Level In-Context Few-Shot Relation Extraction via Pre-Trained Language Models☆12May 15, 2024Updated 2 years ago
- Control LLM☆23Apr 6, 2025Updated last year
- ☆35Jan 27, 2026Updated 3 months ago
- ☆54Feb 11, 2025Updated last year
- ☆18Jun 24, 2025Updated 10 months ago
- An implementation of model parallel autoregressive transformers on GPUs, based on the DeepSpeed library.☆13Jun 7, 2023Updated 2 years ago
- ☆12Aug 6, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- [ICLR 2025] Bridging and Modeling Correlations in Pairwise Data for Direct Preference Optimization☆12Jan 26, 2025Updated last year
- Official implementation of ECCV24 paper: POA☆24Aug 8, 2024Updated last year
- OneEdit: A Neural-Symbolic Collaboratively Knowledge Editing System.☆20Oct 14, 2024Updated last year
- This is the official code for the paper "Safety Tax: Safety Alignment Makes Your Large Reasoning Models Less Reasonable".☆33Mar 11, 2025Updated last year
- Plancraft is a minecraft environment and agent suite to test planning capabilities in LLMs☆27Nov 7, 2025Updated 6 months ago
- Codes for the paper "Multi-task Hierarchical Adversarial Inverse Reinforcement Learning"☆19May 20, 2023Updated 2 years ago
- [COLM 2025] "C3PO: Critical-Layer, Core-Expert, Collaborative Pathway Optimization for Test-Time Expert Re-Mixing"☆20Apr 9, 2025Updated last year