menhguin / minp_paper
Code Implementation, Evaluations, Documentation, Links and Resources for Min P paper
☆24Updated this week
Alternatives and similar repositories for minp_paper:
Users that are interested in minp_paper are comparing it to the libraries listed below
- Codebase for Instruction Following without Instruction Tuning☆33Updated 5 months ago
- ☆59Updated last month
- A repository for research on medium sized language models.☆77Updated 9 months ago
- The official code repo and data hub of top_nsigma sampling strategy for LLMs.☆22Updated last month
- The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models…☆32Updated last year
- ☆31Updated 8 months ago
- ☆23Updated 5 months ago
- ☆59Updated 10 months ago
- Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format☆27Updated last year
- ☆20Updated 4 months ago
- ☆42Updated 2 months ago
- Implementation of "LM-Infinite: Simple On-the-Fly Length Generalization for Large Language Models"☆42Updated 4 months ago
- Minimal implementation of the Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models paper (ArXiv 20232401.01335)☆29Updated last year
- Official repository of "LiNeS: Post-training Layer Scaling Prevents Forgetting and Enhances Model Merging"☆24Updated 4 months ago
- Mixture of Expert (MoE) techniques for enhancing LLM performance through expert-driven prompt mapping and adapter combinations.☆12Updated last year
- This is the oficial repository for "Safer-Instruct: Aligning Language Models with Automated Preference Data"☆17Updated last year
- Code for the arXiv preprint "The Unreasonable Effectiveness of Easy Training Data"☆46Updated last year
- SELF-GUIDE: Better Task-Specific Instruction Following via Self-Synthetic Finetuning. COLM 2024 Accepted Paper☆29Updated 9 months ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆55Updated 6 months ago
- A simple GPT-based evaluation tool for multi-aspect, interpretable assessment of LLMs.☆84Updated last year
- Repository for the paper: 500xCompressor: Generalized Prompt Compression for Large Language Models☆30Updated 7 months ago
- ☆16Updated 2 months ago
- Data preparation code for CrystalCoder 7B LLM☆44Updated 10 months ago
- Flow of Reasoning: Training LLMs for Divergent Problem Solving with Minimal Examples☆76Updated last week