alinourian / Fine-tuning-Mistral-7b-QA
Fine tuning Mistral-7b with PEFT(Parameter Efficient Fine-Tuning) and LoRA(Low-Rank Adaptation) on Puffin Dataset(multi-turn conversations between GPT-4 and real humans)
☆13Updated last year
Alternatives and similar repositories for Fine-tuning-Mistral-7b-QA:
Users that are interested in Fine-tuning-Mistral-7b-QA are comparing it to the libraries listed below
- The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models…☆33Updated last year
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆55Updated 6 months ago
- distill chatGPT coding ability into small model (1b)☆28Updated last year
- Small and Efficient Mathematical Reasoning LLMs☆71Updated last year
- Aioli: A unified optimization framework for language model data mixing☆22Updated 2 months ago
- ReBase: Training Task Experts through Retrieval Based Distillation☆28Updated last month
- ☆20Updated 3 weeks ago
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.☆33Updated last year
- In-Context Alignment: Chat with Vanilla Language Models Before Fine-Tuning☆34Updated last year
- ☆48Updated 4 months ago
- ☆24Updated 6 months ago
- Repo hosting codes and materials related to speeding LLMs' inference using token merging.☆35Updated 10 months ago
- ☆60Updated last month
- ☆16Updated 3 weeks ago
- Official implementation for "Law of the Weakest Link: Cross capabilities of Large Language Models"☆42Updated 5 months ago
- A repository for research on medium sized language models.☆76Updated 10 months ago
- Nexusflow function call, tool use, and agent benchmarks.☆19Updated 3 months ago
- A simple GPT-based evaluation tool for multi-aspect, interpretable assessment of LLMs.☆84Updated last year
- "Syntriever: How to Train Your Retriever with Synthetic Data from LLMs" the Nations of the Americas Chapter of the Association for Comput…☆24Updated 3 weeks ago
- ☆42Updated last month
- Codebase accompanying the Summary of a Haystack paper.☆75Updated 6 months ago
- Code for PHATGOOSE introduced in "Learning to Route Among Specialized Experts for Zero-Shot Generalization"☆82Updated last year
- minimal LLM scripts for 24GB VRAM GPUs. training, inference, whatever☆38Updated this week
- Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format☆27Updated last year
- A testbed for agents and environments that can automatically improve models through data generation.☆21Updated 3 weeks ago
- Understanding the correlation between different LLM benchmarks☆29Updated last year
- ☆15Updated last year
- Lottery Ticket Adaptation☆38Updated 4 months ago
- ☆57Updated 8 months ago
- Code Implementation, Evaluations, Documentation, Links and Resources for Min P paper☆28Updated last week