alinourian / Fine-tuning-Mistral-7b-QA
Fine tuning Mistral-7b with PEFT(Parameter Efficient Fine-Tuning) and LoRA(Low-Rank Adaptation) on Puffin Dataset(multi-turn conversations between GPT-4 and real humans)
☆12Updated last year
Alternatives and similar repositories for Fine-tuning-Mistral-7b-QA:
Users that are interested in Fine-tuning-Mistral-7b-QA are comparing it to the libraries listed below
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)☆76Updated 4 months ago
- In-Context Alignment: Chat with Vanilla Language Models Before Fine-Tuning☆33Updated last year
- Aioli: A unified optimization framework for language model data mixing☆20Updated last month
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts☆24Updated 11 months ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆54Updated 5 months ago
- ☆20Updated last week
- Hugging Face Inference Toolkit used to serve transformers, sentence-transformers, and diffusers models.☆63Updated last month
- Small and Efficient Mathematical Reasoning LLMs☆71Updated last year
- This is a new metric that can be used to evaluate faithfulness of text generated by LLMs. The work behind this repository can be found he…☆31Updated last year
- Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format☆27Updated last year
- LLM reads a paper and produce a working prototype☆48Updated 2 weeks ago
- Nexusflow function call, tool use, and agent benchmarks.☆19Updated 2 months ago
- SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 la…☆46Updated last year
- DSPy program/pipeline inspector widget for Jupyter/VSCode Notebooks.☆31Updated last year
- ☆57Updated 7 months ago
- Improving Text Embedding of Language Models Using Contrastive Fine-tuning☆59Updated 6 months ago
- AnyModal is a Flexible Multimodal Language Model Framework for PyTorch☆82Updated last month
- ☆60Updated last week
- implementation of https://arxiv.org/pdf/2312.09299☆20Updated 7 months ago
- Explore the use of DSPy for extracting features from PDFs 🔎☆38Updated 11 months ago
- The Next Generation Multi-Modality Superintelligence☆71Updated 5 months ago
- An open source replication of the stawberry method that leverages Monte Carlo Search with PPO and or DPO☆28Updated last week
- ☆23Updated 5 months ago
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.☆33Updated 11 months ago
- Using open source LLMs to build synthetic datasets for direct preference optimization☆57Updated 11 months ago
- Repo hosting codes and materials related to speeding LLMs' inference using token merging.☆35Updated 9 months ago
- minimal LLM scripts for 24GB VRAM GPUs. training, inference, whatever☆37Updated 3 weeks ago
- High level library for batched embeddings generation, blazingly-fast web-based RAG and quantized indexes processing ⚡☆64Updated 3 months ago