bdytx5 / mistral7B_finetune
Fine-tuning Mistral 7B using Hugging Face, Weights and Biases, Choline, and Vast AI
☆37Updated 11 months ago
Related projects:
- ☆83Updated last year
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆48Updated 2 months ago
- Notus is a collection of fine-tuned LLMs using SFT, DPO, SFT+DPO, and/or any other RLHF techniques, while always keeping a data-first app…☆161Updated 8 months ago
- RAFT, or Retrieval-Augmented Fine-Tuning, is a method comprising a fine-tuning and a RAG-based retrieval phase. It is particularly sui…☆60Updated 3 weeks ago
- ASR + diarization model server with speculative decoding☆46Updated 3 months ago
- ☆37Updated 9 months ago
- This repo is for handling Question Answering, especially for Multi-hop Question Answering☆59Updated 9 months ago
- StructuredRAG Benchmarker☆85Updated last week
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets.☆64Updated 2 months ago
- GPT-4 Level Conversational QA Trained In a Few Hours☆53Updated 3 weeks ago
- Finetune Falcon, LLaMA, MPT, and RedPajama on consumer hardware using PEFT LoRA☆99Updated last month
- Small and Efficient Mathematical Reasoning LLMs☆69Updated 7 months ago
- Code repo for "Agent Instructs Large Language Models to be General Zero-Shot Reasoners"☆68Updated last week
- Some simple scripts that I use day-to-day when working with LLMs and Huggingface Hub☆154Updated 11 months ago
- ☆40Updated 6 months ago
- ARAGOG: Advanced RAG Output Grading. Exploring and comparing various Retrieval-Augmented Generation (RAG) techniques on AI research paper…☆91Updated 5 months ago
- an implementation of Self-Extend, to expand the context window via grouped attention☆117Updated 8 months ago
- Here is a Google Colab Notebook for fine-tuning Alpaca Lora (within 3 hours with a 40GB A100 GPU)☆39Updated last year
- Source code of the paper: RetrievalQA: Assessing Adaptive Retrieval-Augmented Generation for Short-form Open-Domain Question Answering [F…☆57Updated 3 months ago
- ☆85Updated 7 months ago
- ☆51Updated last month
- Sakura-SOLAR-DPO: Merge, SFT, and DPO☆114Updated 8 months ago
- This repository implements the chain of verification paper by Meta AI☆151Updated 11 months ago
- Beating the GAIA benchmark with Transformers Agents. 🚀☆56Updated 2 weeks ago
- Track the progress of LLM context utilisation☆53Updated 2 months ago
- ☆64Updated last year
- Evaluating LLMs with CommonGen-Lite☆83Updated 6 months ago
- Learning to Program with Natural Language☆5Updated 9 months ago
- Code for NeurIPS LLM Efficiency Challenge☆52Updated 5 months ago
- Convenient wrapper for fine-tuning and inference of Large Language Models (LLMs) with several quantization techniques (GPTQ, bitsandbytes…☆139Updated 11 months ago