geronimi73 / phi2-finetune
☆86 Updated 9 months ago
Related projects
Alternatives and complementary repositories for phi2-finetune
- Just a bunch of benchmark logs for different LLMs ☆113 Updated 3 months ago
- Small and Efficient Mathematical Reasoning LLMs ☆71 Updated 9 months ago
- Evaluating LLMs with CommonGen-Lite ☆84 Updated 7 months ago
- An implementation of Self-Extend, to expand the context window via grouped attention ☆118 Updated 10 months ago
- ☆38 Updated this week
- Code for training & evaluating Contextual Document Embedding models ☆92 Updated this week
- ☆91 Updated last month
- Comprehensive analysis of the difference in performance of QLoRA, LoRA, and full finetunes. ☆81 Updated last year
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute… ☆48 Updated 3 months ago
- Set of scripts to finetune LLMs ☆36 Updated 7 months ago
- Spherical Merge Pytorch/HF format Language Models with minimal feature loss. ☆111 Updated last year
- ☆91 Updated last year
- A pipeline for LLM knowledge distillation ☆77 Updated 3 months ago
- Truth Forest: Toward Multi-Scale Truthfulness in Large Language Models through Intervention without Tuning ☆41 Updated 10 months ago
- ☆48 Updated last year
- Doing simple retrieval from LLM models at various context lengths to measure accuracy ☆97 Updated 7 months ago
- Simple examples using Argilla tools to build AI ☆38 Updated this week
- Testing speed and accuracy of RAG with and without a Cross Encoder Reranker. ☆46 Updated 9 months ago
- ☆74 Updated last week
- Codebase accompanying the Summary of a Haystack paper. ☆71 Updated last month
- Notus is a collection of fine-tuned LLMs using SFT, DPO, SFT+DPO, and/or any other RLHF techniques, while always keeping a data-first app… ☆161 Updated 9 months ago
- Source code of the paper: RetrievalQA: Assessing Adaptive Retrieval-Augmented Generation for Short-form Open-Domain Question Answering [F… ☆57 Updated 5 months ago
- This is the official repository for Inheritune. ☆105 Updated last month
- EvolKit is an innovative framework designed to automatically enhance the complexity of instructions used for fine-tuning Large Language M… ☆169 Updated last week
- ☆68 Updated 2 months ago
- Zeus LLM Trainer is a rewrite of Stanford Alpaca aiming to be the trainer for all Large Language Models ☆69 Updated last year
- QLoRA: Efficient Finetuning of Quantized LLMs ☆77 Updated 6 months ago
- RAFT, or Retrieval-Augmented Fine-Tuning, is a method comprising a fine-tuning and a RAG-based retrieval phase. It is particularly sui… ☆74 Updated 2 months ago
- ☆93 Updated last year
- ☆49 Updated 6 months ago