davanstrien / haiku-dpo

Using open source LLMs to build synthetic datasets for direct preference optimization

☆68

Alternatives and similar repositories for haiku-dpo

Users who are interested in haiku-dpo are comparing it to the repositories listed below.

AnswerDotAI / ModernBERT-Instruct-mini-cookbook
☆49 Updated 8 months ago
daniel-furman / sft-demos
Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets.
☆78 Updated last year
argilla-io / distilabel-spin-dibt
Repository containing the SPIN experiments on the DIBT 10k ranked prompts
☆24 Updated last year
Knowledgator / FlashDeBERTa
Truly flash implementation of the DeBERTa disentangled attention mechanism.
☆66 Updated 3 weeks ago
louisbrulenaudet / ragoon
High-level library for batched embeddings generation, blazingly fast web-based RAG and quantized indexes processing ⚡
☆67 Updated 11 months ago
deshwalmahesh / PHUDGE
Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…
☆50 Updated last year
QuixiAI / spectrum
☆136 Updated 2 months ago
ChrisHayduk / qlora-multi-gpu
QLoRA with Enhanced Multi GPU Support
☆37 Updated 2 years ago
arcee-ai / DAM
☆55 Updated 11 months ago
IlyasMoutawwakil / py-txi
A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.
☆33 Updated last month
PrithivirajDamodaran / blitz-embed
C++ inference wrappers for running blazing fast embedding services on your favourite serverless like AWS Lambda. By Prithivi Da, PRs welc…
☆23 Updated last year
salesforce / summary-of-a-haystack
Codebase accompanying the Summary of a Haystack paper.
☆79 Updated last year
Pleias / Pleias-RAG-Library
Python library to use Pleias-RAG models
☆63 Updated 5 months ago
Pleias / Quest-Best-Tokens
An introduction to LLM Sampling
☆79 Updated 10 months ago
argilla-io / notus
Notus is a collection of fine-tuned LLMs using SFT, DPO, SFT+DPO, and/or any other RLHF techniques, while always keeping a data-first app…
☆169 Updated last year
Knowledgator / unlimited_classifier
Universal text classifier for generative models
☆25 Updated last year
huggingface / data-is-better-together
Let's build better datasets, together!
☆262 Updated 10 months ago
geronimi73 / phi2-finetune
☆88 Updated last year
pacman100 / peft-codegen-25
☆23 Updated 2 years ago
TIGER-AI-Lab / StructLM
Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)
☆75 Updated last year
minosvasilias / simple_grpo
Simple GRPO scripts and configurations.
☆59 Updated 8 months ago
Hannibal046 / nanoColBERT
Simple replication of [ColBERT-v1](https://arxiv.org/abs/2004.12832).
☆79 Updated last year
akjindal53244 / Arithmo
Small and Efficient Mathematical Reasoning LLMs
☆72 Updated last year
automix-llm / automix
Mixing Language Models with Self-Verification and Meta-Verification
☆109 Updated 10 months ago
Mihaiii / backtrack_sampler
An easy-to-understand framework for LLM samplers that rewind and revise generated tokens
☆145 Updated 8 months ago
weaviate-tutorials / Hurricane
Writing Blog Posts with Generative Feedback Loops!
☆50 Updated last year
s-smits / modernbert-finetune
Fine-tune ModernBERT on a large Dataset with Custom Tokenizer Training
☆68 Updated last week
mungg / FABLES
☆57 Updated last year
sileod / tasksource
Dataset collection and preprocessing framework for NLP extreme multitask learning
☆188 Updated 3 months ago
Pleias / Various-Finetuning
Set of scripts to finetune LLMs
☆38 Updated last year