davanstrien / haiku-dpo
Using open source LLMs to build synthetic datasets for direct preference optimization
☆40Updated 8 months ago
Related projects ⓘ
Alternatives and complementary repositories for haiku-dpo
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts☆22Updated 7 months ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆48Updated 3 months ago
- ☆24Updated last year
- ☆38Updated this week
- QLoRA with Enhanced Multi GPU Support☆36Updated last year
- ☆48Updated last year
- Plug-and-play Search Interfaces with Pyserini and Hugging Face☆32Updated last year
- Simple replication of [ColBERT-v1](https://arxiv.org/abs/2004.12832).☆76Updated 7 months ago
- Writing Blog Posts with Generative Feedback Loops!☆42Updated 7 months ago
- Lightweight demos for finetuning LLMs. Powered by 🤗 transformers and open-source datasets.☆64Updated 2 weeks ago
- Tools to make language models a bit easier to use☆30Updated 2 weeks ago
- ☆91Updated last month
- Code and data for "StructLM: Towards Building Generalist Models for Structured Knowledge Grounding" (COLM 2024)☆68Updated 3 weeks ago
- Set of scripts to finetune LLMs☆36Updated 7 months ago
- Official code for ACL 2023 (short, findings) paper "Recursion of Thought: A Divide and Conquer Approach to Multi-Context Reasoning with L…☆42Updated last year
- QLoRA for Masked Language Modeling☆20Updated last year
- Hugging Face Inference Toolkit used to serve transformers, sentence-transformers, and diffusers models.☆48Updated this week
- A library for squeakily cleaning and filtering language datasets.☆45Updated last year
- Simple examples using Argilla tools to build AI☆38Updated this week
- ☆39Updated 2 weeks ago
- A framework for evaluating function calls made by LLMs☆34Updated 3 months ago
- minimal pytorch implementation of bm25 (with sparse tensors)☆88Updated 8 months ago
- Efficient few-shot learning with cross-encoders.☆40Updated 8 months ago
- The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models☆19Updated 9 months ago
- Explore the use of DSPy for extracting features from PDFs 🔎☆32Updated 8 months ago
- Demonstration that finetuning RoPE model on larger sequences than the pre-trained model adapts the model context limit☆63Updated last year
- Just a bunch of benchmark logs for different LLMs☆113Updated 3 months ago
- ☆46Updated 9 months ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆46Updated 2 months ago