Repository containing the SPIN experiments on the DIBT 10k ranked prompts
☆23 · Mar 12, 2024 · Updated last year
Alternatives and similar repositories for distilabel-spin-dibt
Users that are interested in distilabel-spin-dibt are comparing it to the libraries listed below
- Open-source repository for the OOPSLA'24 paper "CYCLE: Learning to Self-Refine Code Generation" ☆10 · Mar 8, 2024 · Updated 2 years ago
- [KDD24-ADS] R-Eval: A Unified Toolkit for Evaluating Domain Knowledge of Retrieval Augmented Large Language Models ☆11 · Apr 9, 2024 · Updated last year
- Submodule of evalverse forked from [google-research/instruction_following_eval](https://github.com/google-research/google-research/tree/m… ☆14 · May 4, 2024 · Updated last year
- [ICML25] CODESYNC: Synchronizing Large Language Models with Dynamic Code Evolution at Scale ☆25 · Jul 31, 2025 · Updated 7 months ago
- A bagel, with everything. ☆326 · Apr 11, 2024 · Updated last year
- ☆15 · Dec 22, 2023 · Updated 2 years ago
- Measuring and Controlling Persona Drift in Language Model Dialogs ☆22 · Feb 26, 2024 · Updated 2 years ago
- a version of baby agi using dspy and typed predictors ☆16 · Mar 9, 2024 · Updated last year
- This repository contains the ToolSelect dataset which was used to fine-tune Llama-2 70B for tool selection. ☆22 · Mar 11, 2024 · Updated last year
- InstructIR, a novel benchmark specifically designed to evaluate the instruction following ability in information retrieval models. Our foc… ☆32 · Jun 13, 2024 · Updated last year
- Using open source LLMs to build synthetic datasets for direct preference optimization ☆72 · Feb 29, 2024 · Updated 2 years ago
- Code repo for "Model-Generated Pretraining Signals Improves Zero-Shot Generalization of Text-to-Text Transformers" (ACL 2023) ☆22 · Nov 1, 2023 · Updated 2 years ago
- Reproduction resources for the linear alignment paper (work in progress) ☆18 · May 19, 2024 · Updated last year
- Just a bunch of benchmark logs for different LLMs ☆119 · Jul 28, 2024 · Updated last year
- Code for the arXiv preprint "The Unreasonable Effectiveness of Easy Training Data" ☆48 · Jan 17, 2024 · Updated 2 years ago
- ☆25 · May 7, 2025 · Updated 10 months ago
- Rust Vector for large amounts of data, that does not copy when growing, by using full `mmap`'d pages. ☆22 · Mar 15, 2024 · Updated last year
- Self-Supervised Alignment with Mutual Information ☆20 · May 24, 2024 · Updated last year
- The Batched API provides a flexible and efficient way to process multiple requests in a batch, with a primary focus on dynamic batching o… ☆159 · Jul 14, 2025 · Updated 7 months ago
- Repo for training MLMs, CLMs, or T5-type models on the OLM pretraining data, but it should work with any hugging face text dataset. ☆96 · Feb 9, 2023 · Updated 3 years ago
- A Python library aimed at dissecting and augmenting NER training data. ☆61 · May 11, 2023 · Updated 2 years ago
- ☆50 · Mar 14, 2024 · Updated last year
- PyLate efficient inference engine ☆74 · Jan 7, 2026 · Updated 2 months ago
- MIO: A Foundation Model on Multimodal Tokens ☆34 · Dec 13, 2024 · Updated last year
- Official project for the paper "Is Bigger and Deeper Always Better? Probing LLaMA Across Scales and Layers" ☆31 · Jan 13, 2024 · Updated 2 years ago
- Deita: Data-Efficient Instruction Tuning for Alignment [ICLR2024] ☆589 · Dec 9, 2024 · Updated last year
- ☆36 · Feb 26, 2024 · Updated 2 years ago
- Codebase for fine-tuning Llama2 70B to generate math test questions and answers. ☆11 · Aug 30, 2024 · Updated last year
- An application to demonstrate how you can build a RAG system using pgvector and PostgreSQL ☆28 · May 27, 2024 · Updated last year
- Supercharge huggingface transformers with model parallelism. ☆78 · Jul 23, 2025 · Updated 7 months ago
- awesome synthetic (text) datasets ☆325 · Jan 8, 2026 · Updated 2 months ago
- Minimal (400 LOC) implementation Maximum (multi-node, FSDP) GPT training ☆132 · Apr 17, 2024 · Updated last year
- ☆73 · Apr 19, 2024 · Updated last year
- Q-Probe: A Lightweight Approach to Reward Maximization for Language Models ☆40 · Jun 10, 2024 · Updated last year
- Repository for go shared libraries (for now). ☆11 · Dec 1, 2025 · Updated 3 months ago
- openASO is a project designed to identify regulatory regions of an RNA that can be targeted by antisense oligonucleotides. ☆10 · Sep 30, 2021 · Updated 4 years ago
- Repo for paper "CODIS: Benchmarking Context-Dependent Visual Comprehension for Multimodal Large Language Models". ☆12 · Oct 14, 2024 · Updated last year
- 🧠 A sample app to integrate react-native and open ai ☆11 · Jan 1, 2023 · Updated 3 years ago
- Concurrency library ☆17 · Oct 13, 2024 · Updated last year