iPieter / universal-distillation
🧪 Create domain-adapted language models by distilling from many pre-trained LMs
☆10 Updated 2 years ago
Alternatives and similar repositories for universal-distillation
Users who are interested in universal-distillation are comparing it to the libraries listed below.
- ☆15 Updated 3 months ago
- Repository for Skill Set Optimization ☆14 Updated 11 months ago
- Reference implementation for Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model ☆43 Updated last year
- Minimum Description Length probing for neural network representations ☆18 Updated 5 months ago
- Script for processing OpenAI's PRM800K process supervision dataset into an Alpaca-style instruction-response format ☆27 Updated 2 years ago
- Repository containing the SPIN experiments on the DIBT 10k ranked prompts ☆24 Updated last year
- Official repo for NAACL 2024 Findings paper "LeTI: Learning to Generate from Textual Interactions." ☆65 Updated 2 years ago
- This is official project in our paper: Is Bigger and Deeper Always Better? Probing LLaMA Across Scales and Layers ☆30 Updated last year
- Measuring and Controlling Persona Drift in Language Model Dialogs ☆17 Updated last year
- Official Repository for Task-Circuit Quantization ☆20 Updated last month
- The code implementation of MAGDi: Structured Distillation of Multi-Agent Interaction Graphs Improves Reasoning in Smaller Language Models… ☆35 Updated last year
- This repo contains code for the paper "Psychologically-informed chain-of-thought prompts for metaphor understanding in large language mod… ☆14 Updated 2 years ago
- A Data Source for Reasoning Embodied Agents ☆19 Updated last year
- Tasks for describing differences between text distributions. ☆16 Updated 11 months ago
- Aioli: A unified optimization framework for language model data mixing ☆27 Updated 6 months ago
- Plug-and-play Search Interfaces with Pyserini and Hugging Face ☆32 Updated last year
- ☆13 Updated 7 months ago
- 🦖 X—LLM: Simple & Cutting Edge LLM Finetuning ☆11 Updated last year
- This library supports evaluating disparities in generated image quality, diversity, and consistency between geographic regions. ☆20 Updated last year
- Implementation of the model: "Reka Core, Flash, and Edge: A Series of Powerful Multimodal Language Models" in PyTorch ☆30 Updated 2 weeks ago
- SWIM-IR is a Synthetic Wikipedia-based Multilingual Information Retrieval training set with 28 million query-passage pairs spanning 33 la… ☆48 Updated last year
- Official repository for "BLEUBERI: BLEU is a surprisingly effective reward for instruction following" ☆25 Updated last month
- ☆20 Updated 3 months ago
- ☆14 Updated 9 months ago
- Starbucks: Improved Training for 2D Matryoshka Embeddings ☆21 Updated 2 weeks ago
- ☆28 Updated last week
- Byte-sized text games for code generation tasks on virtual environments ☆19 Updated last year
- [NeurIPS 2023 Main Track] This is the repository for the paper titled "Don’t Stop Pretraining? Make Prompt-based Fine-tuning Powerful Lea… ☆74 Updated last year
- ReBase: Training Task Experts through Retrieval Based Distillation ☆29 Updated 5 months ago
- ☆24 Updated 4 months ago