shreyansh26 / LLM-SamplingLinks
A collection of various LLM sampling methods implemented in pure Pytorch
☆22Updated 10 months ago
Alternatives and similar repositories for LLM-Sampling
Users that are interested in LLM-Sampling are comparing it to the libraries listed below
Sorting:
- ☆48Updated last year
- Flexible library for merging large language models (LLMs) via evolutionary optimization (ACL 2025 Demo).☆88Updated 2 months ago
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆60Updated last year
- ☆80Updated this week
- One Initialization to Rule them All: Fine-tuning via Explained Variance Adaptation☆44Updated last year
- PyTorch library for Active Fine-Tuning☆93Updated 3 weeks ago
- A toolkit implementing advanced methods to transfer models and model knowledge across tokenizers.☆46Updated 3 months ago
- Supercharge huggingface transformers with model parallelism.☆77Updated 2 months ago
- Trully flash implementation of DeBERTa disentangled attention mechanism.☆66Updated 3 weeks ago
- ☆119Updated last year
- An introduction to LLM Sampling☆79Updated 10 months ago
- [NeurIPS 2024] Goldfish Loss: Mitigating Memorization in Generative LLMs☆92Updated 11 months ago
- Let's build better datasets, together!☆262Updated 10 months ago
- ☆39Updated last year
- Repository for "I am a Strange Dataset: Metalinguistic Tests for Language Models"☆44Updated last year
- Code for NeurIPS 2024 Spotlight: "Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations"☆84Updated 11 months ago
- ☆49Updated 8 months ago
- ☆52Updated last year
- Code for Zero-Shot Tokenizer Transfer☆138Updated 9 months ago
- A repository containing the code for translating popular LLM benchmarks to German.☆30Updated 2 years ago
- Code for PHATGOOSE introduced in "Learning to Route Among Specialized Experts for Zero-Shot Generalization"☆90Updated last year
- ☆52Updated last year
- ☆55Updated 11 months ago
- Aioli: A unified optimization framework for language model data mixing☆27Updated 9 months ago
- ☆31Updated 11 months ago
- Code for reproducing our paper "Not All Language Model Features Are Linear"☆81Updated 10 months ago