shreyansh26 / LLM-SamplingLinks
A collection of various LLM sampling methods implemented in pure Pytorch
☆23Updated 6 months ago
Alternatives and similar repositories for LLM-Sampling
Users that are interested in LLM-Sampling are comparing it to the libraries listed below
Sorting:
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment☆57Updated 9 months ago
- A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.☆33Updated last year
- ☆61Updated 3 weeks ago
- ☆29Updated 2 months ago
- ☆47Updated 4 months ago
- Code, results and other artifacts from the paper introducing the WildChat-50m dataset and the Re-Wild model family.☆29Updated 2 months ago
- Code and pretrained models for the paper: "MatMamba: A Matryoshka State Space Model"☆59Updated 7 months ago
- ☆35Updated last year
- Train a SmolLM-style llm on fineweb-edu in JAX/Flax with an assortment of optimizers.☆17Updated 3 months ago
- Prune transformer layers☆69Updated last year
- ☆48Updated 5 months ago
- ☆47Updated 10 months ago
- Trully flash implementation of DeBERTa disentangled attention mechanism.☆59Updated last month
- We study toy models of skill learning.☆28Updated 5 months ago
- Official repository for "BLEUBERI: BLEU is a surprisingly effective reward for instruction following"☆23Updated 3 weeks ago
- This repo is based on https://github.com/jiaweizzhao/GaLore☆29Updated 9 months ago
- ☆49Updated last year
- Triton Implementation of HyperAttention Algorithm☆48Updated last year
- Repository for "I am a Strange Dataset: Metalinguistic Tests for Language Models"☆44Updated last year
- PyTorch implementation for MRL☆18Updated last year
- ☆51Updated 7 months ago
- Supercharge huggingface transformers with model parallelism.☆77Updated 8 months ago
- ☆61Updated last week
- Large language models (LLMs) made easy, EasyLM is a one stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Fl…☆75Updated 10 months ago
- Official repo for Learning to Reason for Long-Form Story Generation☆63Updated 2 months ago
- Official repo for the paper PHUDGE: Phi-3 as Scalable Judge. Evaluate your LLMs with or without custom rubric, reference answer, absolute…☆49Updated 11 months ago
- Engineering the state of RNN language models (Mamba, RWKV, etc.)☆32Updated last year
- Official repository for the paper "Approximating Two-Layer Feedforward Networks for Efficient Transformers"☆38Updated 2 weeks ago
- Collection of autoregressive model implementation☆85Updated 2 months ago
- PyTorch library for Active Fine-Tuning☆80Updated 4 months ago