shreyansh26 / LLM-Sampling
A collection of various LLM sampling methods implemented in pure PyTorch
☆26 · Updated last year
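To illustrate the kind of sampling method the repository collects, here is a minimal sketch of temperature-scaled top-k sampling. This is an illustrative example in plain Python, not code from the repo; the function name `top_k_sample` and its parameters are assumptions for the sketch.

```python
import math
import random

def top_k_sample(logits, k=2, temperature=1.0, rng=None):
    """Sample a token index from `logits`, keeping only the top-k entries.

    Illustrative only -- not taken from the LLM-Sampling repo.
    """
    rng = rng or random.Random(0)
    # Scale logits by temperature (lower temperature -> sharper distribution).
    scaled = [l / temperature for l in logits]
    # Keep the k largest logits; mask the rest to -inf so they get zero probability.
    threshold = sorted(scaled, reverse=True)[k - 1]
    masked = [l if l >= threshold else float("-inf") for l in scaled]
    # Numerically stable softmax over the surviving logits.
    m = max(masked)
    exps = [math.exp(l - m) for l in masked]
    total = sum(exps)
    probs = [e / total for e in exps]
    # Draw one index from the resulting categorical distribution.
    return rng.choices(range(len(probs)), weights=probs, k=1)[0]
```

With `k=2`, only the two highest-logit tokens can ever be sampled, which is the core filtering idea that top-p (nucleus), min-p, and similar methods vary on.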
Alternatives and similar repositories for LLM-Sampling
Users interested in LLM-Sampling are comparing it to the libraries listed below.
- ☆48 · Updated last year
- ☆106 · Updated 8 months ago
- A toolkit implementing advanced methods to transfer models and model knowledge across tokenizers. ☆62 · Updated 7 months ago
- Collection of autoregressive model implementations ☆85 · Updated 3 weeks ago
- Official implementation of "GPT or BERT: why not both?" ☆61 · Updated 6 months ago
- PyTorch implementation of the PEER block from the paper "Mixture of A Million Experts", by Xu Owen He at DeepMind ☆135 · Updated 3 months ago
- A fast implementation of T5/UL2 in PyTorch using Flash Attention ☆113 · Updated 3 months ago
- ☆82 · Updated last year
- ☆57 · Updated last month
- Implementation of CALM from the paper "LLM Augmented LLMs: Expanding Capabilities through Composition", out of Google DeepMind ☆179 · Updated last year
- Fast, Modern, and Low-Precision PyTorch Optimizers ☆124 · Updated last month
- QLoRA with Enhanced Multi-GPU Support ☆37 · Updated 2 years ago
- Explorations into the proposal from the paper "Grokfast: Accelerated Grokking by Amplifying Slow Gradients" ☆103 · Updated last year
- Official repository for the paper "SwitchHead: Accelerating Transformers with Mixture-of-Experts Attention" ☆102 · Updated last year
- Anchored Preference Optimization and Contrastive Revisions: Addressing Underspecification in Alignment ☆61 · Updated last year
- Explorations into adversarial losses on top of autoregressive loss for language modeling ☆41 · Updated last month
- Yet another random morning idea to be quickly tried and architecture shared if it works; to allow the transformer to pause for any amount… ☆53 · Updated 2 years ago
- Code for Zero-Shot Tokenizer Transfer ☆142 · Updated last year
- Code for the NeurIPS LLM Efficiency Challenge ☆60 · Updated last year
- Exploring finetuning public checkpoints on filtered 8K sequences from the Pile ☆116 · Updated 2 years ago
- Let's build better datasets, together! ☆269 · Updated last year
- ☆41 · Updated last year
- An introduction to LLM Sampling ☆79 · Updated last year
- Official repo for the paper "PHUDGE: Phi-3 as Scalable Judge". Evaluate your LLMs with or without custom rubric, reference answer, absolute… ☆51 · Updated last year
- Supercharge huggingface transformers with model parallelism. ☆78 · Updated 6 months ago
- Code, results and other artifacts from the paper introducing the WildChat-50m dataset and the Re-Wild model family. ☆34 · Updated 10 months ago
- Truly flash implementation of the DeBERTa disentangled attention mechanism. ☆76 · Updated 2 weeks ago
- Prune transformer layers ☆74 · Updated last year
- Implementation of GateLoop Transformer in PyTorch and JAX ☆92 · Updated last year
- We study toy models of skill learning. ☆31 · Updated last week