jxmorris12 / gptzipLinks

Losslessly encode text natively with arithmetic coding and HuggingFace Transformers

☆77

Alternatives and similar repositories for gptzip

Users that are interested in gptzip are comparing it to the libraries listed below

Sorting:

xjdr-alt / muzero_sketch
☆40Updated last year
joshuacnf / Ctrl-G
☆104Updated 10 months ago
JD-P / minihf
MiniHF is an inference, human preference data collection, and fine-tuning tool for local language models. It is intended to help the user…
☆181Updated 2 weeks ago
notarussianteenager / srf-attention
Simplex Random Feature attention, in PyTorch
☆74Updated 2 years ago
idiap / sigma-gpt
σ-GPT: A New Approach to Autoregressive Models
☆69Updated last year
jxmorris12 / embzip
lossily compress representation vectors using product quantization
☆59Updated 3 weeks ago
bloc97 / DeMo
DeMo: Decoupled Momentum Optimization
☆197Updated 11 months ago
NousResearch / StripedHyenaTrainer
☆62Updated last year
HazyResearch / cartridges
Storing long contexts in tiny caches with self-study
☆216Updated last month
lucidrains / grokfast-pytorch
Explorations into the proposal from the paper "Grokfast, Accelerated Grokking by Amplifying Slow Gradients"
☆103Updated 10 months ago
JoeLi12345 / nGPT
an open source reproduction of NVIDIA's nGPT (Normalized Transformer with Representation Learning on the Hypersphere)
☆107Updated 8 months ago
SinatrasC / entropix-smollm
smolLM with Entropix sampler on pytorch
☆150Updated last year
xjdr-alt / llmri
look how they massacred my boy
☆63Updated last year
sumo43 / loopvlm
run paligemma in real time
☆133Updated last year
cloneofsimo / min-max-gpt
Minimal (400 LOC) implementation Maximum (multi-node, FSDP) GPT training
☆132Updated last year
facebookresearch / llm-speedrunner
The Automated LLM Speedrunning Benchmark measures how well LLM agents can reproduce previous innovations and discover new ones in languag…
☆111Updated last month
Zyphra / tree_attention
Tree Attention: Topology-aware Decoding for Long-Context Attention on GPU clusters
☆130Updated 11 months ago
tairov / QStarLearning.mojo
☆112Updated last year
Amplify-Partners / annotation-reading-list
A reading list of relevant papers and projects on foundation model annotation
☆28Updated 8 months ago
joey00072 / ohara
Collection of autoregressive model implementation
☆86Updated 6 months ago
KhoomeiK / complexity-scaling
gzip Predicts Data-dependent Scaling Laws
☆34Updated last year
Pleias / Quest-Best-Tokens
An introduction to LLM Sampling
☆79Updated 11 months ago
xjdr-alt / simple_transformer
Simple Transformer in Jax
☆139Updated last year
leloykun / modded-nanogpt
NanoGPT (124M) quality in 2.67B tokens
☆28Updated 2 months ago
epfml / DenseFormer
☆82Updated last year
RiddleHe / llm-interp
A collection of lightweight interpretability scripts to understand how LLMs think
☆66Updated last week
JoshuaPurtell / SmallBench
Small, simple agent task environments for training and evaluation
☆19Updated last year
Mihaiii / backtrack_sampler
An easy-to-understand framework for LLM samplers that rewind and revise generated tokens
☆145Updated 9 months ago
s-smits / grpo-optuna
Optimizing Causal LMs through GRPO with weighted reward functions and automated hyperparameter tuning using Optuna
☆58Updated last month
Alex-Gurung / ReasoningNCP
Official repo for Learning to Reason for Long-Form Story Generation
☆72Updated 7 months ago