wbrown / gpt_bpeLinks

GPT2 Byte Pair Encoding implementation in Golang

☆24

Alternatives and similar repositories for gpt_bpe

Users that are interested in gpt_bpe are comparing it to the libraries listed below

Sorting:

nlpodyssey / rwkv
RWKV (Receptance Weighted Key Value) is a RNN with Transformer-level performance
☆41Updated 2 years ago
kaiokendev / cutoff-len-is-context-len
Demonstration that finetuning RoPE model on larger sequences than the pre-trained model adapts the model context limit
☆63Updated 2 years ago
go-skynet / go-ggml-transformers.cpp
Binding to transformers in ggml
☆62Updated last month
AeroScripts / HiddenEngrams
Hidden Engrams: Long Term Memory for Transformer Model Inference
☆35Updated 4 years ago
NolanoOrg / llama-int4-quant
☆26Updated 2 years ago
teknium1 / stanford_alpaca-replit
Modified Stanford-Alpaca Trainer for Training Replit's Code Model
☆40Updated 2 years ago
AXKuhta / rwkv-onnx-dml
Run ONNX RWKV-v4 models with GPU acceleration using DirectML [Windows], or just on CPU [Windows AND Linux]; Limited to 430M model at this…
☆21Updated 2 years ago
sekstini / basedxl
☆18Updated last year
NolanoOrg / sparse_quant_llms
SparseGPT + GPTQ Compression of LLMs like LLaMa, OPT, Pythia
☆41Updated 2 years ago
euclaise / SlimTrainer
Full finetuning of large language models without large memory requirements
☆94Updated last year
BlinkDL / WorldModel
Let us make Psychohistory (as in Asimov) a reality, and accessible to everyone. Useful for LLM grounding and games / fiction / business /…
☆40Updated 2 years ago
cwhy / rwkv-decon
Trying to deconstruct RWKV in understandable terms
☆14Updated 2 years ago
ConiferLabsWA / flan-ul2-alpaca
☆32Updated 2 years ago
ArEnSc / Production-RWKV
This project aims to make RWKV Accessible to everyone using a Hugging Face like interface, while keeping it close to the R and D RWKV bra…
☆64Updated 2 years ago
zarakiquemparte / zaraki-tools
☆27Updated last year
Narsil / bloomserver
☆39Updated 2 years ago
wozeparrot / tinyrwkv
tinygrad port of the RWKV large language model.
☆46Updated 3 months ago
enjalot / latent-sae
Training code for Sparse Autoencoders on Embedding models
☆38Updated 3 months ago
Birch-san / mpt-play
Command-line script for inferencing from models such as MPT-7B-Chat
☆101Updated last year
sfcompute / tinynarrations
A synthetic story narration dataset to study small audio LMs.
☆32Updated last year
geov-ai / geov
The GeoV model is a large langauge model designed by Georges Harik and uses Rotary Positional Embeddings with Relative distances (RoPER).…
☆121Updated 2 years ago
mrsteyk / RWKV-LM-deepspeed
☆42Updated 2 years ago
official-elinas / zeus-llm-trainer
Zeus LLM Trainer is a rewrite of Stanford Alpaca aiming to be the trainer for all Large Language Models
☆69Updated last year
LAION-AI / Anh
Anh - LAION's multilingual assistant datasets and models
☆27Updated 2 years ago
abetlen / program-constrained-language-model-sampling
☆35Updated 2 years ago
aicrumb / doohickey
Doohickey is a stable diffusion tool for technical artists who want to stay up-to-date with the latest developments in the field.
☆40Updated 2 years ago
kir-gadjello / zipslicer
A library for incremental loading of large PyTorch checkpoints
☆56Updated 2 years ago
mayank31398 / GPTQ-for-SantaCoder
4 bits quantization of SantaCoder using GPTQ
☆51Updated 2 years ago
NolanoOrg / InstructLLaMa.cpp
Fast inference of Instruct tuned LLaMa on your personal devices.
☆22Updated 2 years ago
lachlansneff / sparsellama
☆40Updated 2 years ago