wbrown / gpt_bpe
GPT2 Byte Pair Encoding implementation in Golang
☆24 · Updated 2 weeks ago
Alternatives and similar repositories for gpt_bpe:
Users interested in gpt_bpe are comparing it to the libraries listed below.
- Demonstration that finetuning a RoPE model on larger sequences than the pre-trained model adapts the model context limit ☆63 · Updated last year
- Hidden Engrams: Long Term Memory for Transformer Model Inference ☆35 · Updated 3 years ago
- RWKV (Receptance Weighted Key Value) is an RNN with Transformer-level performance ☆41 · Updated 2 years ago
- Here we collect trick questions and failed tasks for open source LLMs to improve them. ☆32 · Updated 2 years ago
- Let us make Psychohistory (as in Asimov) a reality, and accessible to everyone. Useful for LLM grounding and games / fiction / business /… ☆40 · Updated 2 years ago
- Binding to transformers in ggml ☆61 · Updated last month
- Backend for the diffusion-ui frontend ☆25 · Updated last year
- RWKV-v2-RNN trained on the Pile. See https://github.com/BlinkDL/RWKV-LM for details. ☆67 · Updated 2 years ago
- Latent Diffusion Language Models ☆68 · Updated last year
- Trying to deconstruct RWKV in understandable terms ☆14 · Updated 2 years ago
- jupyter/colab implementation of stable-diffusion using k_lms sampler, cpu draw manual seeding, and quantize.py fix ☆38 · Updated 2 years ago
- llm sampler that only allows words that are in the bible ☆26 · Updated 5 months ago
- The GeoV model is a large language model designed by Georges Harik and uses Rotary Positional Embeddings with Relative distances (RoPER).… ☆122 · Updated 2 years ago
- GGML implementation of BERT model with Python bindings and quantization. ☆56 · Updated last year
- Majesty Diffusion by @Dango233 and @apolinario (@multimodalart) ☆25 · Updated 2 years ago
- Code repository for the c-BTM paper ☆106 · Updated last year
- tinygrad port of the RWKV large language model. ☆44 · Updated 2 months ago
- A library for squeakily cleaning and filtering language datasets. ☆47 · Updated last year
- SparseGPT + GPTQ Compression of LLMs like LLaMa, OPT, Pythia ☆41 · Updated 2 years ago
- ANE accelerated embedding models! ☆16 · Updated 4 months ago
- Fast inference of Instruct tuned LLaMa on your personal devices. ☆22 · Updated 2 years ago
- A library for incremental loading of large PyTorch checkpoints ☆56 · Updated 2 years ago
- Modified Stanford-Alpaca Trainer for Training Replit's Code Model ☆40 · Updated last year