BlinkDL / WorldModelLinks

Let us make Psychohistory (as in Asimov) a reality, and accessible to everyone. Useful for LLM grounding and games / fiction / business / finance / governance, and can align agents with human too.

☆40

Alternatives and similar repositories for WorldModel

Users that are interested in WorldModel are comparing it to the libraries listed below

Sorting:

kaiokendev / cutoff-len-is-context-len
Demonstration that finetuning RoPE model on larger sequences than the pre-trained model adapts the model context limit
☆63Updated 2 years ago
ArEnSc / Production-RWKV
This project aims to make RWKV Accessible to everyone using a Hugging Face like interface, while keeping it close to the R and D RWKV bra…
☆65Updated 2 years ago
BlinkDL / LM-Trick-Questions
Here we collect trick questions and failed tasks for open source LLMs to improve them.
☆32Updated 2 years ago
BlinkDL / RWKV-v2-RNN-Pile
RWKV-v2-RNN trained on the Pile. See https://github.com/BlinkDL/RWKV-LM for details.
☆67Updated 3 years ago
NolanoOrg / sparse_quant_llms
SparseGPT + GPTQ Compression of LLMs like LLaMa, OPT, Pythia
☆41Updated 2 years ago
harrisonvanderbyl / rwkvstic
Framework agnostic python runtime for RWKV models
☆147Updated 2 years ago
Zyphra / Zyda_processing
☆39Updated last year
mrsteyk / RWKV-LM-deepspeed
☆43Updated 2 years ago
proger / hippogriff
Griffin MQA + Hawk Linear RNN Hybrid
☆89Updated last year
kyleliang919 / Long-context-transformers
Exploring finetuning public checkpoints on filter 8K sequences on Pile
☆116Updated 2 years ago
codekansas / rwkv
RWKV model implementation
☆38Updated 2 years ago
euclaise / supertrainer2000
☆50Updated last year
recursal / GoldFinch-paper
GoldFinch and other hybrid transformer components
☆45Updated last year
BlinkDL / nanoRWKV
RWKV in nanoGPT style
☆196Updated last year
RWKV / RWKV-infctx-trainer
RWKV infctx trainer, for training arbitary context sizes, to 10k and beyond!
☆147Updated last year
LegallyCoder / mamba-hf
Implementation of the Mamba SSM with hf_integration.
☆56Updated last year
BBuf / RWKV-World-HF-Tokenizer
☆34Updated last year
kernelmachine / cbtm
Code repository for the c-BTM paper
☆108Updated 2 years ago
BlinkDL / modded-nanogpt-rwkv
RWKV-7: Surpassing GPT
☆100Updated last year
crowsonkb / LDLM
Latent Diffusion Language Models
☆70Updated 2 years ago
RWKV / ZeroCoT
https://x.com/BlinkDL_AI/status/1884768989743882276
☆28Updated 7 months ago
LAION-AI / riverbed
Tools for content datamining and NLP at scale
☆44Updated last year
RobertCsordas / moe
Official repository for the paper "Approximating Two-Layer Feedforward Networks for Efficient Transformers"
☆38Updated 5 months ago
sekstini / basedxl
☆18Updated last year
lucidrains / transformer-lm-gan
Explorations into adversarial losses on top of autoregressive loss for language modeling
☆38Updated 9 months ago
RyokoAI / BigKnow2022
BigKnow2022: Bringing Language Models Up to Speed
☆16Updated 2 years ago
SmerkyG / gptcore
Fast modular code to create and train cutting edge LLMs
☆68Updated last year
kyegomez / LM-Infinite
Implementation of "LM-Infinite: Simple On-the-Fly Length Generalization for Large Language Models"
☆40Updated last year
Birch-san / booru-embed
[WIP] Transformer to embed Danbooru labelsets
☆13Updated last year
EleutherAI / improved-t5
Experiments for efforts to train a new and improved t5
☆76Updated last year