recursal / ai-town-rwkv-proxyLinks
Run a large AI town, locally, via RWKV !
☆156Updated last year
Alternatives and similar repositories for ai-town-rwkv-proxy
Users that are interested in ai-town-rwkv-proxy are comparing it to the libraries listed below
Sorting:
- An unsupervised model merging algorithm for Transformers-based language models.☆105Updated last year
- Generative Agents: Interactive Simulacra of Human Behavior☆99Updated last year
- ☆114Updated 6 months ago
- ☆73Updated last year
- Generate Synthetic Data Using OpenAI, MistralAI or AnthropicAI☆222Updated last year
- The one who calls upon functions - Function-Calling Language Model☆36Updated last year
- Let's create synthetic textbooks together :)☆75Updated last year
- A custom AI-town with cats. Based on https://github.com/a16z-infra/AI-town☆140Updated last year
- ☆51Updated last week
- This is our own implementation of 'Layer Selective Rank Reduction'☆239Updated last year
- ☆157Updated 11 months ago
- Landmark Attention: Random-Access Infinite Context Length for Transformers QLoRA☆123Updated 2 years ago
- Low-Rank adapter extraction for fine-tuned transformers models☆173Updated last year
- This project is established for real-time training of the RWKV model.☆49Updated last year
- Scripts to create your own moe models using mlx☆90Updated last year
- Experimental LLM Inference UX to aid in creative writing☆114Updated 6 months ago
- Harnessing the Memory Power of the Camelids☆146Updated last year
- Merge Transformers language models by use of gradient parameters.☆206Updated 10 months ago
- All the world is a play, we are but actors in it.☆50Updated this week
- RWKV infctx trainer, for training arbitary context sizes, to 10k and beyond!☆148Updated 10 months ago
- Full finetuning of large language models without large memory requirements☆94Updated last year
- Provide a way to use the GPT-QLLama model as an API☆43Updated 2 years ago
- entropix style sampling + GUI☆26Updated 7 months ago
- A multimodal, function calling powered LLM webui.☆214Updated 9 months ago
- CHAracter State Management - a generative text adventure☆43Updated 3 weeks ago
- an implementation of Self-Extend, to expand the context window via grouped attention☆119Updated last year
- A guidance language for controlling large language models.☆45Updated 2 years ago
- Model REVOLVER, a human in the loop model mixing system.☆33Updated last year
- GPT-2 small trained on phi-like data☆66Updated last year
- Image Diffusion block merging technique applied to transformers based Language Models.☆54Updated 2 years ago