harubaru / convogpt

Conversational Language model toolkit for training against human preferences.

☆42

Alternatives and similar repositories for convogpt

Users who are interested in convogpt are comparing it to the libraries listed below.

hitomi-team / shimeji
Platform and API Agnostic library for powering chatbots
☆24 Updated 2 years ago
finetunej / gpt-neo_dungeon
Colab notebooks to run a basic AI Dungeon clone using gpt-neo-2.7B
☆62 Updated 4 years ago
PygmalionAI / data-toolbox
Our data munging code.
☆34 Updated last month
harrisonvanderbyl / rwkvstic
Framework agnostic python runtime for RWKV models
☆147 Updated 2 years ago
PygmalionAI / logbooks
Where we keep our notes about model training runs.
☆16 Updated 2 years ago
zarakiquemparte / zaraki-tools
☆27 Updated 2 years ago
the-crypt-keeper / the-muse
Experimental sampler to make LLMs more creative
☆31 Updated 2 years ago
josephrocca / rwkv-v4-web
BlinkDL's RWKV-v4 running in the browser
☆47 Updated 2 years ago
hitomi-team / sukima
A ready-to-deploy container for implementing an easy to use REST API to access Language Models.
☆66 Updated 2 years ago
VE-FORBRYDERNE / mtj-softtuner
Create soft prompts for fairseq 13B dense, GPT-J-6B and GPT-Neo-2.7B for free in a Google Colab TPU instance
☆28 Updated 2 years ago
TehVenomm / LM_Transformers_BlockMerge
Image Diffusion block merging technique applied to transformers based Language Models.
☆56 Updated 2 years ago
hizkifw / WebChatRWKVstic
ChatGPT-like Web UI for RWKVstic
☆100 Updated 2 years ago
finetunej / transformers
🤗Transformers: State-of-the-art Natural Language Processing for Pytorch and TensorFlow 2.0.
☆56 Updated 3 years ago
BlinkDL / WorldModel
Let us make Psychohistory (as in Asimov) a reality, and accessible to everyone. Useful for LLM grounding and games / fiction / business /…
☆40 Updated 2 years ago
Gryphe / MergeMonster
An unsupervised model merging algorithm for Transformers-based language models.
☆108 Updated last year
AXKuhta / rwkv-onnx-dml
Run ONNX RWKV-v4 models with GPU acceleration using DirectML [Windows], or just on CPU [Windows AND Linux]; Limited to 430M model at this…
☆21 Updated 2 years ago
harrisonvanderbyl / rwkv_chatbot
rwkv_chatbot
☆62 Updated 2 years ago
DeXtmL / bitsandbytes-win-prebuilt
☆75 Updated 3 years ago
aicrumb / doohickey
Doohickey is a stable diffusion tool for technical artists who want to stay up-to-date with the latest developments in the field.
☆40 Updated 2 years ago
eugenepentland / landmark-attention-qlora
Landmark Attention: Random-Access Infinite Context Length for Transformers QLoRA
☆124 Updated 2 years ago
BlinkDL / RWKV-v2-RNN-Pile
RWKV-v2-RNN trained on the Pile. See https://github.com/BlinkDL/RWKV-LM for details.
☆67 Updated 3 years ago
waifu-diffusion / network-trainer
☆27 Updated 2 years ago
DamascusGit / stable-diffusion
k_diffusion wrapper included for k_lms sampling. fixed for notebook.
☆21 Updated 2 years ago
ConiferLabsWA / flan-ul2-alpaca
☆33 Updated 2 years ago
NovelAI / novelai-tokenizer
Sentencepiece based BPE tokenizer for English and Japanese language text.
☆28 Updated last year
AeroScripts / HiddenEngrams
Hidden Engrams: Long Term Memory for Transformer Model Inference
☆35 Updated 4 years ago
Birch-san / diffusers-play
Repository with which to explore k-diffusion and diffusers, and within which changes to said packages may be tested.
☆54 Updated last year
cwhy / rwkv-decon
Trying to deconstruct RWKV in understandable terms
☆14 Updated 2 years ago
mayank31398 / GPTQ-for-SantaCoder
4 bits quantization of SantaCoder using GPTQ
☆51 Updated 2 years ago
0cc4m / GPTQ-for-LLaMa
4 bits quantization of LLMs using GPTQ
☆49 Updated 2 years ago