SRDdev / PaLM-RLHFLinks
Implementation of RLHF (Reinforcement Learning with Human Feedback) on top of the PaLM architecture. Basically ChatGPT but with PaLM
β30Updated 11 months ago
Alternatives and similar repositories for PaLM-RLHF
Users that are interested in PaLM-RLHF are comparing it to the libraries listed below
Sorting:
- Create soft prompts for fairseq 13B dense, GPT-J-6B and GPT-Neo-2.7B for free in a Google Colab TPU instanceβ28Updated 2 years ago
- π€Transformers: State-of-the-art Natural Language Processing for Pytorch and TensorFlow 2.0.β56Updated 3 years ago
- Python examples using the bigcode/tiny_starcoder_py 159M model to generate codeβ45Updated 2 years ago
- The first AI artistβ32Updated 2 years ago
- Automated prompting and scoring framework to evaluate LLMs using updated human knowledge promptsβ109Updated 2 years ago
- A collection of character cards for use in AI Roleplayingβ85Updated 2 years ago
- Drop in replacement for OpenAI, but with Open models.β153Updated 2 years ago
- A web app to experiment with chained prompts faster.β17Updated 2 years ago
- An all-new Language Model That Processes Ultra-Long Sequences of 100,000+ Ultra-Fastβ150Updated last year
- Experimental sampler to make LLMs more creativeβ31Updated 2 years ago
- Reweight GPT - a simple neural network using transformer architecture for next character predictionβ58Updated 2 years ago
- Conversational Language model toolkit for training against human preferences.β42Updated last year
- The Next Generation Multi-Modality Superintelligenceβ70Updated last year
- A cog implementation of MosaicML's MPT-7B-StoryWriter-65k+ Large Language Modelβ57Updated 2 years ago
- Stable diffusion google colab kernelβ10Updated 3 years ago
- A collection of prompts for Llamaβ101Updated 2 years ago
- Command-line script for inferencing from models such as LLaMA, in a chat scenario, with LoRA adaptationsβ33Updated 2 years ago
- WebLLM Chrome Extension Starter Pack.β12Updated 2 years ago
- Embeddings focused small version of Llama NLP modelβ107Updated 2 years ago
- ToK aka Tree of Knowledge for Large Language Models LLM. It's a novel dataset that inspires knowledge symbolic correlation in simple inpuβ¦β54Updated 2 years ago
- A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.β64Updated 2 years ago
- PegasusX: The Future of Multimodal Embeddings π¦ π¦β14Updated last year
- β10Updated 2 years ago
- β128Updated 2 years ago
- Neural search engine for discovering semantically similar Python repositories on GitHubβ26Updated last year
- A library for incremental loading of large PyTorch checkpointsβ56Updated 2 years ago
- Instruct-tuning LLaMA on consumer hardwareβ66Updated 2 years ago
- OpenPipe Reinforcement Learning Experimentsβ32Updated 9 months ago
- Yet Another LLaMA/ALPACA Discord Botβ69Updated 2 years ago
- Smol but mighty language modelβ63Updated 2 years ago