juncongmoo / minichatgpt
minichatgpt - To Train ChatGPT In 5 Minutes
☆167Updated last year
Alternatives and similar repositories for minichatgpt:
Users that are interested in minichatgpt are comparing it to the libraries listed below
- Train llama with lora on one 4090 and merge weight of lora to work as stanford alpaca.☆50Updated last year
- A full pipeline to finetune Vicuna LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human…☆212Updated 9 months ago
- The data processing pipeline for the Koala chatbot language model☆117Updated last year
- ☆124Updated last year
- Due to restriction of LLaMA, we try to reimplement BLOOM-LoRA (much less restricted BLOOM license here https://huggingface.co/spaces/bigs…☆184Updated last year
- A high-throughput and memory-efficient inference and serving engine for LLMs☆130Updated 8 months ago
- Official repository for LongChat and LongEval☆516Updated 9 months ago
- Instruct-tune LLaMA on consumer hardware with shareGPT data☆125Updated last year
- a Fine-tuned LLaMA that is Good at Arithmetic Tasks☆177Updated last year
- Open Instruction Generalist is an assistant trained on massive synthetic instructions to perform many millions of tasks☆208Updated last year
- ☆457Updated last year
- ☆126Updated last year
- A dataset featuring diverse dialogues between two ChatGPT (gpt-3.5-turbo) instances with system messages written by GPT-4. Covering vario…☆166Updated last year
- Code and models for BERT on STILTs☆53Updated last year
- An opensource ChatBot built with ExpertPrompting which achieves 96% of ChatGPT's capability.☆299Updated last year
- This is a text generation method which returns a generator, streaming out each token in real-time during inference, based on Huggingface/…☆96Updated last year
- Multi-language Enhanced LLaMA☆301Updated last year
- ☆268Updated last year
- Simple implementation of using lora form the peft library to fine-tune the chatglm-6b☆85Updated last year
- Implementation of Reinforcement Learning from Human Feedback (RLHF)☆171Updated last year
- A dataset for training/evaluating Question Answering Retrieval models on ChatGPT responses with the possibility to training/evaluating on…☆140Updated last year
- LLaMa Tuning with Stanford Alpaca Dataset using Deepspeed and Transformers☆51Updated last year
- Inference script for Meta's LLaMA models using Hugging Face wrapper☆110Updated last year
- A minimum example of aligning language models with RLHF similar to ChatGPT☆217Updated last year
- A Multi-Turn Dialogue Corpus based on Alpaca Instructions☆169Updated last year
- Leveraging large language models for text-to-SQL synthesis, this project fine-tunes WizardLM/WizardCoder-15B-V1.0 with QLoRA on a custom …☆43Updated last year
- MultilingualShareGPT, the free multi-language corpus for LLM training☆72Updated last year
- 全球首个StableVicuna中文优化版。☆64Updated last year
- fastertransformer for codegeex model☆63Updated last year
- Open efforts to implement ChatGPT-like models and beyond.☆107Updated 7 months ago