juncongmoo / minichatgpt
minichatgpt - To Train ChatGPT In 5 Minutes
☆167Updated last year
Related projects ⓘ
Alternatives and complementary repositories for minichatgpt
- ☆120Updated 11 months ago
- Train llama with lora on one 4090 and merge weight of lora to work as stanford alpaca.☆50Updated last year
- A full pipeline to finetune Vicuna LLM with LoRA and RLHF on consumer hardware. Implementation of RLHF (Reinforcement Learning with Human…☆207Updated 5 months ago
- Open efforts to implement ChatGPT-like models and beyond.☆105Updated 3 months ago
- This is a text generation method which returns a generator, streaming out each token in real-time during inference, based on Huggingface/…☆96Updated 8 months ago
- Official codebase for "SelFee: Iterative Self-Revising LLM Empowered by Self-Feedback Generation"☆220Updated last year
- Open Instruction Generalist is an assistant trained on massive synthetic instructions to perform many millions of tasks☆206Updated 10 months ago
- Multi-language Enhanced LLaMA☆301Updated last year
- Due to restriction of LLaMA, we try to reimplement BLOOM-LoRA (much less restricted BLOOM license here https://huggingface.co/spaces/bigs…☆183Updated last year
- a Fine-tuned LLaMA that is Good at Arithmetic Tasks☆174Updated last year
- The data processing pipeline for the Koala chatbot language model☆117Updated last year
- Scalable PaLM implementation of PyTorch☆192Updated last year
- Code and models for BERT on STILTs☆53Updated last year
- Official repository for LongChat and LongEval☆512Updated 5 months ago
- Crosslingual Generalization through Multitask Finetuning☆515Updated last month
- A high-throughput and memory-efficient inference and serving engine for LLMs☆129Updated 4 months ago
- Simple implementation of using lora form the peft library to fine-tune the chatglm-6b☆86Updated last year
- Instruct-tune LLaMA on consumer hardware with shareGPT data☆121Updated last year
- A minimum example of aligning language models with RLHF similar to ChatGPT☆213Updated last year
- train llama on a single A100 80G node using 🤗 transformers and 🚀 Deepspeed Pipeline Parallelism☆207Updated 11 months ago
- ☆453Updated last year
- An opensource ChatBot built with ExpertPrompting which achieves 96% of ChatGPT's capability.☆295Updated last year
- ☆263Updated last year
- The paddle implementation of meta's LLaMA.☆44Updated last year
- CodeGen2 models for program synthesis☆273Updated last year
- OpenAlpaca: A Fully Open-Source Instruction-Following Model Based On OpenLLaMA☆301Updated last year
- Fast Inference Solutions for BLOOM☆560Updated last month
- MultilingualShareGPT, the free multi-language corpus for LLM training☆72Updated last year
- LLaMa Tuning with Stanford Alpaca Dataset using Deepspeed and Transformers☆51Updated last year
- Inference script for Meta's LLaMA models using Hugging Face wrapper☆111Updated last year