henrywoo/minichatgpt



minichatgpt - To Train ChatGPT In 5 Minutes

☆166

Alternatives and similar repositories for minichatgpt

Users who are interested in minichatgpt are comparing it to the libraries listed below.


henrywoo / chatllama
ChatLLaMA 📢 Open source implementation for LLaMA-based ChatGPT runnable in a single GPU. 15x faster training process than ChatGPT
☆1,199 · Jan 18, 2025 · Updated last year
henrywoo / pyllama
LLaMA: Open and Efficient Foundation Language Models
☆2,780 · Nov 8, 2023 · Updated 2 years ago
vicgalle / zero-shot-reward-models
ZYN: Zero-Shot Reward Models with Yes-No Questions
☆34 · Aug 15, 2023 · Updated 2 years ago
uhh-lt / storyfinder
Storyfinder - A Browser Plugin and Server Backend for Personalized Knowledge- and Information Management
☆18 · Jan 7, 2026 · Updated 6 months ago
kswamy15 / NLP_Tasks_PyLightning
Showcasing various NLP Downstream tasks Training with pre-trained Language models using Pytorch Lightning
☆13 · Aug 7, 2022 · Updated 3 years ago
markovianhq / convpy
Library for lagged conversion rate estimation. Based on the paper "Modeling Delayed Feedback in Display Advertising", Chapelle, 2014.
☆14 · Mar 21, 2019 · Updated 7 years ago
qwopqwop200 / GPTQ-for-LLaMa
4 bits quantization of LLaMA using GPTQ
☆3,071 · Jul 13, 2024 · Updated 2 years ago
snowolf / alpaca-on-amazon-sagemaker
This is a sample about how to run stanford_alpaca on Amazon SageMaker, only for demo use.
☆14 · Jul 3, 2023 · Updated 3 years ago
Neutralzz / BiLLa
BiLLa: A Bilingual LLaMA with Enhanced Reasoning Ability
☆415 · Jun 1, 2023 · Updated 3 years ago
allenai / hyperdecoders
Codebase for Hyperdecoders https://arxiv.org/abs/2203.08304
☆14 · Oct 11, 2022 · Updated 3 years ago
SWHL / LLaMADemo
🎉LLaMA Demo 7B🎉
☆17 · Mar 23, 2023 · Updated 3 years ago
linhduongtuan / BLOOM-LORA
Due to restriction of LLaMA, we try to reimplement BLOOM-LoRA (much less restricted BLOOM license here https://huggingface.co/spaces/bigs…
☆183 · Jun 18, 2023 · Updated 3 years ago
26hzhang / SequenceToSequence
A seq2seq with attention dialogue/MT model implemented by TensorFlow.
☆11 · Jul 17, 2018 · Updated 8 years ago
lingjzhu / spoken_sent_embedding
Unsupervised spoken sentence embeddings
☆14 · Dec 14, 2022 · Updated 3 years ago
daiyongya / markbert
☆16 · Jul 29, 2022 · Updated 3 years ago
ssbuild / deep_training
deep learning
☆150 · May 6, 2025 · Updated last year
lucasavila00 / LmScript
Controllable Language Model Interactions in TypeScript
☆10 · May 17, 2024 · Updated 2 years ago
zphang / minimal-llama
☆456 · Oct 15, 2023 · Updated 2 years ago
johnsmith0031 / alpaca_lora_4bit
☆533 · Dec 1, 2023 · Updated 2 years ago
Ph0rk0z / text-generation-webui-testing
A fork of textgen that kept some things like Exllama and old GPTQ.
☆22 · Aug 20, 2024 · Updated last year
TrustedLLM / TryMoreGPT
☆25 · May 23, 2023 · Updated 3 years ago
NinedayWang / Self-Attentive-and-Gated-SLU
Implementation of paper accepted by EMNLP 2018 using Pytorch named "A Self-Attentive Model with Gate Mechanism for Spoken Language Unders…
☆17 · Dec 11, 2018 · Updated 7 years ago
Lightning-AI / lit-llama
Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Ad…
☆6,082 · Jul 1, 2025 · Updated last year
jorahn / llama-int8
Quantized inference code for LLaMA models
☆13 · Mar 12, 2023 · Updated 3 years ago
l294265421 / alpaca-rlhf
Finetuning LLaMA with RLHF (Reinforcement Learning with Human Feedback) based on DeepSpeed Chat
☆118 · Jun 5, 2023 · Updated 3 years ago
thunlp / SememeWSD
Code and data for the COLING 2020 paper "Try to Substitute: An Unsupervised Chinese Word Sense Disambiguation Method Based on HowNet"
☆14 · Dec 2, 2020 · Updated 5 years ago
NhanDoV / Kaggle-6-first-projects
Related to NLP, data-processing & RNN :v
☆19 · Mar 12, 2024 · Updated 2 years ago
OFA-Sys / ExpertLLaMA
An opensource ChatBot built with ExpertPrompting which achieves 96% of ChatGPT's capability.
☆298 · May 31, 2023 · Updated 3 years ago
tloen / alpaca-lora
Instruct-tune LLaMA on consumer hardware
☆18,909 · Jul 29, 2024 · Updated last year
Instruction-Tuning-with-GPT-4 / GPT-4-LLM
Instruction Tuning with GPT-4
☆4,332 · Jun 11, 2023 · Updated 3 years ago
binghong-ml / MolEvol
Source code for ICLR 2021 paper: "Molecule Optimization by Explainable Evolution"
☆31 · May 29, 2021 · Updated 5 years ago
OpenBuddy / OpenBuddy
Open Multilingual Chatbot for Everyone
☆1,273 · Jun 8, 2025 · Updated last year
feizc / MLE-LLaMA
Multi-language Enhanced LLaMA
☆301 · Apr 13, 2023 · Updated 3 years ago
fa0311 / bitsandbytes-windows
8-bit CUDA functions for PyTorch
☆13 · May 8, 2023 · Updated 3 years ago
percent4 / tensorflow-serving_4_kashgari
Using tensorflow/serving to deploy kashgari model for time training and predicting.
☆13 · Sep 16, 2019 · Updated 6 years ago
voidful / TextRL
Implementation of ChatGPT RLHF (Reinforcement Learning with Human Feedback) on any generation model in huggingface's transformer (blommz-…
☆564 · Apr 23, 2026 · Updated 2 months ago
dterjek / adversarial_lipschitz_regularization
Adversarial Lipschitz Regularization
☆10 · Jun 10, 2021 · Updated 5 years ago
nebuly-ai / optimate
A collection of libraries to optimise AI model performances
☆8,332 · Jul 22, 2024 · Updated 2 years ago
open-nlplab / fastIE
Information Extraction related tools and models
☆10 · Mar 16, 2023 · Updated 3 years ago