minichatgpt - To Train ChatGPT In 5 Minutes
β168Jul 29, 2023Updated 2 years ago
Alternatives and similar repositories for minichatgpt
Users that are interested in minichatgpt are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ChatLLaMA π’ Open source implementation for LLaMA-based ChatGPT runnable in a single GPU. 15x faster training process than ChatGPTβ1,201Jan 18, 2025Updated last year
- LLaMA: Open and Efficient Foundation Language Modelsβ2,791Nov 8, 2023Updated 2 years ago
- ZYN: Zero-Shot Reward Models with Yes-No Questionsβ35Aug 15, 2023Updated 2 years ago
- Official implementation of the ACL Findings 2023 paper: Interpretable Automatic Fine-grained Inconsistency Detection in Text Summarizatioβ¦β14Jan 25, 2024Updated 2 years ago
- β10Aug 9, 2018Updated 7 years ago
- 1-Click AI Models by DigitalOcean Gradient β’ AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Library for lagged conversion rate estimation. Based on the paper "Modeling Delayed Feedback in Display Advertising", Chapelle, 2014.β14Mar 21, 2019Updated 7 years ago
- BiLLa: A Bilingual LLaMA with Enhanced Reasoning Abilityβ416Jun 1, 2023Updated 2 years ago
- This is a sample about how to run stanford_alpaca on Amazon SageMaker, only for demo use.β14Jul 3, 2023Updated 2 years ago
- Due to restriction of LLaMA, we try to reimplement BLOOM-LoRA (much less restricted BLOOM license here https://huggingface.co/spaces/bigsβ¦β182Jun 18, 2023Updated 2 years ago
- Codebase for Hyperdecoders https://arxiv.org/abs/2203.08304β14Oct 11, 2022Updated 3 years ago
- 4 bits quantization of LLaMA using GPTQβ3,073Jul 13, 2024Updated last year
- A seq2seq with attention dialogue/MT model implemented by TensorFlow.β11Jul 17, 2018Updated 7 years ago
- Unsupervised spoken sentence embeddingsβ14Dec 14, 2022Updated 3 years ago
- β17Nov 27, 2023Updated 2 years ago
- Proton VPN Special Offer - Get 70% off β’ AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Official code for "Seeing Faces in Things: A Model and Dataset for Pareidolia" ECCV 2024β21Sep 25, 2024Updated last year
- deep learningβ150May 6, 2025Updated 10 months ago
- Controllable Language Model Interactions in TypeScriptβ10May 17, 2024Updated last year
- β457Oct 15, 2023Updated 2 years ago
- β25May 23, 2023Updated 2 years ago
- A GPU Cluster Simulator for Distributed Deep Learning Training.β11Jan 15, 2022Updated 4 years ago
- Implementation of paper accepted by EMNLP 2018 using Pytorch named "A Self-Attentive Model with Gate Mechanism for Spoken Language Undersβ¦β17Dec 11, 2018Updated 7 years ago
- An opensource ChatBot built with ExpertPrompting which achieves 96% of ChatGPT's capability.β302May 31, 2023Updated 2 years ago
- Code and data for the COLING 2020 paper "Try to Substitute: An Unsupervised Chinese Word Sense Disambiguation Method Based on HowNet"β14Dec 2, 2020Updated 5 years ago
- End-to-end encrypted cloud storage - Proton Drive β’ AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adβ¦β6,081Jul 1, 2025Updated 8 months ago
- β535Dec 1, 2023Updated 2 years ago
- Quantized inference code for LLaMA modelsβ13Mar 12, 2023Updated 3 years ago
- Finetuning LLaMA with RLHF (Reinforcement Learning with Human Feedback) based on DeepSpeed Chatβ118Jun 5, 2023Updated 2 years ago
- Open Multilingual Chatbot for Everyoneβ1,277Jun 8, 2025Updated 9 months ago
- Multi-language Enhanced LLaMAβ303Apr 13, 2023Updated 2 years ago
- Code base for internal reward models and PPO trainingβ24Oct 1, 2023Updated 2 years ago
- Instruction Tuning with GPT-4β4,336Jun 11, 2023Updated 2 years ago
- Implementation of ChatGPT RLHF (Reinforcement Learning with Human Feedback) on any generation model in huggingface's transformer (blommz-β¦β565May 9, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways β’ AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- Offical code repository for PromptMix: A Class Boundary Augmentation Method for Large Language Model Distillation, EMNLP 2023β12Dec 13, 2023Updated 2 years ago
- Let's Sample Step by Step: Adaptive-Consistency for Efficient Reasoning with LLMsβ40Jan 30, 2024Updated 2 years ago
- Using tensorflow/serving to deploy kashgari model for time training and predicting.β13Sep 16, 2019Updated 6 years ago
- This repository is the official implementation of the TRAC optimizer in Fast TRAC: A Parameter-Free Optimizer for Lifelong Reinforcement β¦β34May 2, 2025Updated 10 months ago
- Adversarial Lipschitz Regularizationβ10Jun 10, 2021Updated 4 years ago
- Instruct-tune LLaMA on consumer hardwareβ18,959Jul 29, 2024Updated last year
- A collection of libraries to optimise AI model performancesβ8,352Jul 22, 2024Updated last year