minichatgpt - To Train ChatGPT In 5 Minutes
☆166 · Jul 29, 2023 · Updated 2 years ago
Alternatives and similar repositories for minichatgpt
Users that are interested in minichatgpt are comparing it to the libraries listed below.
- ChatLLaMA 📢 Open source implementation for LLaMA-based ChatGPT runnable in a single GPU. 15x faster training process than ChatGPT ☆1,201 · Jan 18, 2025 · Updated last year
- LLaMA: Open and Efficient Foundation Language Models ☆2,787 · Nov 8, 2023 · Updated 2 years ago
- ZYN: Zero-Shot Reward Models with Yes-No Questions ☆35 · Aug 15, 2023 · Updated 2 years ago
- Official implementation of the ACL Findings 2023 paper: Interpretable Automatic Fine-grained Inconsistency Detection in Text Summarizatio… ☆14 · Jan 25, 2024 · Updated 2 years ago
- Storyfinder - A Browser Plugin and Server Backend for Personalized Knowledge- and Information Management ☆18 · Jan 7, 2026 · Updated 3 months ago
- BiLLa: A Bilingual LLaMA with Enhanced Reasoning Ability ☆416 · Jun 1, 2023 · Updated 2 years ago
- This is a sample about how to run stanford_alpaca on Amazon SageMaker, only for demo use. ☆14 · Jul 3, 2023 · Updated 2 years ago
- Due to restriction of LLaMA, we try to reimplement BLOOM-LoRA (much less restricted BLOOM license here https://huggingface.co/spaces/bigs… ☆183 · Jun 18, 2023 · Updated 2 years ago
- Codebase for Hyperdecoders https://arxiv.org/abs/2203.08304 ☆14 · Oct 11, 2022 · Updated 3 years ago
- LLaMA Demo 7B ☆17 · Mar 23, 2023 · Updated 3 years ago
- 4 bits quantization of LLaMA using GPTQ ☆3,072 · Jul 13, 2024 · Updated last year
- A seq2seq with attention dialogue/MT model implemented by TensorFlow. ☆11 · Jul 17, 2018 · Updated 7 years ago
- Unsupervised spoken sentence embeddings ☆14 · Dec 14, 2022 · Updated 3 years ago
- ☆16 · Jul 29, 2022 · Updated 3 years ago
- Official code for "Seeing Faces in Things: A Model and Dataset for Pareidolia" ECCV 2024 ☆21 · Sep 25, 2024 · Updated last year
- deep learning ☆150 · May 6, 2025 · Updated 11 months ago
- Controllable Language Model Interactions in TypeScript ☆10 · May 17, 2024 · Updated last year
- ☆457 · Oct 15, 2023 · Updated 2 years ago
- A fork of textgen that kept some things like Exllama and old GPTQ. ☆22 · Aug 20, 2024 · Updated last year
- ☆25 · May 23, 2023 · Updated 2 years ago
- An opensource ChatBot built with ExpertPrompting which achieves 96% of ChatGPT's capability. ☆302 · May 31, 2023 · Updated 2 years ago
- Code and data for the COLING 2020 paper "Try to Substitute: An Unsupervised Chinese Word Sense Disambiguation Method Based on HowNet" ☆14 · Dec 2, 2020 · Updated 5 years ago
- Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Ad… ☆6,082 · Jul 1, 2025 · Updated 9 months ago
- ☆536 · Dec 1, 2023 · Updated 2 years ago
- Source code for ICLR 2021 paper: "Molecule Optimization by Explainable Evolution" ☆31 · May 29, 2021 · Updated 4 years ago
- Quantized inference code for LLaMA models ☆13 · Mar 12, 2023 · Updated 3 years ago
- Finetuning LLaMA with RLHF (Reinforcement Learning with Human Feedback) based on DeepSpeed Chat ☆118 · Jun 5, 2023 · Updated 2 years ago
- Open Multilingual Chatbot for Everyone ☆1,277 · Jun 8, 2025 · Updated 10 months ago
- Multi-language Enhanced LLaMA ☆302 · Apr 13, 2023 · Updated 3 years ago
- Create Persona dataset from reddit en movie category comment ☆11 · Aug 6, 2021 · Updated 4 years ago
- Instruction Tuning with GPT-4 ☆4,337 · Jun 11, 2023 · Updated 2 years ago
- Code base for internal reward models and PPO training ☆24 · Oct 1, 2023 · Updated 2 years ago
- Implementation of ChatGPT RLHF (Reinforcement Learning with Human Feedback) on any generation model in huggingface's transformer (blommz-… ☆566 · May 9, 2024 · Updated last year
- Official code repository for PromptMix: A Class Boundary Augmentation Method for Large Language Model Distillation, EMNLP 2023 ☆12 · Dec 13, 2023 · Updated 2 years ago
- Adversarial Lipschitz Regularization ☆10 · Jun 10, 2021 · Updated 4 years ago
- Instruct-tune LLaMA on consumer hardware ☆18,945 · Jul 29, 2024 · Updated last year
- A collection of libraries to optimise AI model performances ☆8,344 · Jul 22, 2024 · Updated last year
- Information Extraction related tools and models ☆10 · Mar 16, 2023 · Updated 3 years ago
- This repository contains the data and code for the paper "SideControl: Controlled Open-domain Dialogue Generation via Additive Side Netwo… ☆12 · Dec 1, 2021 · Updated 4 years ago