minichatgpt - To Train ChatGPT In 5 Minutes
★166 · Jul 29, 2023 · Updated 2 years ago
Alternatives and similar repositories for minichatgpt
Users that are interested in minichatgpt are comparing it to the libraries listed below.
- ChatLLaMA: Open source implementation for LLaMA-based ChatGPT runnable in a single GPU. 15x faster training process than ChatGPT ★1,201 · Jan 18, 2025 · Updated last year
- LLaMA: Open and Efficient Foundation Language Models ★2,782 · Nov 8, 2023 · Updated 2 years ago
- ZYN: Zero-Shot Reward Models with Yes-No Questions ★35 · Aug 15, 2023 · Updated 2 years ago
- Official implementation of the ACL Findings 2023 paper: Interpretable Automatic Fine-grained Inconsistency Detection in Text Summarizatio… ★14 · Jan 25, 2024 · Updated 2 years ago
- Showcasing various NLP Downstream tasks Training with pre-trained Language models using Pytorch Lightning ★13 · Aug 7, 2022 · Updated 3 years ago
- BiLLa: A Bilingual LLaMA with Enhanced Reasoning Ability ★417 · Jun 1, 2023 · Updated 3 years ago
- This is a sample about how to run stanford_alpaca on Amazon SageMaker, only for demo use. ★14 · Jul 3, 2023 · Updated 2 years ago
- Due to restriction of LLaMA, we try to reimplement BLOOM-LoRA (much less restricted BLOOM license here https://huggingface.co/spaces/bigs… ★183 · Jun 18, 2023 · Updated 2 years ago
- A Java library of animated image-to-image transitions useful for slide shows, photo montages, and UI transitions. ★19 · Jan 28, 2020 · Updated 6 years ago
- Codebase for Hyperdecoders https://arxiv.org/abs/2203.08304 ★14 · Oct 11, 2022 · Updated 3 years ago
- 4 bits quantization of LLaMA using GPTQ ★3,072 · Jul 13, 2024 · Updated last year
- A seq2seq with attention dialogue/MT model implemented by TensorFlow. ★11 · Jul 17, 2018 · Updated 7 years ago
- Duotone image effects in Python ★11 · May 4, 2022 · Updated 4 years ago
- Unsupervised spoken sentence embeddings ★14 · Dec 14, 2022 · Updated 3 years ago
- AI Research Agent is a versatile application that leverages multiple tools to conduct thorough research on any topic. ★12 · Oct 12, 2024 · Updated last year
- ★17 · Nov 27, 2023 · Updated 2 years ago
- ★16 · Jul 29, 2022 · Updated 3 years ago
- deep learning ★150 · May 6, 2025 · Updated last year
- ★456 · Oct 15, 2023 · Updated 2 years ago
- A fork of textgen that kept some things like Exllama and old GPTQ. ★22 · Aug 20, 2024 · Updated last year
- ★25 · May 23, 2023 · Updated 3 years ago
- Implementation of paper accepted by EMNLP 2018 using Pytorch named "A Self-Attentive Model with Gate Mechanism for Spoken Language Unders… ★17 · Dec 11, 2018 · Updated 7 years ago
- An open-source chatbot built with ExpertPrompting which achieves 96% of ChatGPT's capability. ★299 · May 31, 2023 · Updated 3 years ago
- Code and data for the COLING 2020 paper "Try to Substitute: An Unsupervised Chinese Word Sense Disambiguation Method Based on HowNet" ★14 · Dec 2, 2020 · Updated 5 years ago
- Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Ad… ★6,079 · Jul 1, 2025 · Updated 11 months ago
- ★535 · Dec 1, 2023 · Updated 2 years ago
- Finetuning LLaMA with RLHF (Reinforcement Learning with Human Feedback) based on DeepSpeed Chat ★117 · Jun 5, 2023 · Updated 3 years ago
- Quantized inference code for LLaMA models ★13 · Mar 12, 2023 · Updated 3 years ago
- Soft 404 (dead page) detector in Python ★13 · Oct 1, 2018 · Updated 7 years ago
- Open Multilingual Chatbot for Everyone ★1,274 · Jun 8, 2025 · Updated last year
- Scripts to add text to images ★10 · Mar 7, 2019 · Updated 7 years ago
- Multi-language Enhanced LLaMA ★302 · Apr 13, 2023 · Updated 3 years ago
- Create Persona dataset from reddit en movie category comment ★11 · Aug 6, 2021 · Updated 4 years ago
- Code base for internal reward models and PPO training ★24 · Oct 1, 2023 · Updated 2 years ago
- Instruction Tuning with GPT-4 ★4,335 · Jun 11, 2023 · Updated 2 years ago
- Implementation of ChatGPT RLHF (Reinforcement Learning with Human Feedback) on any generation model in huggingface's transformer (blommz-… ★564 · Apr 23, 2026 · Updated last month
- Official code repository for PromptMix: A Class Boundary Augmentation Method for Large Language Model Distillation, EMNLP 2023 ★12 · Dec 13, 2023 · Updated 2 years ago
- Let's Sample Step by Step: Adaptive-Consistency for Efficient Reasoning with LLMs ★41 · Jan 30, 2024 · Updated 2 years ago
- This repository is the official implementation of the TRAC optimizer in Fast TRAC: A Parameter-Free Optimizer for Lifelong Reinforcement … ★35 · May 18, 2026 · Updated 3 weeks ago