minichatgpt - To Train ChatGPT In 5 Minutes
★ 166 · Jul 29, 2023 · Updated 2 years ago
Alternatives and similar repositories for minichatgpt
Users that are interested in minichatgpt are comparing it to the libraries listed below.
- ChatLLaMA: Open source implementation for LLaMA-based ChatGPT runnable in a single GPU. 15x faster training process than ChatGPT ★ 1,201 · Jan 18, 2025 · Updated last year
- LLaMA: Open and Efficient Foundation Language Models ★ 2,784 · Nov 8, 2023 · Updated 2 years ago
- ZYN: Zero-Shot Reward Models with Yes-No Questions ★ 35 · Aug 15, 2023 · Updated 2 years ago
- Official implementation of the ACL Findings 2023 paper: Interpretable Automatic Fine-grained Inconsistency Detection in Text Summarizatio… ★ 14 · Jan 25, 2024 · Updated 2 years ago
- Storyfinder - A Browser Plugin and Server Backend for Personalized Knowledge- and Information Management ★ 18 · Jan 7, 2026 · Updated 4 months ago
- BiLLa: A Bilingual LLaMA with Enhanced Reasoning Ability ★ 416 · Jun 1, 2023 · Updated 2 years ago
- This is a sample about how to run stanford_alpaca on Amazon SageMaker, only for demo use. ★ 14 · Jul 3, 2023 · Updated 2 years ago
- Due to restriction of LLaMA, we try to reimplement BLOOM-LoRA (much less restricted BLOOM license here https://huggingface.co/spaces/bigs… ★ 183 · Jun 18, 2023 · Updated 2 years ago
- LLaMA Demo 7B ★ 17 · Mar 23, 2023 · Updated 3 years ago
- 4 bits quantization of LLaMA using GPTQ ★ 3,072 · Jul 13, 2024 · Updated last year
- A seq2seq with attention dialogue/MT model implemented by TensorFlow. ★ 11 · Jul 17, 2018 · Updated 7 years ago
- Unsupervised spoken sentence embeddings ★ 14 · Dec 14, 2022 · Updated 3 years ago
- ★ 16 · Jul 29, 2022 · Updated 3 years ago
- deep learning ★ 150 · May 6, 2025 · Updated last year
- ★ 456 · Oct 15, 2023 · Updated 2 years ago
- A fork of textgen that kept some things like Exllama and old GPTQ. ★ 22 · Aug 20, 2024 · Updated last year
- ★ 25 · May 23, 2023 · Updated 2 years ago
- Implementation of paper accepted by EMNLP 2018 using Pytorch named "A Self-Attentive Model with Gate Mechanism for Spoken Language Unders… ★ 17 · Dec 11, 2018 · Updated 7 years ago
- An opensource ChatBot built with ExpertPrompting which achieves 96% of ChatGPT's capability. ★ 301 · May 31, 2023 · Updated 2 years ago
- Code and data for the COLING 2020 paper "Try to Substitute: An Unsupervised Chinese Word Sense Disambiguation Method Based on HowNet" ★ 14 · Dec 2, 2020 · Updated 5 years ago
- Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Ad… ★ 6,083 · Jul 1, 2025 · Updated 10 months ago
- ★ 536 · Dec 1, 2023 · Updated 2 years ago
- Finetuning LLaMA with RLHF (Reinforcement Learning with Human Feedback) based on DeepSpeed Chat ★ 118 · Jun 5, 2023 · Updated 2 years ago
- Quantized inference code for LLaMA models ★ 13 · Mar 12, 2023 · Updated 3 years ago
- Open Multilingual Chatbot for Everyone ★ 1,276 · Jun 8, 2025 · Updated 11 months ago
- Multi-language Enhanced LLaMA ★ 302 · Apr 13, 2023 · Updated 3 years ago
- Create Persona dataset from reddit en movie category comment ★ 11 · Aug 6, 2021 · Updated 4 years ago
- Code base for internal reward models and PPO training ★ 24 · Oct 1, 2023 · Updated 2 years ago
- Instruction Tuning with GPT-4 ★ 4,337 · Jun 11, 2023 · Updated 2 years ago
- Implementation of ChatGPT RLHF (Reinforcement Learning with Human Feedback) on any generation model in huggingface's transformer (blommz-… ★ 564 · Apr 23, 2026 · Updated 2 weeks ago
- Official code repository for PromptMix: A Class Boundary Augmentation Method for Large Language Model Distillation, EMNLP 2023 ★ 12 · Dec 13, 2023 · Updated 2 years ago
- Using tensorflow/serving to deploy kashgari model for time training and predicting. ★ 13 · Sep 16, 2019 · Updated 6 years ago
- Let's Sample Step by Step: Adaptive-Consistency for Efficient Reasoning with LLMs ★ 41 · Jan 30, 2024 · Updated 2 years ago
- Adversarial Lipschitz Regularization ★ 10 · Jun 10, 2021 · Updated 4 years ago
- High-performance control stack for Embodied AI powered by the OpenClaw ecosystem. Designed for high-dynamic platforms including Humanoids… ★ 41 · Feb 16, 2026 · Updated 2 months ago
- This repository is the official implementation of the TRAC optimizer in Fast TRAC: A Parameter-Free Optimizer for Lifelong Reinforcement … ★ 35 · May 2, 2025 · Updated last year
- Instruct-tune LLaMA on consumer hardware ★ 18,937 · Jul 29, 2024 · Updated last year
- A collection of libraries to optimise AI model performances ★ 8,349 · Jul 22, 2024 · Updated last year
- Information Extraction related tools and models ★ 10 · Mar 16, 2023 · Updated 3 years ago